Hacker News
How many times did you try? The same model run multiple times can produce both very good and very bad results. In my benchmark, even 10 runs are often not enough to tell for sure whether one model is better than another.
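To put a rough number on that, here is a minimal simulation sketch. It assumes each run is an independent pass/fail with a fixed success rate per model, which is a simplification and not how my benchmark actually scores things. The rates 0.6 and 0.5 and the function name `prob_better_model_wins` are made up for illustration.

```python
import random

def prob_better_model_wins(p_a, p_b, runs, trials=5000, seed=0):
    """Estimate how often model A (true success rate p_a) scores strictly
    higher than model B (true rate p_b) when each is run `runs` times.

    Assumption: every run is an independent pass/fail coin flip.
    """
    rng = random.Random(seed)  # fixed seed so the estimate is repeatable
    wins = 0
    for _ in range(trials):
        # Count passes for each model over `runs` simulated attempts.
        score_a = sum(rng.random() < p_a for _ in range(runs))
        score_b = sum(rng.random() < p_b for _ in range(runs))
        if score_a > score_b:
            wins += 1
    return wins / trials

if __name__ == "__main__":
    # Hypothetical models: A is truly better (0.6 vs 0.5).
    for runs in (10, 100):
        share = prob_better_model_wins(0.6, 0.5, runs)
        print(f"{runs} runs each: A looks better in {share:.0%} of comparisons")
```

Under these assumptions the better model comes out ahead far less reliably at 10 runs than at 100, which is the effect described above. Real runs may be correlated or scored on a finer scale, so treat this as an illustration only.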