Hacker News
If these were in the internal evals, the output would be much better. The 4.8 pelicans are pretty meh.