Huh? It's a benchmark by Cognition, which (1) is building its own models and (2) offers models from all providers, and thus has an incentive to avoid hyping up any one too much.
But you can just say shit now. Tokens might not be too cheap to meter but saying shit increasingly is.