Story Detail of id 42473437 | Liveview Hacker News

neom 1 day ago | on: OpenAI O3 breakthrough high score on ARC-AGI-PUB

Why would they give a cost estimate per task on their low compute mode but not their high mode?

The "low compute" mode: uses 6 samples per task, uses 33M tokens for the semi-private eval set, costs $17-20 per task, and achieves 75.7% accuracy on the semi-private eval.

The "high compute" mode: uses 1024 samples per task (172x more compute), cost data was withheld at OpenAI's request, and achieves 87.5% accuracy on the semi-private eval.

Can we just extrapolate to roughly $3k per task on high compute? (Wondering if the cost data was withheld because that isn't the case.)
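For what it's worth, here is the back-of-the-envelope version of that extrapolation as a quick Python sketch. It assumes cost per task scales linearly with compute, which is my assumption and not something the published figures confirm. The function and variable names are made up for illustration. It tries both the quoted 172x multiplier and the raw sample ratio (1024 / 6), since the two differ slightly.

```python
# Rough extrapolation of high-compute cost per task from the low-compute figures.
# ASSUMPTION: cost scales linearly with compute; the real pricing may not.

LOW_COST_PER_TASK = (17.0, 20.0)     # USD range quoted for low compute mode
QUOTED_COMPUTE_MULTIPLIER = 172      # "172x more compute", as quoted
LOW_SAMPLES, HIGH_SAMPLES = 6, 1024  # samples per task in each mode


def extrapolate(cost_range, multiplier):
    """Scale a (low, high) cost range by a compute multiplier."""
    low, high = cost_range
    return (low * multiplier, high * multiplier)


by_quoted_multiplier = extrapolate(LOW_COST_PER_TASK, QUOTED_COMPUTE_MULTIPLIER)
by_sample_ratio = extrapolate(LOW_COST_PER_TASK, HIGH_SAMPLES / LOW_SAMPLES)

for label, (low, high) in [
    ("quoted 172x multiplier", by_quoted_multiplier),
    ("sample ratio 1024/6", by_sample_ratio),
]:
    print(f"{label}: ${low:,.0f} to ${high:,.0f} per task")
```

Either way the estimate lands around $2.9k to $3.4k per task, so "$3kish" holds up only if the linear-scaling assumption does.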
