We know exactly why: floating-point operations aren't associative, but GPU kernels are scheduled as if they were, and that scheduling isn't deterministic, so the same sum can be evaluated in a different order from run to run. Forcing a strict, fixed order hurts performance, so they don't do it.
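A quick sketch of the non-associativity part (with arbitrary example values, not anything a real kernel computes): summing the same three numbers in two different orders already gives different results in double precision.

```python
# Floating-point addition is not associative, so a GPU that reduces the
# same terms in a different order across runs can produce different results.
a, b, c = 0.1, 0.2, 0.3  # arbitrary illustrative values

left = (a + b) + c   # one possible reduction order
right = a + (b + c)  # another order a scheduler might pick

print(left, right, left == right)
# The two orders disagree in the last bit: 0.6000000000000001 vs 0.6
```

Scale that up to millions of parallel partial sums and you get run-to-run output variation even at temperature 0.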
Cool, thanks a lot for the explanation. Makes sense.