Story Detail of id 42473423 | Liveview Hacker News

wilg1 day ago | on: OpenAI O3 breakthrough high score on ARC-AGI-PUB

fun! the benchmarks are so interesting because real world use is so variable. sometimes 4o will nail a pretty difficult problem, other times o1 pro mode will fail 10 times on what i would think is a pretty easy programming problem and i waste more time trying to do it with ai

#visit	11088037
#session	44988
#live-session	0