Story Detail of id 48314481 | Liveview Hacker News

fastball19 hours ago | on: Claude Opus 4.8

Not sure I follow. Anthropic included benchmarks where GPT 5.5 outperforms Claude 4.8. Sure maybe that is a strategic error, but that doesn't seems to indicate benchmarks can't be trusted (I personally don't trust them, but not because of this).

#visit	13,436,457
#session	74,665
#live-session	0