> GPT-5.5 and DeepSeek V4 Pro are two of the clearest hallucination leaders, despite being absolutely huge. Because of their immense size they simply did not learn how to say “I don’t know” or recognize intricate logical and technical fallacies.
This implies that bigger models are more likely to hallucinate? That doesn't match my experience.