Story Detail of id 47349814 | Liveview Hacker News

kqr11 hours ago | on: Are LLM merge rates not getting better?

With one term it gets more robust in the face of excluding endpoints when constructing the jackknife train/test split, I think. But you're right, it does sound fishy.

fluidcruft11 hours ago | parent

What the post is describing is just ANOVA. If removing a category improves the overall fit then fitting the two terms independently has the same optimal solution (with the two independent terms found to be identical). MSE never increases when adding a category.

This is why you have to reach to things that penalize adding parameters to models when running model comparisons.

loading story #47350824

#visit	13,082,452
#session	74,665
#live-session	0