The quadratic sandwich
https://fedemagnani.github.io/math/2026/04/08/the-quadratic-sandwich.html
Great visualizations. Really enjoyed having a well-written example where mathematical proofs directly help with understanding a practical application.
I wonder what would happen to this analysis if a momentum term were added to the gradient descent. It seems that it would fix the specific failure modes in the examples, but is there a corresponding mathematical way of categorizing what kinds of functions can(not) be quickly optimized with GD + momentum?
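For anyone who wants to poke at this numerically, here is a minimal sketch comparing plain gradient descent with heavy-ball (Polyak) momentum. Everything in it is an assumption for illustration rather than something taken from the article: the test function is a two-dimensional quadratic with curvatures `MU = 1` and `L = 100`, the start point is `(1, 1)`, and the momentum parameters are the textbook tuning for strongly convex quadratics. On that class the known contraction factor improves from roughly (κ - 1)/(κ + 1) to (√κ - 1)/(√κ + 1), where κ = L/μ; the sketch says nothing about non-quadratic functions.

```python
import math

# Assumed test problem: f(x) = 0.5 * (MU * x[0]**2 + L * x[1]**2),
# a quadratic with condition number L / MU = 100 and minimum at the origin.
MU, L = 1.0, 100.0


def grad(x):
    """Gradient of the assumed quadratic."""
    return (MU * x[0], L * x[1])


def gradient_descent(x0, steps, lr):
    """Plain gradient descent: x <- x - lr * grad(x)."""
    x = x0
    for _ in range(steps):
        g = grad(x)
        x = (x[0] - lr * g[0], x[1] - lr * g[1])
    return x


def heavy_ball(x0, steps, lr, beta):
    """Heavy-ball momentum: v <- beta * v - lr * grad(x); x <- x + v."""
    x = x0
    v = (0.0, 0.0)  # start at rest
    for _ in range(steps):
        g = grad(x)
        v = (beta * v[0] - lr * g[0], beta * v[1] - lr * g[1])
        x = (x[0] + v[0], x[1] + v[1])
    return x


def distance_to_minimum(x):
    """Euclidean distance from x to the minimizer (the origin)."""
    return math.hypot(x[0], x[1])


x0 = (1.0, 1.0)
steps = 100

# Gradient descent with the standard step size 1 / L.
gd_err = distance_to_minimum(gradient_descent(x0, steps, lr=1.0 / L))

# Textbook heavy-ball tuning for a quadratic with curvatures in [MU, L].
sqrt_l, sqrt_mu = math.sqrt(L), math.sqrt(MU)
hb_lr = 4.0 / (sqrt_l + sqrt_mu) ** 2
hb_beta = ((sqrt_l - sqrt_mu) / (sqrt_l + sqrt_mu)) ** 2
hb_err = distance_to_minimum(heavy_ball(x0, steps, lr=hb_lr, beta=hb_beta))

print(f"gradient descent error after {steps} steps: {gd_err:.3e}")
print(f"heavy-ball error after {steps} steps:       {hb_err:.3e}")
```

The tuned momentum run depends on knowing both `MU` and `L`, which is part of why the question of which function classes benefit is not trivial: the parameters that work well here are derived from the quadratic's spectrum.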