Story Detail of id 48246391 | Liveview Hacker News

laGrenouille10 hours ago | on: The quadratic sandwich

Great visualizations. Really enjoyed having a well-written example where mathematical proofs directly help with understanding a practical application.

I wonder what would happen with this analysis if a momentum term was added to the gradient descent. It seems that it would fix the specific failure modes in the examples, but I wonder if there's a corresponding mathematical way of categorizing what kinds of functions can(not) be quickly optimized with GD + momentum.

#visit	13,335,415
#session	74,665
#live-session	0