It comes from the Jacobian, which you can get from autodiff. It measures how much distortion the function introduces and normalizes for it, so that you can integrate correctly without blowing up gradients.
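To make that concrete, here is a rough sketch, assuming the term in question is the |det J| factor from the change-of-variables formula. The map `f`, the `Dual` class, and the helper names are all made up for illustration; in practice you would likely reach for a library routine such as `jax.jacfwd` instead of hand-rolling forward-mode autodiff like this.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class Dual:
    """Dual number val + eps*dot; carries a derivative alongside a value."""
    val: float
    dot: float = 0.0

    def __add__(self, other):
        other = _lift(other)
        return Dual(self.val + other.val, self.dot + other.dot)

    __radd__ = __add__

    def __mul__(self, other):
        other = _lift(other)
        # Product rule: (uv)' = u'v + uv'
        return Dual(self.val * other.val,
                    self.dot * other.val + self.val * other.dot)

    __rmul__ = __mul__


def _lift(x):
    """Promote a plain number to a Dual with zero derivative."""
    return x if isinstance(x, Dual) else Dual(float(x))


def f(x, y):
    # Hypothetical map (an assumption, not from the thread):
    # stretch x by 2, stretch y by 3 and shear it by x**2.
    return (2.0 * x, 3.0 * y + x * x)


def jacobian(fn, point):
    """Forward-mode Jacobian: one pass per input, seeding that input's dot with 1."""
    n = len(point)
    cols = []
    for j in range(n):
        args = [Dual(p, 1.0 if i == j else 0.0) for i, p in enumerate(point)]
        cols.append([_lift(o).dot for o in fn(*args)])
    # cols[j][i] holds d f_i / d x_j; transpose so rows index outputs.
    return [[cols[j][i] for j in range(n)] for i in range(len(cols[0]))]


def det2(m):
    """Determinant of a 2x2 matrix given as nested lists."""
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]


J = jacobian(f, (0.5, -1.0))
volume_scale = abs(det2(J))               # how much f stretches area near the point
log_volume_scale = math.log(volume_scale) # the log form is what usually gets summed
```

At the point (0.5, -1.0) the Jacobian is [[2, 0], [1, 3]], so the local area scaling is 6. Dividing a density by that factor (or subtracting its log) is the normalization that keeps the integral equal to 1 after the transform.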
I mean the whole thing sounds like a deep neural network…