Hacker News new | past | comments | ask | show | jobs | submit
You're correct that this work is not very applicable for LLMs and that the focus here is primarily on latency.