Story Detail of id 48467888 | Liveview Hacker News

You're correct that this work is not very applicable for LLMs and that the focus here is primarily on latency.