Hacker News
The point of doing local inference with huge models stored on an SSD is to do it for free, even if it's slow.