The Llama distilled version at Q4_K_M should be reasonably fast and good!