Story Detail of id 47478173 | Liveview Hacker News

haomingkoo6 hours ago | on: Flash-MoE: Running a 397B Parameter Model on a Laptop

Really interesting approach. Curious how the 2-bit quantization affects the model's reasoning ability on longer chains of thought vs shorter prompts. The benchmarkslook solid but real-world usage seems like a different story based on the comments here.

#visit	13,229,177
#session	74,665
#live-session	0