Story Detail of id 48505461 | Liveview Hacker News

kouteiheika4 hours ago | on: Kimi K2.7-Code: open-source coding model with better token efficiency

The model is natively quantized (i.e. it was trained that way in the first place, so this is not a post-training quantization which degrades performance).

knollimar2 hours ago | parent | next

Isn't it not completely quantized? I thought there were some dense parts but most is int4?

theanonymousone3 hours ago | parent

But the huggingface link mentions BF16, F16, and I32?

loading story #48506742

loading story #48507268

#visit	13,787,248
#session	74,665
#live-session	0