In OpenRouter, there is an "int4" tag for Moonshot provider of Kimi K2. 7 Code. Isn't that too low, particularly coming from the very developer of the model? Os that a mistake? How is it in their direct API offer?
The model is natively quantized (i.e. it was trained that way in the first place, so this is not a post-training quantization which degrades performance).
loading story #48507171
loading story #48506224