Javascript is not enabled. This site can still works but it'll be more interactive when javascript is enabled.
loading...
Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
julianlam
1 day ago
|
on: Gemma 4 12B: A unified, encoder-free multimodal model
Last time I tried Gemma 4 (26B-A4B) its memory usage would balloon and consume all of my swap until my machine died.
Qwen 3.6 on the other hand barely uses any memory at all for its KV cache.
reply
verdverm
1 day ago
|
parent
Turns out when you block people from the best and biggest hardware, they get innovative. It reminds me of the Pentium days when everyone was shipping inefficient programs because the processor would be better next year.
reply
iknowstuff
23 hours ago
|
root
|
parent
we never stopped doing that!
reply