
thot_experiment 1 day ago | on: Gemma 4 12B: A unified, encoder-free multimodal model

I've always found the Gemma models to vastly under-perform on vision tasks compared to Qwen so that's nothing new.

The Qwen series adopted vision wayyy earlier than anyone else. No idea why the other labs were sleeping on it, but Qwen had about 2 years of experimentation without any competition.
