Story Detail of id 48388192 | Liveview Hacker News

mchusma1 day ago | on: Gemma 4 12B: A unified, encoder-free multimodal model

Gemma 4 31b outperformed Gemini 3.1 Flash-Lite in our app benchmarks (agentic tool use via api in our application as a part of various workflows). But google won't let you pay to use Gemma models, you have to go elsewhere, I think this may be because it would cannabilize Flash-lite.

dTal20 hours ago | parent | next

Curious logic. Does Google want you to use it or not? Do they want to be paid for tokens or not? why segregate open and closed?

It's not parameter size - there is apparently such a thing as "Gemini Nano", which famously is downloaded automatically by Chrome. How similar is it to Gemma E4B? And how strange - you have the weights, but you don't "have" them?

verdverm1 day ago | parent

You can actually get the gemma-4 models on a per-token API basis, you just have to click some extra buttons (in GCP). Not the same for other open weight models. For those they make you run your own hardware.

Use OpenCode Go instead: https://opencode.ai/go

loading story #48389672

#visit	13,564,130
#session	74,665
#live-session	0