Gemma 4 QAT models: Optimizing compression for mobile and laptop efficiency

https://blog.google/innovation-and-ai/technology/developers-tools/quantization-aware-training-gemma-4/

405theanonymousone | 1 week ago | 130 | HN

loading story #48416486

loading story #48415688

loading story #48418820

loading story #48417969

loading story #48415466

loading story #48421755

loading story #48417759

loading story #48422388

loading story #48416638

loading story #48415492

loading story #48420724

loading story #48419995

loading story #48436374

loading story #48441450

loading story #48416138

loading story #48431797

loading story #48416080

loading story #48419848

loading story #48426414

loading story #48417007

loading story #48420244

loading story #48416404

loading story #48422335

loading story #48416661

loading story #48420049

loading story #48415574

loading story #48416272

loading story #48416376

loading story #48416621

steno1321 week ago | parent

I don't get this obsession with smaller models. I've been using Claude and GPT models for years and have had zero issues with them.

I see absolutely no benefit to me as a end user for a local model which is going to take up more of my CPU and memory and slow down my machine. I almost always have Internet and if I don't then not having access to a AI model is the least of my concerns.

loading story #48417879

loading story #48426594

loading story #48417964

loading story #48417884

loading story #48417888

loading story #48418045

#visit	13,844,335
#session	74,665
#live-session	0