Does anyone know what kind of hardware is required to run it locally? There are instructions, but nothing about the hardware requirements.
They released a bunch of different sized models and there are already quantized versions showing up on HF.
https://huggingface.co/unsloth/DeepSeek-R1-Distill-Llama-8B-... for example has versions that are 3GB, 4GB, 5GB, 8GB and 16GB.
That 3GB one might work on a CPU machine with 4GB of RAM.
To get good performance you'll want a GPU with that much free VRAM, or an Apple Silicon machine with that much RAM.
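If you want to try one of those GGUF quants, a minimal llama-cpp-python sketch looks roughly like this (the filename and settings are placeholders, not the actual files in that repo):

    # Minimal sketch: load a small GGUF quant with llama-cpp-python.
    # The filename below is a placeholder; point it at whichever quant you downloaded.
    from llama_cpp import Llama

    llm = Llama(
        model_path="DeepSeek-R1-Distill-Llama-8B-Q3_K_M.gguf",  # placeholder path
        n_ctx=4096,       # bigger context window = more RAM for the KV cache
        n_gpu_layers=-1,  # offload all layers to GPU; set 0 for CPU-only
    )

    out = llm("Explain quantization in one sentence.", max_tokens=128)
    print(out["choices"][0]["text"])

On a CPU-only box you'd set n_gpu_layers=0 and expect it to run, just slowly.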
DeepSeek V3 required about 1 TB of VRAM/RAM, so roughly 10 A100s.
There are various ways to run it with less VRAM if you're OK with much worse latency and throughput.
Edit: sorry, this is for V3; the distilled models can be run on consumer-grade GPUs.
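One common way to trade speed for VRAM (a sketch, the model ID and settings here are illustrative, not an official recipe): load the weights in 4-bit and let accelerate spill whatever doesn't fit onto CPU RAM.

    # Sketch: 4-bit quantization + automatic CPU offload via transformers/accelerate.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

    model_id = "deepseek-ai/DeepSeek-R1-Distill-Llama-8B"  # illustrative; any causal LM works

    tok = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id,
        quantization_config=BitsAndBytesConfig(
            load_in_4bit=True,
            bnb_4bit_compute_dtype=torch.bfloat16,
        ),
        device_map="auto",  # puts what fits on the GPU, spills the rest to CPU RAM
    )

    inputs = tok("Why does offloading hurt throughput?", return_tensors="pt").to(model.device)
    print(tok.decode(model.generate(**inputs, max_new_tokens=64)[0], skip_special_tokens=True))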
You can try something like this to get a rough estimate: https://huggingface.co/spaces/NyxKrage/LLM-Model-VRAM-Calcul...
But you really don't know the exact numbers until you try; a lot of it is runtime- and environment-specific.
It's just a question of having enough VRAM+RAM to fit the model into memory.
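For a first-order number before downloading anything: the weights take roughly parameter count × bits per weight / 8 bytes, plus some slack for the KV cache and runtime buffers. Something like this (the 20% overhead factor is a rough guess, not a measured constant):

    # Back-of-envelope memory estimate; real usage depends on runtime, context length, batch size.
    def estimate_mem_gb(n_params_billion: float, bits_per_weight: float,
                        overhead: float = 1.2) -> float:
        """Weights only, with ~20% slack for KV cache and buffers."""
        weight_bytes = n_params_billion * 1e9 * bits_per_weight / 8
        return weight_bytes * overhead / 1e9

    print(f"{estimate_mem_gb(8, 4):.1f} GB")   # 8B model at 4-bit -> ~4.8 GB
    print(f"{estimate_mem_gb(8, 16):.1f} GB")  # 8B model at fp16  -> ~19.2 GB

That lines up roughly with the 3-16 GB file sizes of the quants linked above.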