As frontier models get closer and closer to consumer hardware, what's the moat for the API-driven $trillion labs?
48 GB is not consumer hardware. But fundamentally, there are economies of scale from batching, power distribution, better utilization, etc., that mean data-center tokens will be cheaper. Also, as the cost of training (frontier) models increases, it's not clear the Chinese companies will continue open-sourcing them. Notice, for example, that Qwen-Max is not open source.
Assuming 'moat': they'll push the frontier forward; they don't really have to worry until progress levels off.
At that point, I suppose there are still paid harnesses (people have always paid for IDEs despite FOSS options), partly for mindshare, and they could use their expertise and compute capacity to provide application-specific training for enterprises that need it.
> the API-driven $trillion labs?
here we go: https://huggingface.co/collections/trillionlabs/tri-series