Story Detail of id 43054521 | Liveview Hacker News

lifeisstillgood1 week ago | on: We were wrong about GPUs

I used to think this, ut it only works if the abstractions hold - it’s like if we stopped random access memory and went back to tape drives suddenly abstractions matter.

My comment elsewhere goes into but more detail but basically silicon stopped being able to make single threaded code faster in about 2012 - we just have been getting “more parallel cores” since. And now at wafer scale we see 900,000 cores on a “chip”. When 100% parallel coding runs 1 million times faster than your competitors, when following one software engineering path leads to code that can run 1M X, then we will find ways to use that excess capacity - and the engineers who can do it get to win.

I’m not sure how LLMs face this problem.

dbcjv7vhxj1 week ago | parent | next

This.

As soon as the abstractions leak or you run into an underlying issue you suddenly need to understand everything about the underlying system or you're SOOL.

I'd rather have a simpler system I already understand all the proceeding abstractions about.

The overhead of this is minimal when you keep things simple and avoid shiny things.

jonas211 week ago | parent

That's why abstractions like PyTorch exist. You can write a few lines of Python and get good utilization of all those GPU cores.

varelse1 week ago | root | parent

[dead]

#visit	12106500
#session	46836
#live-session	0