datadrivenangel 9 hours ago | on: Tinybox – A powerful computer for deep learning

Yeah, I've got the q4 gpt-oss-120b running at ~40-60 tokens per second on an M5 Pro.