I am curious about the rough compute budget they used for training DeepSeek-R1. I couldn't find anything in their report. Does anyone have more information on this?