No, LLMs are trained on more tokens than they have parameters. That puts them in the classical, under-parameterized regime, i.e. the first descent of the double descent curve.
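
For a rough sanity check of the tokens-versus-parameters comparison, here is a minimal sketch using publicly reported figures for a few models. The `MODELS` table and the helper names are illustrative, not from the comment, and treating each token as one training sample is a simplifying assumption: where the interpolation threshold actually falls depends on more than this raw count.

```python
# Publicly reported (parameter count, training tokens) pairs.
# Illustrative only; treating one token as one sample is an assumption.
MODELS = {
    "GPT-3 175B": (175e9, 300e9),
    "Chinchilla 70B": (70e9, 1.4e12),
    "Llama 2 70B": (70e9, 2.0e12),
}


def tokens_per_parameter(params: float, tokens: float) -> float:
    """Ratio of training tokens to model parameters."""
    return tokens / params


def regime(params: float, tokens: float) -> str:
    """Crude label: more tokens than parameters counts as 'classical'."""
    if tokens > params:
        return "under-parameterized (classical)"
    return "over-parameterized"


if __name__ == "__main__":
    for name, (params, tokens) in MODELS.items():
        ratio = tokens_per_parameter(params, tokens)
        print(f"{name}: {ratio:.1f} tokens/param -> {regime(params, tokens)}")
```

By this count, all three models see more tokens than they have parameters, which is the sense in which the comment calls the setting classical.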