Story Detail of id 41449881 | Liveview Hacker News

whimsicalism4 months ago | on: Ilya Sutskever's SSI Inc raises $1B

i think we should distinguish between pretraining and polishing/alignment data. what you are describing is most likely the latter (and probably mixed into to pretraining). but if you can't get a mass of tokens from scraping, you're going to be screwed

#visit	11482343
#session	45282
#live-session	0