Story Detail of id 47929877 | Liveview Hacker News

levocardia 12 hours ago | on: Talkie: a 13B vintage language model from 1930

Indeed, I found this part extremely interesting. The more general vision of "testing a vintage model on something invented after its training data ended" seems like quite a strong test of "true cognition" (or training data contamination, if you haven't stopped up all the leakage...)
