Do you mean pre-training? So 4.8 is just post-training on top of an old pretrained model?

Btw, where do they tell you how they trained the model?