Hacker News new | past | comments | ask | show | jobs | submit
It could be a much bigger MoE model
Then it would be slower.