Hacker News new | past | comments | ask | show | jobs | submit

Are Pre-Trained Convolutions Better Than Pre-Trained Transformers? (2021)

https://arxiv.org/abs/2105.03322