Hacker News new | past | comments | ask | show | jobs | submit
Not really. The model is good/fast at OCR, and preprocessing it actually makes it worse because academic paper formatting is very complicated. Sizes, positions, and equations are important.