However, these methods rely on manually curated post- correction data, which are relatively scarce compared to the non-annotated raw images that need to be digitized. ![]() Optical character recognition (OCR) can be used to produce digitized text, and previous work has demonstrated the utility of neural post-correction methods that improve the results of general- purpose OCR systems on recognition of less- well-resourced languages. Much of the existing linguistic data in many languages of the world is locked away in non- digitized books and documents.
0 Comments
Leave a Reply. |