Semi-OT (similar language): The national archives in Sweden and Finland published a model for OCR:ing handwritten Swedish text from the 1600s to the 1800s with what to me seems like a very high level of accuracy given the source material (4% character error rate).

https://readcoop.eu/model/the-swedish-lion-i/

https://www.transkribus.org/success-story/creating-the-swedi...

https://huggingface.co/Riksarkivet

They have also published online a fairly large volume of texts OCR:ed with this model (IIRC birth/death notices from church records). As a beginner genealogist, it's been fun to follow.


