Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Nice. I worked on something similar but far less robust: https://github.com/eihli/image-table-ocr. It fails to find the tables on the example images at extract-table.com, but the code is heavily commented at https://eihli.github.io/image-table-ocr/pdf_table_extraction... so there's high visibility into what's going on and what needs to change to get it to work with images of different sizes/fonts.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: