Hacker News

I've found local LLMs not good enough for OCR (while being a lot more resource-hungry), and I want to avoid OpenAI models for privacy reasons. Default tesseract does the job for me, since my only requirement for the results is "I can easily find what I need with full-text search". I rarely need to actually copy the text from the resulting PDFs.
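
For anyone wanting to try the same setup, here is a rough sketch of what "default tesseract" to a searchable PDF could look like when scripted. It assumes the `tesseract` binary is installed and on PATH; the helper names `build_tesseract_cmd` and `ocr_to_searchable_pdf` are made up for illustration and are not from the comment above.

```python
import shutil
import subprocess
from pathlib import Path


def build_tesseract_cmd(image: str, out_base: str, lang: str = "eng") -> list[str]:
    """Build the CLI call `tesseract IMAGE OUTPUTBASE -l LANG pdf`.

    The trailing `pdf` config makes tesseract write OUTPUTBASE.pdf, which
    contains the page image plus a hidden text layer for full-text search.
    """
    return ["tesseract", image, out_base, "-l", lang, "pdf"]


def ocr_to_searchable_pdf(image: str, lang: str = "eng") -> Path:
    """Run tesseract on one image and return the path of the resulting PDF."""
    if shutil.which("tesseract") is None:
        raise RuntimeError("tesseract not found on PATH")
    # Write the PDF next to the input image, e.g. scan.png -> scan.pdf
    out_base = str(Path(image).with_suffix(""))
    subprocess.run(build_tesseract_cmd(image, out_base, lang), check=True)
    return Path(out_base + ".pdf")
```

Calling `ocr_to_searchable_pdf("scan.png")` should leave a `scan.pdf` next to the image, which a desktop search tool or a PDF text extractor can then index. No extra tuning is applied here, which matches the "good enough for search" goal rather than clean copy-paste text.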

