I use paperless-ngx for digitizing all my documents, it also uses Tesseract. The...

oigursh · 2025-10-13T13:41:23 1760362883

There's https://github.com/icereed/paperless-gpt as a plugin

graynk · 2025-10-27T12:25:15 1761567915

Local LLMs I've found to not be good enough for OCR (while being a lot more resource hungry), and OpenAI models I want to avoid for privacy reasons. Default tesseract does the job for me, since my only requirements for the results it "I can easily find what I need with full-text search" - I rarely need to actually copy the text from the resulting PDFs