Improved

Sharper OCR for scans and phone photos

Documents with low-quality embedded text (phone scanners, junk-character PDFs) now route through real OCR instead of trusting the broken text layer. When nothing is a confident match, docs land in “uncategorized” rather than being forced into a wrong fit.