The local OCR that scored best, and let the chatbot show the diagrams

Fri, 05 Jun 2026 11:00:00 +0000

In the last post I built a strict exact-match test for the OCR behind a 1994 Yamaha XV250 Virago manual chatbot and scored four local pipelines against 100 hand-verified values. The live corpus (Docling running EasyOCR) scored 61 percent; the best of the four was Docling + RapidOCR at 85 percent, and it got there doing genuine OCR on the page pixels, with nothing leaving the machine and no per-page API bill.

AI Search on Jamie Ede

The local OCR that scored best, and let the chatbot show the diagrams