Tag: ocr

Setting up Tesseract on Ubuntu 18.04

I was presented with a task to quickly prepare a bare-metal Linux machine to run Tesseract OCR for optical character recognition task. The criterion was to to the recognition as fast as possible. In my tests, the Tesseract 5 was far more precise than Tesseract 4, no matter what o…

Inverting colors helps Tesseract

Been doing some work with OCR automation using Tesseract and discovered that it is really helpful to invert the image before doing the character recognition. Especially on black surfaces with the laser engraved characters. This quite makes sense, since the background is already i…