sikuli-driver team mailing list archive

Re: [Question #402228]: How to improve OCR accuracy or give it a font

To: sikuli-driver@xxxxxxxxxxxxxxxxxxx
From: RaiMan <question402228@xxxxxxxxxxxxxxxxxxxxx>
Date: Sun, 25 Sep 2016 07:47:23 -0000
Reply-to: question402228@xxxxxxxxxxxxxxxxxxxxx
Sender: bounces@xxxxxxxxxxxxx

Question #402228 on Sikuli changed:
https://answers.launchpad.net/sikuli/+question/402228

RaiMan proposed the following answer:
To learn more about Tesseract (which the website you mentioned also uses), look here:
https://github.com/tesseract-ocr/tesseract

If you have installed SikuliX with Tesseract/OCR support, the tessdata folder location is shown by:
print Settings.OcrDataPath

I cannot help you further, since I do not have any experience with that.
The text/OCR feature has been unchanged for 5 years and is still as it
was left by the original developers. Within SikuliX you do not have any
further options besides trying to use optimized regions.

You might find out more by searching Google (Sikuli + tesseract), since I know that some people have gone this way.
... e.g. a search for "sikulix tesseract training" turned up this:
https://answers.launchpad.net/sikuli/+question/273729

Another option for you might be to install Tesseract and run its command
in a subprocess to get the text from an image, after preprocessing the
image according to Tesseract's image quality recommendations.
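
A minimal sketch of the subprocess approach, assuming the tesseract executable is installed and on the PATH. The function names and the image file name in the usage note are made up for illustration; only the command-line form "tesseract <image> stdout -l <lang>" comes from Tesseract itself. Any preprocessing (scaling, binarizing) would have to happen before this call.

```python
import subprocess


def build_tesseract_command(image_path, lang="eng"):
    # Hypothetical helper: "stdout" as the output base tells tesseract
    # to print the recognized text instead of writing a .txt file.
    return ["tesseract", image_path, "stdout", "-l", lang]


def ocr_image(image_path, lang="eng"):
    # Hypothetical helper: run tesseract and return the recognized text.
    proc = subprocess.Popen(
        build_tesseract_command(image_path, lang),
        stdout=subprocess.PIPE,
        stderr=subprocess.PIPE,
    )
    out, err = proc.communicate()
    if proc.returncode != 0:
        raise RuntimeError("tesseract failed: %s" % err.decode("utf-8", "replace"))
    return out.decode("utf-8").strip()
```

In a script this might be used as text = ocr_image("capture.png"), where capture.png is an image you have already saved and preprocessed.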
