sikuli-driver team mailing list archive

Re: [Question #402228]: How to improve OCR accuracy or give it a font

 

Question #402228 on Sikuli changed:
https://answers.launchpad.net/sikuli/+question/402228

RaiMan proposed the following answer:
To learn more about Tesseract (which the website you mentioned also uses), look here:
https://github.com/tesseract-ocr/tesseract

If you have installed SikuliX with Tesseract/OCR support, this prints the location of the tessdata folder:
print Settings.OcrDataPath

I cannot help you further, since I have no experience with that.
The text/OCR feature has been unchanged for 5 years and is still as the
original developers left it. You have no further options besides
trying to use optimized regions.

You might find more about this by searching Google (Sikuli + tesseract), since I know that some people have gone this way.
For example, searching for "sikulix tesseract training" turned up this:
https://answers.launchpad.net/sikuli/+question/273729

Another option might be to install Tesseract and run its command in a
subprocess to get the text from an image, after preprocessing the image
according to Tesseract's quality prerequisites.
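
The subprocess approach could look roughly like the sketch below. It assumes the tesseract executable is installed and on the PATH, and that the image has already been saved to a file and preprocessed. The function names build_tesseract_command and read_text are hypothetical helpers made up for this illustration; they are not part of SikuliX or Tesseract.

```python
import subprocess


def build_tesseract_command(image_path, lang="eng"):
    # Hypothetical helper: assemble the command line for the tesseract CLI.
    # Passing "stdout" as the output base makes tesseract print the
    # recognized text instead of writing it to a .txt file.
    return ["tesseract", image_path, "stdout", "-l", lang]


def read_text(image_path, lang="eng"):
    # Hypothetical helper: run tesseract and return the recognized text.
    # Assumes the tesseract executable is on the PATH. Raises
    # subprocess.CalledProcessError if tesseract exits with an error.
    output = subprocess.check_output(build_tesseract_command(image_path, lang))
    return output.decode("utf-8").strip()
```

Calling read_text("screenshot.png") would then return whatever Tesseract recognizes in that file. The "-l" value has to match a language data file present in the tessdata folder of the installed Tesseract.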
