← Back to team overview

sikuli-driver team mailing list archive

Re: [Question #211334]: OCR usage and integration with Sikuli

 

Question #211334 on Sikuli changed:
https://answers.launchpad.net/sikuli/+question/211334

    Status: Open => Answered

RaiMan proposed the following answer:
both ImageMagick (convert: work on images) and Tesseract (tesseract: get
text from an image) have commands, that can be used on the command line
and taking files as input and giving files as output.

So principally a shell script
convert input.png ... some actions output.tiff  //* optimize for OCR 
tesseract output.tiff ... additional options 

would create a textfile in the end containing what Tesseract could read.

This shellscript can be run from a Java program, that produces input.png
using Sikuli features and finally reads output.txt to get the textual
result.

At least the first step (convert) can be done in memory using JMagick
and BufferedImages (which you can get from Sikuli too).

-- 
You received this question notification because you are a member of
Sikuli Drivers, which is an answer contact for Sikuli.