sikuli-driver team mailing list archive
Message #36776
Re: [Question #285002]: Tesseract with CJK and 1.1.1
Question #285002 on Sikuli changed:
https://answers.launchpad.net/sikuli/+question/285002
Barry Janzen gave more information on the question:
OK, I got a bit closer after looking at
https://answers.launchpad.net/sikuli/+faq/2709
Some things have changed since that FAQ was written: the download that
the code.google.com link pointed to is now on GitHub at
https://github.com/tesseract-ocr/tessdata.
I downloaded the zip but didn't know where to put the .traineddata
files I wanted, so I ran some diagnostics and found that my
eng.traineddata is located at
~/Library/Application Support/Sikulix/SikulixTesseract/tessdata/eng.traineddata
I put the jpn.traineddata file from GitHub in that same directory and
added the following to my script:
import org.sikuli.script.TextRecognizer as TR
Settings.OcrTextSearch = True
Settings.OcrTextRead = True
Settings.OcrLanguage = "jpn"
TR.reset()
When I played around with the language setting, trying values such as
the Mac's "ja", the locale "ja_JA", or Tesseract's "jpn", I get:
Failed loading language 'jpn'
Tesseract couldn't load any languages!
So I'm getting closer, but not quite there.
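
The diagnostic step described above (confirming that jpn.traineddata sits
next to eng.traineddata) could be sketched roughly as follows. This is
only an illustration: the helper name check_traineddata is hypothetical,
the directory is the one quoted in this message and may differ on other
installs, and treating a zero-byte file as a problem is an assumption
about one possible cause, not a confirmed explanation of the error.

```python
import os


def check_traineddata(tessdata_dir, languages):
    """Map each language code to True if <code>.traineddata exists
    as a non-empty regular file inside tessdata_dir."""
    tessdata_dir = os.path.expanduser(tessdata_dir)
    result = {}
    for lang in languages:
        path = os.path.join(tessdata_dir, lang + ".traineddata")
        # A missing or zero-byte file (e.g. an interrupted download)
        # is reported as False.
        result[lang] = os.path.isfile(path) and os.path.getsize(path) > 0
    return result


# Directory taken from the message above; adjust for your own install.
TESSDATA = "~/Library/Application Support/Sikulix/SikulixTesseract/tessdata"

for lang, ok in sorted(check_traineddata(TESSDATA, ["eng", "jpn"]).items()):
    print("%s.traineddata: %s" % (lang, "found" if ok else "missing or empty"))
```

The check only looks at the file system, so it says nothing about
whether a given .traineddata file is compatible with the Tesseract
version bundled with SikuliX.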