← Back to team overview

sikuli-driver team mailing list archive

Re: [Question #285002]: Tesseract with CJK and 1.1.1

 

Question #285002 on Sikuli changed:
https://answers.launchpad.net/sikuli/+question/285002

Barry Janzen gave more information on the question:
OK, I got a bit closer, looking at
https://answers.launchpad.net/sikuli/+faq/2709

Some things have changed since then:  the code.google.com link is now on
github at https://github.com/tesseract-ocr/tessdata.

I downloaded the zip, then didn't know where to put the .traineddata
files I wanted, so I ran some diagnostics and found my eng.traineddata
is located at ~/Library/Application
Support/Sikulix/SikulixTesseract/tessdata/eng.traineddata

I put the jpn.traineddata file from github here.  I included in my
script

import org.sikuli.script.TextRecognizer as TR
Settings.OcrTextSearch = True
Settings.OcrTextRead = True
Settings.OcrLanguage = "jpn"
TR.reset()

When I played around with the language settings, such as Mac's "ja" or
Locale "ja_JA", or tesseract's "jpn" I get

Failed loading language 'jpn'
Tesseract couldn't load any languages!

So I'm getting closer, but not quite there.

-- 
You received this question notification because your team Sikuli Drivers
is an answer contact for Sikuli.