sikuli-driver team mailing list archive
Message #16530
[Bug 795391] Re: [request] OCR/tesseract: allow new training sets for other languages and more tesseract features
** Changed in: sikuli
Status: New => Incomplete
** Changed in: sikuli
Status: Incomplete => In Progress
** Changed in: sikuli
Importance: Wishlist => Medium
** Changed in: sikuli
Assignee: (unassigned) => RaiMan (raimund-hocke)
** Tags added: fkt-text
--
You received this bug notification because you are a member of Sikuli
Drivers, which is subscribed to Sikuli.
https://bugs.launchpad.net/bugs/795391
Title:
[request] OCR/tesseract: allow new training sets for other languages
and more tesseract features
Status in Sikuli:
In Progress
Bug description:
--- implementation of other languages
A training set can be created for Tesseract 2.04 as described here:
http://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract2
and implemented in sikuli-script.jar.
If it contains non-English characters (e.g. Norwegian), Sikuli
crashes.
--- Tesseract's bold feature is not used properly by Sikuli: only @ is
returned instead of @x for bold characters.
To manage notifications about this bug go to:
https://bugs.launchpad.net/sikuli/+bug/795391/+subscriptions