← Back to team overview

sikuli-driver team mailing list archive

[Bug 795391] Re: [request] OCR/tesseract: allow new training sets for other languages and more tesseract features

 

** Changed in: sikuli
       Status: New => Incomplete

** Changed in: sikuli
       Status: Incomplete => In Progress

** Changed in: sikuli
   Importance: Wishlist => Medium

** Changed in: sikuli
     Assignee: (unassigned) => RaiMan (raimund-hocke)

** Tags added: fkt-text

-- 
You received this bug notification because you are a member of Sikuli
Drivers, which is subscribed to Sikuli.
https://bugs.launchpad.net/bugs/795391

Title:
  [request] OCR/tesseract: allow new training sets for other languages
  and more tesseract features

Status in Sikuli:
  In Progress

Bug description:
  --- implementation of other languages
  A training set can be created for tesseract 2.04 as described here:
  http://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract2
  and implemented in sikuli-script.jar.

  If it contains non english characters, Sikuli crashes (e.g.
  norwegian).

  --- Tesseracts bold-feature is not used properly by Sikuli: only @ is
  returned instead of @x for bold characters.

To manage notifications about this bug go to:
https://bugs.launchpad.net/sikuli/+bug/795391/+subscriptions


References