sikuli-driver team mailing list archive

[Bug 795391] [NEW] [request] OCR/tesseract: allow new training sets for other languages and more tesseract features

 

Public bug reported:

--- implementation of other languages
A training set can be created for Tesseract 2.04 as described here:
http://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract2
and implemented in sikuli-script.jar.

If the training set contains non-English characters (e.g. Norwegian),
Sikuli crashes.

--- Tesseract's bold feature is not used properly by Sikuli: only @ is
returned instead of @x for bold characters.

** Affects: sikuli
     Importance: Undecided
         Status: New

-- 
You received this bug notification because you are a member of Sikuli
Drivers, which is subscribed to Sikuli.
https://bugs.launchpad.net/bugs/795391

To manage notifications about this bug go to:
https://bugs.launchpad.net/sikuli/+bug/795391/+subscriptions

