← Back to team overview

sikuli-driver team mailing list archive

[Bug 1365575] Re: [request] tesseract: want to use tessedit_char_whitelist

 

a solution see related question

-- 
You received this bug notification because you are a member of Sikuli
Drivers, which is subscribed to Sikuli.
https://bugs.launchpad.net/bugs/1365575

Title:
  [request] tesseract: want to use tessedit_char_whitelist

Status in Sikuli:
  In Progress

Bug description:
  I'm trying to override the tessedit_char_whitelist Tesseract config
  parameter. I want to tell Tesseract to only match on AlphaNumberic
  characters (don't include punctuation etc). The full parameter
  definition is:

  tessedit_char_whitelist
  abcdefghijklmnopqrtsuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ0123456789

  Problem is I don't know where to put this in the tessdata folder.
  Someone said use the "bazaar" pattern matching file. In other words
  define a file in the config directory called "bazaar_test" and put
  that line in it.

  First I tried it with a stand alone install of Tesseract and it
  actually worked! Then I tried it in Sekuli's tessdata directory but
  didn't have any luck.

  Has anyone ever tried to do this before?

To manage notifications about this bug go to:
https://bugs.launchpad.net/sikuli/+bug/1365575/+subscriptions


References