← Back to team overview

sikuli-driver team mailing list archive

[Bug 1365575] [NEW] [request] tesseract: want to use tessedit_char_whitelist

 

Public bug reported:

I'm trying to override the tessedit_char_whitelist Tesseract config
parameter. I want to tell Tesseract to only match on AlphaNumberic
characters (don't include punctuation etc). The full parameter
definition is:

tessedit_char_whitelist
abcdefghijklmnopqrtsuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ0123456789

Problem is I don't know where to put this in the tessdata folder.
Someone said use the "bazaar" pattern matching file. In other words
define a file in the config directory called "bazaar_test" and put that
line in it.

First I tried it with a stand alone install of Tesseract and it actually
worked! Then I tried it in Sekuli's tessdata directory but didn't have
any luck.

Has anyone ever tried to do this before?

** Affects: sikuli
     Importance: Medium
     Assignee: RaiMan (raimund-hocke)
         Status: In Progress

** Changed in: sikuli
       Status: New => In Progress

** Changed in: sikuli
   Importance: Undecided => Medium

** Changed in: sikuli
     Assignee: (unassigned) => RaiMan (raimund-hocke)

** Changed in: sikuli
    Milestone: None => 1.2.0

-- 
You received this bug notification because you are a member of Sikuli
Drivers, which is subscribed to Sikuli.
https://bugs.launchpad.net/bugs/1365575

Title:
  [request] tesseract: want to use tessedit_char_whitelist

Status in Sikuli:
  In Progress

Bug description:
  I'm trying to override the tessedit_char_whitelist Tesseract config
  parameter. I want to tell Tesseract to only match on AlphaNumberic
  characters (don't include punctuation etc). The full parameter
  definition is:

  tessedit_char_whitelist
  abcdefghijklmnopqrtsuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ0123456789

  Problem is I don't know where to put this in the tessdata folder.
  Someone said use the "bazaar" pattern matching file. In other words
  define a file in the config directory called "bazaar_test" and put
  that line in it.

  First I tried it with a stand alone install of Tesseract and it
  actually worked! Then I tried it in Sekuli's tessdata directory but
  didn't have any luck.

  Has anyone ever tried to do this before?

To manage notifications about this bug go to:
https://bugs.launchpad.net/sikuli/+bug/1365575/+subscriptions


Follow ups

References