sikuli-driver team mailing list archive
-
sikuli-driver team
-
Mailing list archive
-
Message #53362
[Bug 1365575] Re: [request] tesseract: want to use tessedit_char_whitelist
** Changed in: sikuli
Status: In Progress => Fix Released
--
You received this bug notification because you are a member of Sikuli
Drivers, which is subscribed to Sikuli.
https://bugs.launchpad.net/bugs/1365575
Title:
[request] tesseract: want to use tessedit_char_whitelist
Status in Sikuli:
Fix Released
Bug description:
I'm trying to override the tessedit_char_whitelist Tesseract config
parameter. I want to tell Tesseract to only match on AlphaNumberic
characters (don't include punctuation etc). The full parameter
definition is:
tessedit_char_whitelist
abcdefghijklmnopqrtsuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ0123456789
Problem is I don't know where to put this in the tessdata folder.
Someone said use the "bazaar" pattern matching file. In other words
define a file in the config directory called "bazaar_test" and put
that line in it.
First I tried it with a stand alone install of Tesseract and it
actually worked! Then I tried it in Sekuli's tessdata directory but
didn't have any luck.
Has anyone ever tried to do this before?
To manage notifications about this bug go to:
https://bugs.launchpad.net/sikuli/+bug/1365575/+subscriptions
References