sikuli-driver team mailing list archive

[Question #254041]: tessedit_char_whitelist

 

New question #254041 on Sikuli:
https://answers.launchpad.net/sikuli/+question/254041

I'm trying to override the tessedit_char_whitelist Tesseract config parameter. I want to tell Tesseract to match only alphanumeric characters (no punctuation, etc.). The full parameter definition is:

tessedit_char_whitelist abcdefghijklmnopqrtsuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ0123456789

The problem is that I don't know where to put this in the tessdata folder. Someone suggested using the "bazaar" pattern-matching file: in other words, define a file called "bazaar_test" in the config directory and put that line in it.
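As a rough illustration of the step just described, here is a minimal Python sketch that writes such a one-line config file. The helper name `write_whitelist_config` is hypothetical, and the sketch assumes that config files live in a "configs" subdirectory of tessdata, as in a stock Tesseract install; the layout of Sikuli's bundled tessdata may differ.

```python
import string
import tempfile
from pathlib import Path

# Same character set as the whitelist quoted above, built from the standard library.
WHITELIST = string.ascii_lowercase + string.ascii_uppercase + string.digits


def write_whitelist_config(tessdata_dir, config_name="bazaar_test"):
    """Write a one-line Tesseract config file and return its path.

    Assumption: config files sit in a "configs" subdirectory of tessdata,
    as in a stock Tesseract install. Adjust if your layout differs.
    """
    configs_dir = Path(tessdata_dir) / "configs"
    configs_dir.mkdir(parents=True, exist_ok=True)
    config_path = configs_dir / config_name
    # Config format is "<parameter name> <value>" on a single line.
    config_path.write_text("tessedit_char_whitelist " + WHITELIST + "\n")
    return config_path


if __name__ == "__main__":
    # Demo against a throwaway directory; point this at the real tessdata folder.
    with tempfile.TemporaryDirectory() as tmp:
        path = write_whitelist_config(tmp)
        print(path.read_text(), end="")
```

With a standalone Tesseract, a config file like this is normally selected by passing its name as a trailing argument, for example `tesseract image.png out bazaar_test`. Whether Sikuli's bundled Tesseract reads config files the same way is not something this sketch settles.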

First I tried it with a standalone install of Tesseract and it actually worked! Then I tried it in Sikuli's tessdata directory but didn't have any luck.

Has anyone ever tried to do this before?
