cuneiform team mailing list archive

[Bug 395351] Re: Add --nopictures, --tables=n to cli

 

I saw the earlier discussion about this.  I believe the missing code is
for table *output* (e.g. as a spreadsheet).  Enabling tables definitely
allows the OCR code to look inside outlined areas, which it otherwise
will not.  In fact, if you OCR a page with a border, it will not
recognize anything without these new options.

Also, assuming the output of bzr send is reasonable, the two patches are
separate, so you could apply --nopictures on its own.

-- 
Add --nopictures, --tables=n to cli
https://bugs.launchpad.net/bugs/395351
You received this bug notification because you are a member of Cuneiform
Linux, which is the registrant for Cuneiform for Linux.

Status in Linux port of Cuneiform: New

Bug description:
I found that cuneiform would not OCR anything inside an outline (any kind of box) and would instead treat it as either a picture or a table.  With '--nopictures' alone it seems to just ignore those same areas.  If you also add --tables, it will successfully OCR inside the table.

I had never used bzr before this, so I've probably botched the patch somehow, but it is fairly trivial.


