cuneiform team mailing list archive
-
cuneiform team
-
Mailing list archive
-
Message #00331
[Bug 395351] Re: Add --nopictures, --tables=n to cli
I saw the earlier discussion about this. I believe the missing code is
for table *output* (eg as a spreadsheet). Enabling tables definitely
allows the OCR code to look inside outlined areas, which it otherwise
will not. In fact, if you OCR a page with a border it will not
recognize anything without these new options.
Also, assuming bzr send is reasonable, the two patches are separate, so
you could apply --nopictures.
--
Add --nopictures, --tables=n to cli
https://bugs.launchpad.net/bugs/395351
You received this bug notification because you are a member of Cuneiform
Linux, which is the registrant for Cuneiform for Linux.
Status in Linux port of Cuneiform: New
Bug description:
I found that cuneiform would not OCR anything inside an outline (any kind of box) and would consider it either a picture or a table. With '--nopicture' it seems to just ignore those same areas. If you also add --tables it will successfully OCR inside the table.
I've never used bzr before this so I've probably botched the patch somehow, but it is fairly trivial.
References