c2c-oerpscenario team mailing list archive
-
c2c-oerpscenario team
-
Mailing list archive
-
Message #21726
Re: [Bug 758717] [NEW] attachments - indexed content of pdf not very good
On Tuesday 12 April 2011, you wrote:
> Public bug reported:
>
> IMHO it's not an OpenERP problem, but a pdf2txt issue.
> nevertheless OpenERP will be blamed if the content is not correctly
> indexed. may be the page should say "experimental" somewhere
>
I always tell people that all content indexers *try* to extract the text of
the files. It doesn't mean that they will always produce the same result as a
human eye would read at the same files...
I'm afraid there is little we can do there.
However, if you see at the code, it is modular enough so that anybody that can
provide better tools (say, a replacement for pdf2txt) can plug them in and
improve the indexing.
--
You received this bug notification because you are a member of C2C
OERPScenario, which is subscribed to the OpenERP Project Group.
https://bugs.launchpad.net/bugs/758717
Title:
attachments - indexed content of pdf not very good
Status in OpenERP Modules (addons):
New
Bug description:
IMHO it's not an OpenERP problem, but a pdf2txt issue.
nevertheless OpenERP will be blamed if the content is not correctly indexed.
may be the page should say "experimental" somewhere
References