← Back to team overview

c2c-oerpscenario team mailing list archive

Re: [Bug 758717] [NEW] attachments - indexed content of pdf not very good

 

On Tuesday 12 April 2011, you wrote:
> Public bug reported:
> 
> IMHO it's not an OpenERP problem, but a pdf2txt issue.
> nevertheless OpenERP will be blamed if the content is not correctly
> indexed. may be the page should say  "experimental" somewhere
> 

I always tell people that all content indexers *try* to extract the text of 
the files. It doesn't mean that they will always produce the same result as a 
human eye would read at the same files...

I'm afraid there is little we can do there.
However, if you see at the code, it is modular enough so that anybody that can 
provide better tools (say, a replacement for pdf2txt) can plug them in and 
improve the indexing.

-- 
You received this bug notification because you are a member of C2C
OERPScenario, which is subscribed to the OpenERP Project Group.
https://bugs.launchpad.net/bugs/758717

Title:
  attachments - indexed content of pdf not very good

Status in OpenERP Modules (addons):
  New

Bug description:
  IMHO it's not an OpenERP problem, but a pdf2txt issue.
  nevertheless OpenERP will be blamed if the content is not correctly indexed.
  may be the page should say  "experimental" somewhere



References