← Back to team overview

cuneiform team mailing list archive

[Bug 623438] Re: Font size not correct in merged sandvich PDF

 

I have filed a bug there at exactimage (which is the package containing
hocr2pdf) - see comment #25

With my last comment I wanted to point out that for creating a proper
final sandvich PDF more information might be necessary - maybe bounding
boxes for words also - and maybe you could identify Descent and Ascent
information for better font choosing.

Anyway, I am still waiting for any response from exactimage developer(s)
to tell their view of the things...

I currently cannot do much more than filing bugs and testing. And I even tried to reach them by phone.
It it has been determined what needs to be done, maybe there is an option to pay for implementation/fix, but currently I don't have an idea if and how the problem can be solved (and approximately what amount of work).

-- 
Font size not correct in merged sandvich PDF
https://bugs.launchpad.net/bugs/623438
You received this bug notification because you are a member of Cuneiform
Linux, which is the registrant for Cuneiform for Linux.

Status in Linux port of Cuneiform: Invalid

Bug description:
After processing with Cuneiform for Linux 1.0.0 and hOCR to PDF converter, version 0.7.4 (should be the most current version) I get a sandvich pdf that looks nice until I select text.

See the sample 5AADFEE1-0000.* files in the attachment and the result.pdf.
The effect is shown in screen087.png

For another file (Test10pages.pdf) the effect is either worse - basically I cannot really select any more text to copy because I only can guess where to move with the mouse.

It looks like that the font size in the HTML is somehow not correct - I am not an expert, but this link might help you:
http://www.emdpi.com/fontsize.html





References