According to Geoff Hutchison:
>
>
> On Mon, 1 Feb 1999, Gilles Detillieux wrote:
>
> > I don't know where on the bug-fix/new-feature spectrum this fits, but I
> > patched the PDF processing code to handle some PDFs we have here. They
> > were generated by Acrobat PDFwriter from some Corel Draw files, and the
> > PostScript files that acroread made from them did some strange stuff with
> > character spacing - essentially it would commonly crank up the character
>
> I would generally categorize this as a bug, right? I'm assuming the patch
> doesn't affect other PDFs?
The only other PDF on our web site is a table from a WordPerfect document.
It was indexed fine before, and still is. However, as I have so few
PDFs, I was hoping other users could test out this patch, to make sure
it doesn't cause problems with other PDFs. As long as the units used
for the Tc command in PDFs is consistent, it should not pose a problem,
but I'd like some independent confirmation (i.e. testing) of this.
Maybe if the patch is included in the next snapshot, we can post a
message to the whole htdig mailing list, asking for testers for this
and a lot of other new changes/fixes, before going to final release.
By the way, another problem with these weird PDFs from CorelDraw files
is that occasionally they'd insert a TD positioning command right in the
middle of a word, leading htdig to break it into two words. That is not
nearly as easy to fix, and as it doesn't do this very frequently, I'm not
going to bother with trying to fix it.
--
Gilles R. Detillieux E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre WWW: http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba Phone: (204)789-3766
Winnipeg, MB R3E 3J7 (Canada) Fax: (204)789-3930
------------------------------------
To unsubscribe from the htdig3-dev mailing list, send a message to
[EMAIL PROTECTED] containing the single word "unsubscribe" in
the SUBJECT of the message.