I'm trying to index PDF files. I'm using htdig 3.1.4 on Mandrake 6.1.
I first tried Acroread. Acroread 4.0 crashes with a "segmentation fault".
Acroread 3.0 does index the files, but the text shown in the search results
is binary gibberish.
I then decided to try xpdf. I downloaded the xpdf binaries, but now I'm
stuck on getting parse_doc.pl from your
http://www.htdig.org/files/contrib/parsers/ directory, because it is stored
as parse_doc.pl.gz.
"gunzip parse_doc.pl.gz" gives this error:
gunzip: parse_doc.pl.gz: not in gzip format
So how do I extract parse_doc.pl from parse_doc.pl.gz?
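
In case it helps with diagnosis, here is a minimal sketch of a check for whether the downloaded file is really gzip data. Every gzip stream begins with the two bytes 0x1f 0x8b, so a file that lacks them would produce the "not in gzip format" error. The function name looks_like_gzip is my own, and the idea that the browser may already have decompressed the file during download is only an assumption on my part.

```python
import sys

# Every gzip stream starts with these two bytes (RFC 1952).
GZIP_MAGIC = b"\x1f\x8b"


def looks_like_gzip(path):
    """Return True if the file at `path` starts with the gzip magic bytes.

    Hypothetical helper for illustration; it only inspects the header and
    does not verify that the rest of the file is a valid gzip stream.
    """
    with open(path, "rb") as f:
        return f.read(2) == GZIP_MAGIC


if __name__ == "__main__":
    # Usage: python check_gzip.py parse_doc.pl.gz
    for name in sys.argv[1:]:
        print(name, looks_like_gzip(name))
```

If this prints False for parse_doc.pl.gz, the file might already be plain Perl text despite its name, in which case renaming it to parse_doc.pl could be enough. That is a guess, not something I have confirmed.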
TIA.
Wayne Larmon