I'm trying to index pdf files.  I'm using htdig 3.1.4 on Mandrake 6.1.

I first tried Acroread.  Acroread 4.0 fails with a "segmentation fault"
problem.  Acroread 3.0 indexes, but the text in the search results is binary
gibberish.

I then decided to try xpdf.  I got the xpdf binaries downloaded, but now I'm
stuck on accessing parse_doc.pl from your
http://www.htdig.org/files/contrib/parsers/ directory because is is stored
as parse_doc.pl.gz.

"gunzip parse_doc.pl.gz" gives this error:

gunzip: parse_doc.pl.gz: not in gzip format

So how do I access parse_doc.pl.gz?

TIA.

Wayne Larmon




------------------------------------
To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.

Reply via email to