Hello Wayne,
I was able to use gunzip on the file at
http://www.htdig.org/files/contrib/parsers/parse_doc.pl.gz. You might also try
ftp://ftp.htdig.org/pub/htdigcontrib/parsers. The uncompressed version of the
file appears in the htdig-3.1.3 directory that's created when you untar the file
htdig-3.1.3.tar.gz under the contrib directory.
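
If gunzip still reports "not in gzip format" on your copy, one possible cause (a guess on my part, I have not seen your file) is that the download was already decompressed in transit, or that what was saved is not the archive at all. Below is a minimal Python sketch for checking that. The function names looks_like_gzip and unpack are made up for illustration and are not part of htdig. It checks for the two magic bytes that begin every gzip stream and either decompresses the file or copies it unchanged.

```python
import gzip
import shutil

# Every gzip stream begins with these two bytes (RFC 1952).
GZIP_MAGIC = b"\x1f\x8b"


def looks_like_gzip(path):
    """Return True if the file starts with the gzip magic bytes."""
    with open(path, "rb") as f:
        return f.read(2) == GZIP_MAGIC


def unpack(src, dest):
    """Write the uncompressed content of src to dest.

    Returns "gunzipped" if src was real gzip data, or "copied" if src
    was not gzip (for example, already decompressed during download).
    """
    if looks_like_gzip(src):
        with gzip.open(src, "rb") as fin, open(dest, "wb") as fout:
            shutil.copyfileobj(fin, fout)
        return "gunzipped"
    # Not gzip data: keep the bytes as they are so they can be inspected.
    shutil.copyfile(src, dest)
    return "copied"
```

Calling unpack("parse_doc.pl.gz", "parse_doc.pl") and getting "copied" back would suggest the file is worth opening in a text editor: it may already be the plain Perl script, or it may be something else entirely, such as an error page.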
I hope this helps.
Mark Gannon
On Sat, 11 Dec 1999, you wrote:
> I'm trying to index pdf files. I'm using htdig 3.1.4 on Mandrake 6.1.
>
> I first tried Acroread. Acroread 4.0 fails with a "segmentation fault"
> problem. Acroread 3.0 indexes, but the text in the search results is binary
> gibberish.
>
> I then decided to try xpdf. I got the xpdf binaries downloaded, but now I'm
> stuck on accessing parse_doc.pl from your
> http://www.htdig.org/files/contrib/parsers/ directory because it is stored
> as parse_doc.pl.gz.
>
> "gunzip parse_doc.pl.gz" gives this error:
>
> gunzip: parse_doc.pl.gz: not in gzip format
>
> So how do I access parse_doc.pl.gz?
>
> TIA.
>
> Wayne Larmon
>
>
>
>
> ------------------------------------
> To unsubscribe from the htdig mailing list, send a message to
> [EMAIL PROTECTED]
> You will receive a message to confirm this.