> > Hello Dan, Hello Martin,
> > > AFAIK if you don't want to use an external parser, there is only one > possibility left: I do want to use an external parser but I need not xpdf or I need to get xpdf installed minimally. > > Use the internal parser command as described in > > http://www.htdig.org/attrs.html#pdf_parser I'm using 3.2.0, not sure if it has that attribute > > Hence, for this to work you need to install Acrobat Reader > from Adobe. Dis- pite of what OS you use and the latest > security flaws for Acrobat Reader, which are hopefully fixed > for your OS also, the internal pdf_parser command > is very limited. > > htmldoc or doc2html just calls xpdf for the pdf2text Really? I have htmldoc on another server and not a wimper about xpdf anywher on the server except in the ports collection. I think htmldoc only does html to html,postscript,or pdf but not the other way around anyway. > conversion. An additional disadvantage using Acrobat Reader > is the fact, that you will index Postscript files and you > _have_ to adjust the max_doc_size in you htdig.conf above the > size of your biggest PDF/PostScript file. > > BUT: > Installing some X11 library stuff does not mean to run the > service :)). If you use some kind of Linux with rpm, try Actually I'm trying via rpm but it keeps failing. I'll try to do xpdf with --nodeps like you suggested. I'm on RedHat 8.0 if that helps. Thanks for the reply! Dan > installing only the libraries with --nodeps. Using Solaris > and depending on which Solaris (7/8/9) you use, you don't > need to satisfy all dependancies. Ask me again for a list of > packages :). > > Yours, > > Martin > > -- > > -------------------------------------------------------- > arago AG, Institut fuer komplexes Datenmanagement > Am Niddatal 3, 60488 Frankfurt/Main, [EMAIL PROTECTED] > Tel. 069/405680, Fax 069/40568111, http://www.arago.de > -------------------------------------------------------- > > > On Mon, Jun 23, 2003 at 02:06:03PM -0500, Dan Muey wrote: > > Hello list, > > > > I'd like to parse and index pdf files but when I try to > install xpdf > > it wants/needs to install a bunch of x windows stuff which I don't > > want to do but even if I try to it keeps failing. > > > > So what I'd like to ask is this: > > > > Has anyone successfully used something else beside xpdf, > like htmldoc > > for instance, to be able to index pdf files? > > > > If so any pointers/documentation would be very helpful. > > > > TIA > > > > Dan > > > > > > ------------------------------------------------------- > > This SF.Net email is sponsored by: INetU > > Attention Web Developers & Consultants: Become An INetU Hosting > > Partner. Refer Dedicated Servers. We Manage Them. You Get > 10% Monthly > > Commission! INetU Dedicated Managed Hosting > > http://www.inetu.net/partner/index.php > > _______________________________________________ > > htdig-general mailing list <[EMAIL PROTECTED]> > > To unsubscribe, send a message to > <[EMAIL PROTECTED]> with a subject > of unsubscribe > > FAQ: http://htdig.sourceforge.net/FAQ.html > ------------------------------------------------------- This SF.Net email is sponsored by: INetU Attention Web Developers & Consultants: Become An INetU Hosting Partner. Refer Dedicated Servers. We Manage Them. You Get 10% Monthly Commission! INetU Dedicated Managed Hosting http://www.inetu.net/partner/index.php _______________________________________________ htdig-general mailing list <[EMAIL PROTECTED]> To unsubscribe, send a message to <[EMAIL PROTECTED]> with a subject of unsubscribe FAQ: http://htdig.sourceforge.net/FAQ.html

