On Tue, 19 Jun 2001 14:51:15 +0100 Marcus Valentine
<[EMAIL PROTECTED]> wrote:
> At 12:37 19/06/01 +0100, David Adams wrote:
> >If you are using version 3.0 of doc2html, then script doc2html.pl calls a
> >separate script, pdf2html.pl, to do the conversion. This script in turn
> >calls pdfinfo and pdftotext from the xpdf package.
> >
> >1) Try using both pdfinfo and pdftotext from the command line.
> > If they don't work then either you haven't installed them properly or
> > there is something wrong with your .PDF file.
>
> They work
>
> >2) Call pdf2html.pl from the command line:
> >
> > <full path name>pdf2html.pl <pathname to .PDF file>
> >
> > to check that it is producing HTML output.
>
> It does
>
> >3) If it does not work then check your installation of pdftotext, and
> >your tailoring of pdf2html.pl.
> >
> >4) Else if it does work then try doc2html.pl from the command line:
> >
> > <full path name>doc2html.pl <pathname to .PDF file> application/pdf Name
>
> It works from the command line - I had to put e:\cygwin\bin back into the
> path because something (what?) needs gzip. The doc2html log file contains
>
> ? [application/pdf] PDF (pdf2html) 15680

I don't know why gzip is needed, but if it is then you must
make sure that its directory is included in PATH before
invoking htdig.
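
A quick way to confirm the PATH point is a small check run in the same environment that starts htdig. This is only a sketch: the tool list is taken from this thread (pdfinfo, pdftotext, gzip), and the function name find_missing_tools is made up for illustration.

```python
import shutil

# Tool names mentioned in this thread; adjust the list for your own setup.
REQUIRED_TOOLS = ["pdfinfo", "pdftotext", "gzip"]


def find_missing_tools(tools, path=None):
    """Return the tools that cannot be found as executables on the search path.

    path is a PATH-style string; None means use the current PATH.
    """
    return [tool for tool in tools if shutil.which(tool, path=path) is None]


if __name__ == "__main__":
    missing = find_missing_tools(REQUIRED_TOOLS)
    if missing:
        print("Not found on PATH:", ", ".join(missing))
    else:
        print("All required tools were found on PATH.")
```

Run it from whatever starts htdig (the same shell, batch file or scheduled job), since an interactive shell may have a different PATH.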
>
> But it still doesn't work when called by htdig. Furthermore - the log file
> isn't created. Does this mean doc2html isn't being called properly?

Yes, it does. I presume htdig is indexing HTML files OK?
The first thing to check is your htdig configuration file.
Look at the external_parsers attribute in particular. It
should be something like:

    external_parsers: application/pdf->text/html \
                      <full path name>doc2html.pl

The only other thing I can think of is the environment
variable TMPDIR. Its value, as set before htdig is invoked,
must point to a directory where htdig can write files.
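
The TMPDIR point can be tested the same way. The sketch below assumes nothing about htdig itself: it only reads TMPDIR and tries to create a temporary file there. The names dir_is_writable and check_tmpdir are hypothetical, and whether an unset TMPDIR is actually a problem depends on what the scripts fall back to, which I have not checked.

```python
import os
import tempfile


def dir_is_writable(directory):
    """Return True if a temporary file can be created (and removed) in directory."""
    try:
        with tempfile.NamedTemporaryFile(dir=directory):
            pass
    except OSError:
        return False
    return True


def check_tmpdir(env=None):
    """Return None if TMPDIR looks usable, otherwise a short description of the problem."""
    env = os.environ if env is None else env
    tmpdir = env.get("TMPDIR")
    if not tmpdir:
        return "TMPDIR is not set"
    if not os.path.isdir(tmpdir):
        return "TMPDIR does not point to an existing directory"
    if not dir_is_writable(tmpdir):
        return "TMPDIR points to a directory that is not writable"
    return None


if __name__ == "__main__":
    problem = check_tmpdir()
    print(problem if problem else "TMPDIR is set and writable.")
```

It needs to be run as the same user, and with the same environment, that htdig runs under, otherwise the result says little.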
>
> >Hope that helps.
> >
> >--
> >David Adams
> >Computing Services
> >Southampton University
> >
> >
----------------------
David Adams
[EMAIL PROTECTED]