On Tue, 19 Jun 2001 14:51:15 +0100 Marcus Valentine 
<[EMAIL PROTECTED]> wrote:

> At 12:37 19/06/01 +0100, David Adams wrote:
> >If you are using version 3.0 of doc2html, then script doc2html.pl calls a
> >separate script, pdf2html.pl, to do the conversion.  This script in turn
> >calls pdfinfo and pdftotext from the xpdf package.
> >
> >1)    Try using both pdfinfo and pdftotext from the command line.
> >        If they don't work then either you hav'nt installed them properly or
> >there is
> >        something wrong with your .PDF file.
> 
> They work
> 
> >2)    Call pdf2html.pl from the command line:
> >
> >        <full path name>pdf2html.pl  <pathname to .PDF file>
> >
> >    to check that it is producing HTML output.
> 
> It does
> 
> >3)    If it does not work then check your installation of pdftotext, and
> >your tailoring of pdf2html.pl.
> >
> >4)    Else if it does work then try doc2html.pl from the command line:
> >
> >        <full path name>doc2html.pl <pathname to .PDF file> application/pdf
> >Name
> 
> It works from the command line - I had to but e:\cygwin\bin back into the
> path because something (what?) needs gzip.  The doc2html log file contains
> 
> ? [application/pdf] PDF (pdf2html) 15680

I don't know why gzip is needed, but if it is then you must 
see that its directory is included in PATH before invoking 
htdig.

> 
> But it still doesn't work when called by htdig. Furthermore - the log file
> isn't created.  Does this mean doc2html isn't being called properly?

It does.  I presume htdig is indexing HTML files OK?

The first thing to check is your htdig configuration file.
Look at the external_parsers attribute in particular, it 
should be something like:

external_parsers: application/pdf->text/html \
  <full path name>doc2html.pl


The only other thing I can think of is the environmental 
variable TMPDIR, set before htdig is invoked, which must 
point to a directory where htdig can write files.

> 
> >Hope that helps.
> >
> >--
> >David Adams
> >Computing Services
> >Southampton University
> >
> >

----------------------
David Adams
[EMAIL PROTECTED]


_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to