Gilles R. Detillieux wrote:
>
> According to gkalter:
> > Hope this mailing-list is the right one..;-)
> >
> > Today I got htdig to work pretty well on a site containing many
> > PDF-Files.
> >
> > • Cobalt Raq2 micorserver (mips) with RedHat based Linux
> >
> > After updating the C++ Compiler (see mailing list) I got rid of the
> > segmenatition
> > error messages and htdig worked well.
> >
> > Cryptic outputs of the search form were solved by adding a ".cgi"
> > extension to htsearch
> > in the local cgi-bin folder. Solution also found in the list - thanks to
> > all those helpful people!
>
> I think the FAQ also has some pointers on getting the CGI to work.
>
> > Because I wanted to get direct links to single PDF Pages out of the
> > found excerpts I got
> > the pdftodig.py script for external parsing of PDF-Files. (Do I have to
> > mention that python
> > IS NOT installed on Cobalt Raqs?) O.K. this problem could also be
> > solved.
>
> It would also be a fairly trivial change to the perl scripts conv_doc.pl
> or doc2html.pl to make it replace form feeds in pdftotext output with
> the correct HTML <a name="..."> tags for the anchors. You'd then be
> using an external converter, rather than an external parser, and possibly
> avoiding parser-related problems.
>
> > Now everything works pretty good with one little exception.
> >
> > Using a complete search string e.g. "Sensor" lists all matching
> > documents and the text contains
> > the search word (bold typeface) with a link to the specific single Page
> > of the found PDF file.
> > (Great!)
I think I may be missing something here, perhaps somebody can explain
for me. Am I right in thinking that the whole and only point of this is
to produce, in the lists produced by htsearch, excerpts from the first
page of .PDF documents containing a search word?
Or does one really get a link which when followed brings up the .PDF
document open at the relevant page? If so, that would be quite something,
especially if it worked for a range of browsers. What would be the correct
HTML <a name="..."> tags for the anchors?
--
David Adams
<[EMAIL PROTECTED]>
Computing Services
University of Southampton
------------------------------------
To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.
List archives: <http://www.htdig.org/mail/menu.html>
FAQ: <http://www.htdig.org/FAQ.html>