Dump parse_doc.pl, and use the external converter
doc2html.pl instead.
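For reference, an external converter is hooked up through the external_parsers attribute in htdig.conf. A minimal sketch, assuming ht://Dig 3.1.4 or later and that doc2html.pl is installed in /usr/local/bin (the path is an assumption, adjust it for your install):

```
# htdig.conf fragment; the script path below is an assumption
# Hand PDF files to doc2html.pl, which returns HTML for htdig to index
external_parsers: application/pdf->text/html /usr/local/bin/doc2html.pl
```

Because the converter hands htdig HTML instead of pre-parsed word records, the title in the converted page should go through htdig's normal HTML parsing. Whether the Acrobat keywords come through depends on what the converter actually emits, so check its output on one of your files.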
-- 
David Adams
Computing Services
Southampton University
----- Original Message -----
Sent: Wednesday, February 28, 2001 3:39 AM
Subject: [htdig] PDF and using PDF keywords and/or titles
I hope someone can help. I have been searching for an answer to this one.
I have installed htdig and got the PDF parser 'parse_doc.pl' working just fine. But I would also like to add the Acrobat "Title" to the word list, and to have the actual keywords in the Acrobat document added to the word list as well.
I see that parse_doc.pl does pick up the title, which htdig uses as the link when it lists the PDF files it found. But the words in the PDF title are never added to the word list.
Has anyone added the title, and especially the Acrobat keywords, to the word list?
What we are trying to do is use the keywords in the Acrobat file to organize documents and to build canned searches for users, based on special keywords we will add to the PDFs. Right now the only workaround we have is to put our special keywords on a page at the end of the Acrobat file or in the footer.
Any ideas?
Thanks!
Michael