Hi,

it seems there is something wrong with pdf indexing on my machine:

pdf documents get indexed by htdig (at least it lists them in -v mode with 
filesize and without protest) but never show up in the search results.

Are there any settings I could have forgot or any required parameters for 
htsearch?

:-) Dennis

I'm using:

htdig 3.1.5 on Suse Linux 7.3, Arcobat Reader 4

with this config:

database_dir:           /opt/www/htdig/db
start_url:                      http://akropolis/inbas/htdocs/index.html
limit_urls_to:          http://akropolis/inbas/
exclude_urls:           /cgi-bin/ .cgi template=  inka_original
bad_extensions:         .php .css .inc .wav .gz .z .sit .au .zip .tar .hqx .exe 
.com .gif \
                        .jpg .jpeg .aiff .class .map .ram .tgz .bin .rpm .mpg .mov .avi
maintainer:             [EMAIL PROTECTED]
max_head_length:        10000
max_doc_size:           10000000
excerpt_length:         200
no_excerpt_show_top:    true
search_algorith:        exact:1 endings:0.5 substring: 0.5
lang_dir:               ${common_dir}/german
bad_word_list:          ${lang_dir}/bad_words
endings_affix_file:     ${lang_dir}/german.aff
endings_dictionary:     ${lang_dir}/german.0
endings_root2word_db:   ${lang_dir}/root2word.db
endings_word2root_db:   ${lang_dir}/word2root.db
locale: de_DE
keyword_meta_tag_names: keywords description
pdf_parser: /usr/local/Acrobat4/bin/acroread -toPostScript
template_map:   Long long ${common_dir}/long.html \
                Short short ${common_dir}/short.html \
                php php /httpd/htdocs/inbas/include/htdigout.html
 template_name: long
-----  Marlis & Dennis Merbach  -----
           Diplombiologen
        http://www.biopry.de
     http://www.webkonzepte.de
-------------------------------------


_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to