Hello,

I downloaded and installed Htdig on a Web server that hosts French
documents. The server is a Compaq Proliant 200 running Red Hat Linux 5.0
(kernel 2.0.33) and Apache 1.2.5.

It works, but I have a problem with accented characters. I can retrieve
a word like "académie" from some HTML pages, but not from all the pages
that contain "académie". And if I search for "acad", I get the pages that
contain the word "académies", because "acad" is present as a word in the
db.wordlist file.

I suppose that when Htdig sees "académie", it detects two words, "acad"
and "mie", because of the character 'é' (or its HTML entity form), but in
other pages it also detects the single word "académie"!?

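To illustrate the behaviour suspected above, here is a minimal Python
sketch. It is not Htdig's actual code: the function names and both regular
expressions are assumptions made for illustration only. It compares a word
splitter that accepts only ASCII letters and digits with one that also
accepts accented letters and decodes HTML entities first.

```python
import html
import re

# Hypothetical splitter 1: only ASCII letters and digits are word characters,
# so any accented letter acts as a separator.
ASCII_WORD = re.compile(r"[A-Za-z0-9]+")

# Hypothetical splitter 2: any letter or digit (accented ones included)
# is a word character; the underscore is excluded.
ACCENT_AWARE_WORD = re.compile(r"[^\W_]+")


def split_ascii(text):
    """Split text into lowercase words, treating non-ASCII letters as separators."""
    return [w.lower() for w in ASCII_WORD.findall(text)]


def split_accent_aware(text):
    """Decode HTML entities, then split into lowercase words keeping accents."""
    return [w.lower() for w in ACCENT_AWARE_WORD.findall(html.unescape(text))]


print(split_ascii("académie"))
print(split_accent_aware("acad&eacute;mie"))
```

With the first splitter, "académie" falls apart into "acad" and "mie",
which would match the entries seen in db.wordlist. With the second, the
word stays whole, whether it is written with the raw character or as an
entity.
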
I saw on the mailing list that other people have the same problem. I
changed my htdig configuration file (see below) and added some directives
recommended by different people, such as locale fr,
locale fr_FR.ISO_8859-1, and valid_punctuation.

But it still doesn't work correctly.

HTDIG.CONF
bad_word_list:          ${common_dir}/mots_exclus
locale: fr_FR.ISO_8859-1
iso_8601: true
valid_punctuation: "()!?,
search_algorithm:       exact:1 synonyms:0.5 endings:0.1
# Affix rules file
endings_affix_file ${common_dir}/francais.aff
# Dictionary file
endings_dictionary ${common_dir}/francais.0
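
A quick way to check whether a locale name like the one above is known to
the system at all is sketched below in Python. It assumes (not verified
here) that the locale value is handed to the C library's setlocale(); the
helper name locale_available is hypothetical.

```python
import locale


def locale_available(name):
    """Return True if the C library accepts this locale name for LC_CTYPE."""
    saved = locale.setlocale(locale.LC_CTYPE)  # query the current setting
    try:
        locale.setlocale(locale.LC_CTYPE, name)
    except locale.Error:
        # The system does not know this locale name.
        return False
    # Restore the previous setting so the check has no side effect.
    locale.setlocale(locale.LC_CTYPE, saved)
    return True


for candidate in ("fr", "fr_FR", "fr_FR.ISO_8859-1", "fr_FR.ISO-8859-1"):
    print(candidate, locale_available(candidate))
```

If every candidate is reported as unavailable, the locale setting can have
no effect on how accented letters are classified, whatever name is put in
the configuration file.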

Any ideas?

Thanks.
Andre.
