Hello people,
I am setting up htdig for a project here and am trying to index
German umlauts properly.
What I've done so far: following http://www.htdig.org/FAQ.html#q4.10
I made the following entries in htdig.conf:
# ---------------- begin germanization -------------------------
locale: de_DE
lang_dir: /usr/local/htdig/german
#bad_word_list: ${lang_dir}/bad_words
endings_affix_file: ${lang_dir}/german.aff
endings_dictionary: ${lang_dir}/zusammen.txt.sq
#endings_root2word_db: ${lang_dir}/root2word.db
#endings_word2root_db: ${lang_dir}/word2root.db
# ---------------- end germanization ---------------------------
I took the dictionary and affix files from
http://fmg-www.cs.ucla.edu/geoff/ispell-dictionaries.html
namely, I tried Heinz Knutzen's and also Martin Schulz's (the first and
third in the list there). I just copied the dictionary and .aff files
into ${lang_dir}, then ran the "rundig" script and tried a search,
but got no results.
The point is that I don't know whether I should process those dictionary
files first (use the hashed version?) or use them as they are. I also
tried "locale: de_DE.ISO_8859-1" and "locale: de" in htdig.conf.
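
To narrow down the locale question, the three names tried above can be
checked against the C library directly. This is only a rough sketch in
Python, not part of htdig: it assumes that htdig passes the "locale"
value on to setlocale(), so a name the system rejects would have no
effect. The helper name locale_is_available is made up for illustration.

```python
import locale


def locale_is_available(name):
    """Return True if the C library's setlocale() accepts this name."""
    # Calling setlocale() without a value only queries the current setting.
    saved = locale.setlocale(locale.LC_CTYPE)
    try:
        locale.setlocale(locale.LC_CTYPE, name)
        return True
    except locale.Error:
        return False
    finally:
        # Put the original setting back so the check has no side effects.
        locale.setlocale(locale.LC_CTYPE, saved)


# The three locale names tried in htdig.conf above.
for candidate in ("de_DE", "de_DE.ISO_8859-1", "de"):
    print(candidate, locale_is_available(candidate))
```

The accepted names differ from system to system, so the output is only
meaningful on the machine that runs htdig.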
Final notes: I am running FreeBSD 4.2 with htdig 3.2.0b3. Maybe I should
use the stable htdig release instead?
TIA
antonello