>I'm in the exact same situation as Gabriele but with the French locale (my
>libc seems too old). What I've understood with htdig doc is if the locale
>is set correctly the a search for the accentuated letters should return
>them if they are in the database.
That's one point ... I looked at my /usr/lib dir but didn't find any locale
file.
By performing a 'find / -name locale -print' command, that's what I am given:
...
/usr/bin/locale
/usr/share/locale (directory)
/usr/share/groff/tmac/mm/locale
...
I also had a look at the man page for the setlocale function, and it says
that I can also use the "ISO-8859-1" locale.
That's the extract of the man page:
Linux supports the portable locales "C" and "POSIX" and
also the European Latin-1 "ISO-8859-1" , and Russian
"KOI-8" locales.
I tried to specify "locale: ISO-8859-1" in the configuration file, but by
running htdig in the verbose mode I obtain the message I am hating the most
at these days: "Unknown locale !!!" ...
How does locale work? That's my question ... do I need to install a
specific locale for Italy? If yes ... where can I find it?
>however what we need in France, and probably in Italy and Spain too, among
>others, is more complicated: it is a way to make accentuated (?)
>characters treated by htsearch just like the same letter without the
>accent, and dealing correctly with the uppercase/lowercase.
>the database should store characters exactly as they are in the html
>documents, but the search function must return OK in these cases:
>
> * the user typed the word without any accent
> * the user typed the word with some or all accents (if all then
>the user typed the word exactly as it is stored in the database)
> * the user typed the word with bad accents (yes)
> * the user typed the word mixing lower and upper, eventually
>accentuated, letters.
> * ... (Did I forget something ?)
And that's our point. For example in Italy we have the word "caff�". But
many people can write it also "caffe'". And it could be searched through
this query:
- caff�
- caff�
- caffe'
- caffe (not very correct ...)
Unfortunately, now, if I search for "perch�" (the site is full of this
word, that means "because" and "why" at the same time, and it's not in the
bad_words) I obtain a no match result. So far, I have always skipped this
problem ... but now, as well as the search engine is used every day more, I
want to resolve it, if possible ...
>I think Loic and Marcel know what I mean better than me since they know
>the htdig code very well.
I think who better than them may know?
Ciao and thanks
-Gabriele
-------------------------------------------------
Gabriele Bartolini
Computer Programmer (are U sure?)
U.O. Rete Civica - Comune di Prato
Prato - Italia - Europa
e-mail: [EMAIL PROTECTED]
http://www.po-net.prato.it
"Life teaches you never stop learning ..."
-------------------------------------------------
------------------------------------
To unsubscribe from the htdig3-dev mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.