On Tue, 14 Dec 1999, Gabriele Bartolini wrote:
> How can I make it index them (cos, so far, I have made it skip them). In
> fact, still now, I cannot set right the locale for Italy ... But is there a
> generic setting for characters, like iso-8859-1 (italian's one belongs to
> this) which I can set?
I'm in the exact same situation as Gabriele but with the French locale (my
libc seems too old). What I've understood with htdig doc is if the locale
is set correctly the a search for the accentuated letters should return
them if they are in the database.
however what we need in France, and probably in Italy and Spain too, among
others, is more complicated: it is a way to make accentuated (?)
characters treated by htsearch just like the same letter without the
accent, and dealing correctly with the uppercase/lowercase.
the database should store characters exactly as they are in the html
documents, but the search function must return OK in these cases:
* the user typed the word without any accent
* the user typed the word with some or all accents (if all then
the user typed the word exactly as it is stored in the database)
* the user typed the word with bad accents (yes)
* the user typed the word mixing lower and upper, eventually
accentuated, letters.
* ... (Did I forget something ?)
I think Loic and Marcel know what I mean better than me since they know
the htdig code very well.
htdig is good, it is now the search engine on my University's web server,
but it still lacks these features to be VERY good.
bye,
Jerome
------------------------------------
To unsubscribe from the htdig3-dev mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.