Jacques Reynes wrote:
> But I have a second problem: in France, users may have one of two kinds of
> keyboard: QWERTY or AZERTY.
> With an AZERTY keyboard there is no problem: they can type and query
> text with French accents.
> With a QWERTY keyboard, it is nearly impossible to query text
> containing French accents, so they often
> search for the word without the accent.
> Example: to search for étudiant they type etudiant, and the answer is that this word
> is not in the server.
>
> The idea I had is to use synonyms:
> synonym étudiant etudiant
>
> but now I'm unable to find étudiants, étudiante, ...
The thing I am trying to do is to convert every ISO-encoded
character to its HTML equivalent (since the file I will be searching
will be in HTML format anyway) so that étudiant becomes &eacute;tudiant.
There is a little C program called iso2html that will do that.
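
To make the conversion concrete, here is a minimal Python sketch of the idea.
It is not the iso2html program itself; the function name iso_to_html is made
up for illustration, and it assumes the input has already been decoded from
ISO-8859-1 into a Python string.

```python
from html.entities import codepoint2name


def iso_to_html(text: str) -> str:
    """Replace each Latin-1 character in 0xA0-0xFF with its named HTML entity.

    Hypothetical helper, not part of ht://Dig or iso2html.
    """
    out = []
    for ch in text:
        code = ord(ch)
        if 0xA0 <= code <= 0xFF and code in codepoint2name:
            # e.g. 0xE9 maps to the entity name "eacute"
            out.append("&" + codepoint2name[code] + ";")
        else:
            # Plain ASCII (and anything without a named entity) passes through.
            out.append(ch)
    return "".join(out)
```

With this, iso_to_html("étudiant") gives "&eacute;tudiant", while an
unaccented word such as "etudiant" comes back unchanged.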
What I don't know is where to do the conversion:
Solution 1: Hack the source directly, i.e. insert a function in
ParsedString.cc (is that the right place?) to do the job, but
I'll have to convert iso2html to C++, which is no
picnic ; )
Solution 2: Write a CGI script that does the conversion
on the query string and then calls htsearch. I haven't figured
out yet how to do that, that is, how to call one CGI from inside another.
I am currently using SIOD as a scripting language, but
if anybody knows how to do that in Perl or any other
language, that would be fine for me.
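
One possible shape for Solution 2, sketched in Python rather than SIOD or
Perl: a wrapper CGI rewrites QUERY_STRING and then runs htsearch as a child
process with the modified environment. Several things here are assumptions:
the HTSEARCH path, the helper names, that the form uses GET, and that the
browser sends the query as ISO-8859-1. Whether htsearch then matches the
entity form depends on how the index was built, which this sketch does not
address.

```python
import os
import subprocess
import sys
from html.entities import codepoint2name
from urllib.parse import parse_qsl, urlencode

# Assumption: location of the htsearch binary; adjust for the local install.
HTSEARCH = "/usr/local/apache/cgi-bin/htsearch"


def iso_to_html(text):
    """Replace Latin-1 characters 0xA0-0xFF with named HTML entities."""
    return "".join(
        "&%s;" % codepoint2name[ord(ch)] if 0xA0 <= ord(ch) <= 0xFF else ch
        for ch in text
    )


def rewrite_query_string(query_string):
    """Convert only the `words` field and leave every other field as sent."""
    pairs = parse_qsl(query_string, keep_blank_values=True, encoding="latin-1")
    fixed = [(k, iso_to_html(v) if k == "words" else v) for k, v in pairs]
    return urlencode(fixed, encoding="latin-1")


def main():
    # Hand htsearch the same CGI environment, with the query rewritten.
    env = dict(os.environ)
    env["REQUEST_METHOD"] = "GET"
    env["QUERY_STRING"] = rewrite_query_string(os.environ.get("QUERY_STRING", ""))
    result = subprocess.run([HTSEARCH], env=env, stdout=subprocess.PIPE)
    # The child is itself a CGI program, so its output (headers included)
    # is passed through untouched.
    sys.stdout.buffer.write(result.stdout)
    sys.stdout.flush()


# GATEWAY_INTERFACE is set by the web server, so main() only runs under CGI.
if __name__ == "__main__" and "GATEWAY_INTERFACE" in os.environ:
    main()
```

The search form would then point at this wrapper instead of at htsearch
directly. A form that uses POST would need the wrapper to read the request
body from standard input instead of QUERY_STRING.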
Thanks Anyway...
Until next time...
[EMAIL PROTECTED]