Jacques Reynes wrote:

> But I have a second problem: In France, users could have two sorts of
> keybords: QWERTY or AZERTY.
> If they use the AZERTY one, there is no problem they can type and query
> texts with french accent
> If they use an QWERTY keybord, it is quite impossible to query a text
> including french accent. So often
> they ask a string without accent.
> Example: To ask for �tudiant they type etudiant and the answer is this word
> is not in the server.
> 
> The idea I had is to use synonyms:
> synonym �tudiant etudiant
> 
> but now I'm unable to find �tudiants, �tudiante, ...

        The thing I am trying  to do is to convert every iso-encoded 
character to it's HTML equivalent (since the file I will be searching
will be in HTML format anyway) so that �tudiant becomes étudiant.
There is a little C program called iso2html that will do that. 

        What I don't know is where to do the conversion: 

        Solution 1: Hack the source directly i.e insert a fonction 
        ParsedString.cc (is it the good place?) to do the job but
        I'll have to convert iso2html to C++ which is not a
        pic-nic ; )

        Solution 2: Do a CGI script that will do the conversion 
        to the query string and then call htsearch. I don't figure
        yet how to do that, that is : call a CGI inside another.
        I am currently using SIOD as a scripting language, but
        if anybody know how to do that in Perl or any other 
        language, that would be fine for me.

 Thanks Anyway...
 À la prochaine...                 
 
[EMAIL PROTECTED]
----------------------------------------------------------------------
To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED] containing the single word "unsubscribe" in
the body of the message.

Reply via email to