Hi, all.
(3.1.6 on RedHat 7.2)

Now I kwow to perform prefix search I found a problem with accents for a 
spanish site, that make all methods incompatible.

Say I search for the word 'neurop�ptido' (if you see a strange letter 
between 'neurop' and 'ptido' you should interpret it as an 'e' with an 
acute tilde above).

As the database is fuzzy-indexed with accents and endings, and the words in 
the original documents are accentuated (tilded, have the written accent 
on), htsearch internally looks up for the following words:

neuropeptido
neurop�ptido
neurop�ptido (uppercase E with acute tilde above)
neuropeptidos
neurop�ptidos
neurop�ptidos (uppercase E with acute tilde above)

and I get the expected results. I get the same results if I search for 
'neuropeptido', 'neurop�ptidos' or 'neuropeptidos'.

But if I search (prefix search) for the word 'neurop�p*',  the uppercase 
accented versions ('neurop�ptido' and 'neurop�ptidos') are not looked up in 
the index and I get less matches.

Finally, if I search for the word 'neuropep*', only nonaccentuated versions 
are looked up in the index, and I get no matches at all because 
non-accentuated versions of these words are not in any document.

Due to the nature of the site, most of the words searched by visitors (as 
neurop�ptido) are not included in the spanish standard package. I supose it 
is an addiotional difficulty.

Anyone can help?
Thanks in advance.




-------------------------------------------------------
This sf.net email is sponsored by: OSDN - Tired of that same old
cell phone?  Get a new here for FREE!
https://www.inphonic.com/r.asp?r=sourceforge1&refcode1=vs3390
_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to