Hi, all.
(3.1.6 on RedHat 7.2)
Now I kwow to perform prefix search I found a problem with accents for a
spanish site, that make all methods incompatible.
Say I search for the word 'neurop�ptido' (if you see a strange letter
between 'neurop' and 'ptido' you should interpret it as an 'e' with an
acute tilde above).
As the database is fuzzy-indexed with accents and endings, and the words in
the original documents are accentuated (tilded, have the written accent
on), htsearch internally looks up for the following words:
neuropeptido
neurop�ptido
neurop�ptido (uppercase E with acute tilde above)
neuropeptidos
neurop�ptidos
neurop�ptidos (uppercase E with acute tilde above)
and I get the expected results. I get the same results if I search for
'neuropeptido', 'neurop�ptidos' or 'neuropeptidos'.
But if I search (prefix search) for the word 'neurop�p*', the uppercase
accented versions ('neurop�ptido' and 'neurop�ptidos') are not looked up in
the index and I get less matches.
Finally, if I search for the word 'neuropep*', only nonaccentuated versions
are looked up in the index, and I get no matches at all because
non-accentuated versions of these words are not in any document.
Due to the nature of the site, most of the words searched by visitors (as
neurop�ptido) are not included in the spanish standard package. I supose it
is an addiotional difficulty.
Anyone can help?
Thanks in advance.
-------------------------------------------------------
This sf.net email is sponsored by: OSDN - Tired of that same old
cell phone? Get a new here for FREE!
https://www.inphonic.com/r.asp?r=sourceforge1&refcode1=vs3390
_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html