: Thanks for the reply, but I don't know how to make this change in
: ISOLatin1AccentFilter.
: Can you give me some advice on how to do it?
I've never really looked at the internals of ISOLatin1AccentFilter, but
the basic idea is to subclass it with a new TokenFilter that maintains a
one token buffer
My own solution, until I find a better one, is to use a FuzzyQuery for every
term in the phrase.
For example: My work is the worst -> My~ work~ is~ the~ worst
What do you think about this ugly solution? I don't have any more
ideas.
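
For reference, the rewrite described above can be sketched in plain Java.
This is only an illustration of the string transformation: the class and
method names (FuzzyPhrase, toFuzzyQuery) are made up for this example, and
it assumes the resulting string is later handed to a query parser that
understands the trailing "~" fuzzy syntax, which is not shown here.

```java
import java.util.StringJoiner;

// Hypothetical helper, not part of Lucene: turns a phrase into a
// query string in which every term carries the fuzzy marker "~".
class FuzzyPhrase {

    static String toFuzzyQuery(String phrase) {
        StringJoiner joined = new StringJoiner(" ");
        // Split on runs of whitespace so repeated spaces do not
        // produce empty terms in the middle of the phrase.
        for (String term : phrase.trim().split("\\s+")) {
            if (!term.isEmpty()) {
                joined.add(term + "~");
            }
        }
        return joined.toString();
    }

    public static void main(String[] args) {
        // prints "My~ work~ is~ the~ worst"
        System.out.println(toFuzzyQuery("My work is the worst"));
    }
}
```

Note that this turns an exact phrase into a set of independent fuzzy terms,
so word order and adjacency are no longer enforced.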
2006/5/24, Dan Wiggin [EMAIL PROTECTED]:
I need some
I think I'm missing something here. The whole point of the
ISOLatin1AccentFilter is to replace accented characters with their
unaccented equivalent -- it sounds like that's working just fine. If you
want the words in the term vector to contain the accents, why don't you
stop using that filter?
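
To make the folding concrete, here is a rough sketch of the same idea using
only the JDK's java.text.Normalizer. It is an approximation for
illustration, not how ISOLatin1AccentFilter is implemented, and its output
may differ from the filter's for some characters. The names AccentFolding
and stripAccents are invented for this example.

```java
import java.text.Normalizer;

// Hypothetical stand-alone illustration; it does not use Lucene.
class AccentFolding {

    static String stripAccents(String text) {
        // NFD splits an accented letter into its base letter followed
        // by one or more combining marks (e.g. n + combining tilde).
        String decomposed = Normalizer.normalize(text, Normalizer.Form.NFD);
        // Remove the combining marks, keeping only the base letters.
        return decomposed.replaceAll("\\p{M}+", "");
    }

    public static void main(String[] args) {
        // The escapes are a-grave, a-diaeresis, c-cedilla and n-tilde,
        // written this way to avoid source-encoding problems.
        // prints "a a c n"
        System.out.println(stripAccents("\u00e0 \u00e4 \u00e7 \u00f1"));
    }
}
```

Once the terms have been folded like this at index time, the accented form
can no longer be recovered from the index, which is why the stored terms
come back without their accents.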
I need some functionality and I don't know how to implement it.
The problem is special characters such as à, ä, ç or ñ (Latin characters) in the
text.
Right now I use the ISO Latin filter, but the problem comes when I want to obtain the most
used terms. These terms are stored without ` ´ ^ or any other accent
mark.
For