Re: Question about special characters

2006-05-26 Thread Chris Hostetter
: Thks for the reply, ut I don't know how to do this change in : SOLatin1AccentFilter. : Can you give me some advice in this action? I've never really looked at the internals of ISOLatin1AccentFilter, but the basic idea is to subclass it with a new TokenFilter that maintains a one token buffer

Re: Question about special characters

2006-05-25 Thread Dan Wiggin
My own solution until I have another one better, I use FuzzyQuery for every term in the phrase. For example My work is the worst - My~ work~ is~ the~ worst What do you think about this uggly solution? I don't have anything more ideas. 2006/5/24, Dan Wiggin [EMAIL PROTECTED]: I need some

Re: Question about special characters

2006-05-25 Thread Chris Hostetter
I think I'm missing something here. the whole point of the ISOLatin1AccentFilter is to replace accented characters with their unaccented equivalent -- it sounds like that's working just fine, If you want teh words in teh term vector to contain the accents, why don't you stop using that filter?

Question about special characters

2006-05-24 Thread Dan Wiggin
I need some functionality and I don't know how to do. The problem is special characters like à, ä , ç or ñ latin characters in the text. Now I use iso latin filter, but the problem is when I want to obtain most term used. These term are stored without ` ´ ^ or another character attribute. For