Hi,

> Did you take a look at IsoLatin1AccentFilter?

It does nearly what I need, but not quite.

    public final Token next() throws java.io.IOException {
        final Token t = input.next();
        if (t == null)
            return null;
        return new Token(removeAccents(t.termText()), t.startOffset(), t.endOffset(), t.type());
    }

A new Token is created here as well. My question is: why is the end offset not adjusted for the newly created token? Sometimes the new term text is longer than the original.

Link to the complete code:
http://developer.spikesource.com/spikewatch.logs/fedora-3-i386/2221/lucene/reports/clover/org/apache/lucene/analysis/ISOLatin1AccentFilter.html
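
To illustrate the case I mean, here is a small self-contained sketch with no Lucene dependency. SimpleToken and fold() are hypothetical stand-ins I made up for Token and removeAccents(); fold() handles only a handful of characters and is not the real implementation. It only shows that the folded text can become longer while the copied offsets stay the same.

```java
public class OffsetDemo {

    /** Hypothetical stand-in for Token: term text plus start/end offsets. */
    static final class SimpleToken {
        final String text;
        final int start;
        final int end;

        SimpleToken(String text, int start, int end) {
            this.text = text;
            this.start = start;
            this.end = end;
        }
    }

    /** Simplified stand-in for removeAccents(); covers only a few characters. */
    static String fold(String s) {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < s.length(); i++) {
            char c = s.charAt(i);
            switch (c) {
                case '\u00C6': sb.append("AE"); break; // AE ligature, upper case
                case '\u00E6': sb.append("ae"); break; // ae ligature, lower case
                case '\u00DF': sb.append("ss"); break; // sharp s
                case '\u00E9': sb.append('e');  break; // e with acute accent
                default:       sb.append(c);
            }
        }
        return sb.toString();
    }

    /** Mirrors the filter's next(): new term text, offsets copied unchanged. */
    static SimpleToken foldToken(SimpleToken t) {
        return new SimpleToken(fold(t.text), t.start, t.end);
    }

    public static void main(String[] args) {
        String input = "Stra\u00DFe"; // 6 characters in the input
        SimpleToken original = new SimpleToken(input, 0, input.length());
        SimpleToken folded = foldToken(original);
        // prints: Strasse length=7 offsets=0-6
        System.out.println(folded.text + " length=" + folded.text.length()
                + " offsets=" + folded.start + "-" + folded.end);
    }
}
```

So the folded term has 7 characters, but the token still reports offsets 0-6.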