I think you should write your own Analyzer and use:
* StandardTokenizer for tokenization and ACRONYM detection.
* StopFilter for stopwrods handling.
The Analyzer you write should override tokenStream() and do something like:
TokenStream
Hi,
If i want Lucene to preserve dots of acronyms(example: U.K,U.S.A. etc),
which analyzer do i need to use and how? I also want to input a set of stop
words to Lucene while doing this.
--
View this message in context:
http://www.nabble.com/Preserving-dots-of-an-acronym-while-indexing-in-Lucen