Hi,

Thank you very much for your quick responses.

Jack Krupansky,

The main use case is searching in file names, for example lucene.txt, lucene_new.txt, and lucene_1_new.txt. If I search for 'lucene', I need to get all 3 files; with 'new', I need to get the last two files. Please note that the standard analyzer/tokenizer of Lucene 3.6 is not giving us these results; it does not appear to split on "." and "_". Are you referring to versions later than 3.6?

Ahmet,

1. I am not sure LetterTokenizer helps with the above example, since the file names contain both numbers and letters.
2. WordDelimiterFilter does not seem to be in Lucene 3.6.
3. MappingCharFilter is what I am already using, by overriding the initReader method in my CustomAnalyzer (source copied from StandardAnalyzer, which is a final class). Is this a good way to reuse the final class StandardAnalyzer with some custom changes? Or is composition better?

Thank you again,
Best Regards

On Tue, Apr 12, 2016 at 8:45 PM, Jack Krupansky <jack.krupan...@gmail.com> wrote:

> The standard analyzer/tokenizer should do a decent job of splitting on dot,
> hyphen, and underscore, in addition to whitespace and other punctuation.
>
> Can you post some specific test cases you are concerned with? (You should
> always run some test cases.)
>
> -- Jack Krupansky
>
> On Tue, Apr 12, 2016 at 10:35 AM, Ahmet Arslan <iori...@yahoo.com.invalid>
> wrote:
>
> > Hi Chamarty,
> >
> > Well, there are a lot of options here.
> >
> > 1) Use LetterTokenizer
> > 2) Use WordDelimeterFilter combined with WhiteSpaceTokenizer
> > 3) Use MappingCharFilter to replace those characters with spaces
> > .
> > .
> > .
> >
> > Ahmet
> >
> >
> > On Tuesday, April 12, 2016 3:58 PM, PrasannaKumar Chamarty <
> > tech.kumar...@gmail.com> wrote:
> >
> > Hi,
> >
> > What is the best way (in terms of maintenance required with new lucene
> > releases) to allow splitting of words on "." and "_" for indexing ? Thank
> > you.
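
To make the expected behaviour in the file-name use case concrete, here is a minimal plain-Java sketch of the matching described above. It does not use Lucene at all: the class FileNameTokens and its tokenize and search methods are hypothetical names made up for illustration, and the rule "lowercase, then split on any run of characters that is not a letter or digit" is only an assumption about the desired result, not a description of what any Lucene analyzer does. It may be useful as a reference when writing the test cases requested in the thread.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Locale;

// Hypothetical helper, not part of Lucene: models the tokens we would
// like an analyzer to produce for file names.
class FileNameTokens {

    // Lowercase the name and split on every run of characters that is
    // neither a letter nor a digit, so '.', '_' and '-' all separate tokens.
    static List<String> tokenize(String fileName) {
        List<String> tokens = new ArrayList<String>();
        String lower = fileName.toLowerCase(Locale.ROOT);
        for (String part : lower.split("[^\\p{L}\\p{N}]+")) {
            if (!part.isEmpty()) { // split() can yield a leading empty string
                tokens.add(part);
            }
        }
        return tokens;
    }

    // Return the file names that contain the term as a whole token.
    static List<String> search(String term, List<String> fileNames) {
        String needle = term.toLowerCase(Locale.ROOT);
        List<String> hits = new ArrayList<String>();
        for (String name : fileNames) {
            if (tokenize(name).contains(needle)) {
                hits.add(name);
            }
        }
        return hits;
    }

    public static void main(String[] args) {
        List<String> files = Arrays.asList(
                "lucene.txt", "lucene_new.txt", "lucene_1_new.txt");
        System.out.println(search("lucene", files)); // all three files
        System.out.println(search("new", files));    // the last two files
    }
}
```

Whatever analyzer chain is chosen in the end could be checked against the same three file names and the same two queries.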