It looks to me like you simply want to split the incoming query by the hyphen, so that it searches for exact codes like this ³CHQ PAID² ³INWARD TRAN² ³HDFC LTD².
If that¹s true, I¹d either just change the query at the client to do what you want, or look into something like the PatternTokenizer: https://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.PatternTo kenizerFactory Apologies if I¹m not understanding your use case. Thanks, Jim On 2/2/15, 3:56 AM, "Nivedita" <nivedita.pa...@tcs.com> wrote: >Hi, > >I want to tokenize query like "CHQ PAID-INWARD TRAN-HDFC LTD" in such a >way >that it should give me result documnet containing HDFC LTD and not HDFC >MF. > >How can I do this. >I Have already applied below Tokenizers > > <fieldType name="text_general" class="solr.TextField" >positionIncrementGap="100"> > <analyzer type="index"> > <tokenizer class="solr.WhitespaceTokenizerFactory"/> > > <filter class="solr.StopFilterFactory" ignoreCase="true" >words="stopwords.txt" /> > > <filter class="solr.LowerCaseFilterFactory"/> > <filter class="solr.TrimFilterFactory" /> > <filter class="solr.RemoveDuplicatesTokenFilterFactory"/> > </analyzer> > <analyzer type="query"> > <tokenizer class="solr.StandardTokenizerFactory"/> > > <filter class="solr.WordDelimiterFilterFactory" > generateWordParts="1" >generateNumberParts="1" catenateWords="0" catenateNumbers="0" >catenateAll="0" splitOnCaseChange="1"/> > <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt" >ignoreCase="true" expand="true"/> > <filter class="solr.EdgeNGramFilterFactory" minGramSize="3" >maxGramSize="25" side="front"/> > <filter class="solr.LowerCaseFilterFactory"/> > <filter class="solr.StopFilterFactory" words="stopwords.txt" >ignoreCase="true"/> > <filter class="solr.TrimFilterFactory" /> > </analyzer> > </fieldType> > > >Please help. > > > >-- >View this message in context: >http://lucene.472066.n3.nabble.com/Solr-pattern-tokenizer-tp4183421.html >Sent from the Solr - User mailing list archive at Nabble.com.