Re: Solr pattern tokenizer

Jim . Musil Mon, 02 Feb 2015 08:35:32 -0800

It looks to me like you simply want to split the incoming query on the
hyphen, so that it searches for exact codes like this: "CHQ PAID", "INWARD
TRAN", "HDFC LTD".


If that's true, I'd either just change the query at the client to do what
you want, or look into something like the PatternTokenizer:

https://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.PatternTokenizerFactory
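
For illustration, here is a rough sketch of what the PatternTokenizer
option might look like. The field type name "text_hyphen_split" is made up,
and I am assuming the indexed values use the same hyphen-separated layout as
the query, so a single analyzer can serve both index and query time:

```xml
<!-- Hypothetical field type; the name "text_hyphen_split" is an assumption. -->
<fieldType name="text_hyphen_split" class="solr.TextField"
           positionIncrementGap="100">
  <!-- One analyzer for index and query, so both sides yield the same tokens. -->
  <analyzer>
    <!-- group="-1" treats the pattern as a delimiter: split on each hyphen. -->
    <tokenizer class="solr.PatternTokenizerFactory" pattern="-" group="-1"/>
    <!-- Strip stray whitespace around each piece, then normalize case. -->
    <filter class="solr.TrimFilterFactory"/>
    <filter class="solr.LowerCaseFilterFactory"/>
  </analyzer>
</fieldType>
```

With this, "CHQ PAID-INWARD TRAN-HDFC LTD" should come out as the three
tokens "chq paid", "inward tran" and "hdfc ltd", so "hdfc ltd" would not
match a document that only has "hdfc mf". Depending on the query parser, the
query may need to be sent as a quoted phrase so that it is not split on
whitespace before it reaches the tokenizer.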


Apologies if I'm not understanding your use case.

Thanks,
Jim

On 2/2/15, 3:56 AM, "Nivedita" <nivedita.pa...@tcs.com> wrote:

>Hi,
>
>I want to tokenize a query like "CHQ PAID-INWARD TRAN-HDFC LTD" in such a
>way that it returns documents containing HDFC LTD and not HDFC MF.
>
>How can I do this?
>I have already applied the tokenizers below:
>
> <fieldType name="text_general" class="solr.TextField"
>            positionIncrementGap="100">
>   <analyzer type="index">
>     <tokenizer class="solr.WhitespaceTokenizerFactory"/>
>     <filter class="solr.StopFilterFactory" ignoreCase="true"
>             words="stopwords.txt"/>
>     <filter class="solr.LowerCaseFilterFactory"/>
>     <filter class="solr.TrimFilterFactory"/>
>     <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
>   </analyzer>
>   <analyzer type="query">
>     <tokenizer class="solr.StandardTokenizerFactory"/>
>     <filter class="solr.WordDelimiterFilterFactory"
>             generateWordParts="1" generateNumberParts="1"
>             catenateWords="0" catenateNumbers="0"
>             catenateAll="0" splitOnCaseChange="1"/>
>     <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt"
>             ignoreCase="true" expand="true"/>
>     <filter class="solr.EdgeNGramFilterFactory" minGramSize="3"
>             maxGramSize="25" side="front"/>
>     <filter class="solr.LowerCaseFilterFactory"/>
>     <filter class="solr.StopFilterFactory" words="stopwords.txt"
>             ignoreCase="true"/>
>     <filter class="solr.TrimFilterFactory"/>
>   </analyzer>
> </fieldType>
>
>
>Please help.
>
>
>
>--
>View this message in context:
>http://lucene.472066.n3.nabble.com/Solr-pattern-tokenizer-tp4183421.html
>Sent from the Solr - User mailing list archive at Nabble.com.
