Re: Why I get a hit on %, &, but not on !, @, #, $, ^, *

Jack Krupansky Mon, 13 Jul 2015 20:13:37 -0700

Oops... that's the "types" attribute.

-- Jack Krupansky


On Mon, Jul 13, 2015 at 11:11 PM, Jack Krupansky <jack.krupan...@gmail.com>
wrote:

> The word delimiter filter is remmoving special characters. You can add a
> file containing a list of the special characters that you wish to treat as
> alpha, using the "type" parameter.
>
> -- Jack Krupansky
>
> On Mon, Jul 13, 2015 at 6:43 PM, Steven White <swhite4...@gmail.com>
> wrote:
>
>> Hi Everyone,
>>
>> I think the subject line said it all.  Here is the schema I'm using:
>>
>> <fieldType name="my_text" class="solr.TextField"
>> positionIncrementGap="100"
>> autoGeneratePhraseQueries="true">
>>   <analyzer>
>> <tokenizer class="solr.WhitespaceTokenizerFactory"/>
>> <filter class="solr.StopFilterFactory" ignoreCase="true"
>> words="lang/stopwords_en.txt"/>
>> <filter class="solr.WordDelimiterFilterFactory" generateWordParts="1"
>> generateNumberParts="1" catenateWords="1" catenateNumbers="1"
>> catenateAll="1" splitOnCaseChange="0" splitOnNumerics="1"
>> stemEnglishPossessive="1" preserveOriginal="1"/>
>> <filter class="solr.LowerCaseFilterFactory"/>
>> <filter class="solr.KeywordMarkerFilterFactory"
>> protected="protwords.txt"/>
>> <filter class="solr.PorterStemFilterFactory"/>
>> <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
>>   </analyzer>
>> </fieldType>
>>
>> I'm guessing this is due to how solr.WhitespaceTokenizerFactory works and
>> those that it is not indexing are removed because they are considered
>> "white-spaces"?  If so, how can I include %, &, etc. into this
>> none-indexed
>> list?  I would rather see all these not indexed vs some are and some are
>> not causing confusion to my users.
>>
>> Thanks
>>
>> Steve
>>
>
>

Re: Why I get a hit on %, &, but not on !, @, #, $, ^, *

Reply via email to