Hi all

Please take a look at this strange behavior (related to stemming, I
suppose):


type:

<fieldtype name="customTextField" class="solr.TextField" indexed="true" stored="false">
      <analyzer type="query">
        <tokenizer class="solr.StandardTokenizerFactory"/>
        <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt"/>
        <filter class="solr.EnglishPorterFilterFactory" protected="protwords.txt"/>
        <filter class="solr.LowerCaseFilterFactory"/>
      </analyzer>
      <analyzer type="index">
        <tokenizer class="solr.StandardTokenizerFactory"/>
        <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt"/>
        <filter class="solr.EnglishPorterFilterFactory" protected="protwords.txt"/>
        <filter class="solr.LowerCaseFilterFactory"/>
      </analyzer>
</fieldtype>

field:

<field name="name"  type="customTextField" indexed="true"  stored="false"/>



I'm adding a document:

<add><doc><field name="id">999999</field><field name="name">Apple</field></doc></add>

<commit/>


Querying "name:apple" returns 0 results. Searching for "name:Apple" returns
1 result. But "name:appl*" also returns 1 result.


Adding another document:

<add><doc><field name="id">88888</field><field name="name">Somenamele</field></doc></add>

<commit/>


Searching for "name:somenamele" returns 1 result, and "name:Somenamele" also
returns 1 result.


What is the problem with "Apple"? Maybe StandardTokenizer treats it as a
trademark :) ?
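
One thing I have not ruled out is the filter order. As far as I understand,
the Porter stemmer expects input that is already lowercase, and in the type
above it runs before LowerCaseFilterFactory, so "Apple" and "apple" might be
stemmed differently. Below is a variant I could test, as a sketch only: the
name customTextFieldLowerFirst is made up for illustration, and the documents
would need to be reindexed after such a change.

```xml
<fieldtype name="customTextFieldLowerFirst" class="solr.TextField" indexed="true" stored="false">
  <!-- a single analyzer without a type is applied at both index and query time -->
  <analyzer>
    <tokenizer class="solr.StandardTokenizerFactory"/>
    <!-- lowercase first, so the stemmer sees the same token for "Apple" and "apple" -->
    <filter class="solr.LowerCaseFilterFactory"/>
    <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt"/>
    <filter class="solr.EnglishPorterFilterFactory" protected="protwords.txt"/>
  </analyzer>
</fieldtype>
```

If the order is the cause, then with this type "name:apple" and "name:Apple"
should both match the first document.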


Thank you in advance


-- 
Best regards,
Traut
