Aro, thanks for your interest and response.

I'm using the "stock" definition in the supplied config.xml, as follows:

<fieldType name="text_en" class="solr.TextField"
positionIncrementGap="100">
<analyzer type="index">
<tokenizer class="solr.StandardTokenizerFactory"/>
<filter class="solr.StopFilterFactory" ignoreCase="true"
words="stopwords.txt" enablePositionIncrements="true"/>
<filter class="solr.LowerCaseFilterFactory"/><filter
class="solr.EnglishPossessiveFilterFactory"/>
<filter class="solr.KeywordMarkerFilterFactory"
protected="protwords.txt"/>
<filter class="solr.KStemFilterFactory"/>
</analyzer>
<analyzer type="query">
<tokenizer class="solr.StandardTokenizerFactory"/>
<filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt"
ignoreCase="true" expand="true"/>
<filter class="solr.StopFilterFactory" ignoreCase="true"
words="stopwords.txt" enablePositionIncrements="true"/>
<filter class="solr.LowerCaseFilterFactory"/>
<filter class="solr.EnglishPossessiveFilterFactory"/>
<filter class="solr.KeywordMarkerFilterFactory"
protected="protwords.txt"/>
<filter class="solr.KStemFilterFactory"/></analyzer>
</fieldType>

When viewing the debug output of the results, I have:

<str name=...(The document's unique index field is here)...>

2.097683 = (MATCH) sum of:
  1.072057 = (MATCH) weight(text:schedul in 1595), product of:
    0.75786966 = queryWeight(text:schedul), product of:
      4.355735 = idf(docFreq=59, maxDocs=1720)
      0.17399353 = queryNorm
    1.4145664 = (MATCH) fieldWeight(text:schedul in 1595), product of:
      1.7320508 = tf(termFreq(text:schedul)=3)
      4.355735 = idf(docFreq=59, maxDocs=1720)
      0.1875 = fieldNorm(field=text, doc=1595)
  1.0256261 = (MATCH) weight(text:pickup in 1595), product of:
    0.652406 = queryWeight(text:pickup), product of:
      3.7495992 = idf(docFreq=109, maxDocs=1720)
      0.17399353 = queryNorm
    1.5720673 = (MATCH) fieldWeight(text:pickup in 1595), product of:
      2.236068 = tf(termFreq(text:pickup)=5)
      3.7495992 = idf(docFreq=109, maxDocs=1720)
      0.1875 = fieldNorm(field=text, doc=1595)
</str>

The stem "schedul" is an indication that stemming has occurred on the
query. However, you gave me an idea; I HAVE changed what I thought were
small things to the config.xml without reindexing the content corpus.
It's possible I shot myself in the proverbial foot if I changed to
"text_en" without reindexing. I'll do that shortly (meaning tomorrow
morning) and will report back with my results.

Appreciate the interest...
Tim

Reply via email to