Re: stemming (maybe?) question

Jon Drukman Mon, 16 Mar 2009 17:34:58 -0700

Yonik Seeley wrote:

On Thu, Mar 12, 2009 at 1:36 PM, Jon Drukman <jdruk...@gmail.com> wrote:

is it possible to make solr think that "omeara" and "o'meara" are the same
thing?


WordDelimiter would handle it if the document had "o'meara" (but you
may or may not want the other stuff that comes with
WordDelimiterFilter).
You could also use a PatternReplaceFilter to normalize tokens like this.

the document does have o'meara in it. i tried creating a new field typebased on the wiki information.

<fieldType name="text_user" class="solr.TextField"positionIncrementGap="100">

  <fieldtype name="subword" class="solr.TextField">
      <analyzer type="query">
          <tokenizer class="solr.WhitespaceTokenizerFactory"/>
          <filter class="solr.WordDelimiterFilterFactory"
                generateWordParts="1"
                generateNumberParts="1"
                catenateWords="0"
                catenateNumbers="0"
                catenateAll="0"
                />
          <filter class="solr.LowerCaseFilterFactory"/>
      </analyzer>
      <analyzer type="index">
          <tokenizer class="solr.WhitespaceTokenizerFactory"/>
          <filter class="solr.WordDelimiterFilterFactory"
                generateWordParts="1"
                generateNumberParts="1"
                catenateWords="1"
                catenateNumbers="1"
                catenateAll="0"
                />
          <filter class="solr.LowerCaseFilterFactory"/>
      </analyzer>
    </fieldtype>
</fieldType>

i reindexed everything but now any search on that field returns zeroresults. what did i do wrong?


-jsd-

Re: stemming (maybe?) question

Reply via email to