Yonik Seeley wrote:
On Thu, Mar 12, 2009 at 1:36 PM, Jon Drukman <jdruk...@gmail.com> wrote:
is it possible to make solr think that "omeara" and "o'meara" are the same
thing?

WordDelimiter would handle it if the document had "o'meara" (but you
may or may not want the other stuff that comes with
WordDelimiterFilter).
You could also use a PatternReplaceFilter to normalize tokens like this.

the document does have o'meara in it. i tried creating a new field type based on the wiki information.

<fieldType name="text_user" class="solr.TextField" positionIncrementGap="100">
  <fieldtype name="subword" class="solr.TextField">
      <analyzer type="query">
          <tokenizer class="solr.WhitespaceTokenizerFactory"/>
          <filter class="solr.WordDelimiterFilterFactory"
                generateWordParts="1"
                generateNumberParts="1"
                catenateWords="0"
                catenateNumbers="0"
                catenateAll="0"
                />
          <filter class="solr.LowerCaseFilterFactory"/>
      </analyzer>
      <analyzer type="index">
          <tokenizer class="solr.WhitespaceTokenizerFactory"/>
          <filter class="solr.WordDelimiterFilterFactory"
                generateWordParts="1"
                generateNumberParts="1"
                catenateWords="1"
                catenateNumbers="1"
                catenateAll="0"
                />
          <filter class="solr.LowerCaseFilterFactory"/>
      </analyzer>
    </fieldtype>
</fieldType>


i reindexed everything but now any search on that field returns zero results. what did i do wrong?

-jsd-

Reply via email to