Re: stemming (maybe?) question

Yonik Seeley Mon, 16 Mar 2009 18:16:07 -0700

Not sure... I just took the stock solr example, and it worked fine.

I inserted "o'meara" into example/exampledocs/solr.xml
 <field name="features">Advanced o'meara Full-Text Search
Capabilities using Lucene</field>


the indexed everything:  ./post.sh *.xml

Then queried in various ways:
q=o'meara
q=omeara
q=o%20meara

All of the queries found the solr doc.

-Yonik
http://www.lucidimagination.com


On Mon, Mar 16, 2009 at 8:34 PM, Jon Drukman <jdruk...@gmail.com> wrote:
> Yonik Seeley wrote:
>>
>> On Thu, Mar 12, 2009 at 1:36 PM, Jon Drukman <jdruk...@gmail.com> wrote:
>>>
>>> is it possible to make solr think that "omeara" and "o'meara" are the
>>> same
>>> thing?
>>
>> WordDelimiter would handle it if the document had "o'meara" (but you
>> may or may not want the other stuff that comes with
>> WordDelimiterFilter).
>> You could also use a PatternReplaceFilter to normalize tokens like this.
>
> the document does have o'meara in it.  i tried creating a new field type
> based on the wiki information.
>
> <fieldType name="text_user" class="solr.TextField"
> positionIncrementGap="100">
>  <fieldtype name="subword" class="solr.TextField">
>      <analyzer type="query">
>          <tokenizer class="solr.WhitespaceTokenizerFactory"/>
>          <filter class="solr.WordDelimiterFilterFactory"
>                generateWordParts="1"
>                generateNumberParts="1"
>                catenateWords="0"
>                catenateNumbers="0"
>                catenateAll="0"
>                />
>          <filter class="solr.LowerCaseFilterFactory"/>
>      </analyzer>
>      <analyzer type="index">
>          <tokenizer class="solr.WhitespaceTokenizerFactory"/>
>          <filter class="solr.WordDelimiterFilterFactory"
>                generateWordParts="1"
>                generateNumberParts="1"
>                catenateWords="1"
>                catenateNumbers="1"
>                catenateAll="0"
>                />
>          <filter class="solr.LowerCaseFilterFactory"/>
>      </analyzer>
>    </fieldtype>
> </fieldType>
>
>
> i reindexed everything but now any search on that field returns zero
> results.  what did i do wrong?
>
> -jsd-

Re: stemming (maybe?) question

Reply via email to