Hi Derek,
Why does aggressive stemming worries you? You might have false positives, but that is desired behavior in most cases. In your case "iphone" documents will also be returned for "iphon" query. Is this something that is not desired behavior? You can have more than one field if you want to prefer matches with exact wording, but that is unnecessary overhead in most cases.

Regards,
Emir

--
Monitoring * Alerting * Anomaly Detection * Centralized Log Management
Solr & Elasticsearch Support * http://sematext.com/


On 29.02.2016 10:40, Derek Poh wrote:
Hi

I am using EnglishMinimalStemFilterFactory to reducewords in plural forms to singular forms. The filter factory is not reducingthe plural formof 'es' to the singular form correctly. It is reducing correctly for plural form of 's'.
"boxes" is reduced to "boxe" instead of "box"
"glasses" to "glasse" instead of "glass" etc.

I tried with PorterStemFilterFactory, itis able to reduce the plural 'es' formto singular form correctly. However itreduced "iphones" to "iphon" instead.

Is there other filter factory that can reduce pluralto singular correctly?

The field type definition of the field.
<fieldType class="solr.TextField" name="gs_keyword_exact" positionIncrementGap="100">
        <analyzer type="index">
            <tokenizer class="solr.KeywordTokenizerFactory" />
            <filter class="solr.LowerCaseFilterFactory" />
            <filter class="solr.EnglishMinimalStemFilterFactory" />
        </analyzer>
        <analyzer type="query">
            <tokenizer class="solr.KeywordTokenizerFactory" />
            <filter class="solr.LowerCaseFilterFactory" />
            <filter class="solr.EnglishMinimalStemFilterFactory" />
        </analyzer>
    </fieldType>

----------------------
CONFIDENTIALITY NOTICE
This e-mail (including any attachments) may contain confidential and/or privileged information. If you are not the intended recipient or have received this e-mail in error, please inform the sender immediately and delete this e-mail (including any attachments) from your computer, and you must not use, disclose to anyone else or copy this e-mail (including any attachments), whether in whole or in part. This e-mail and any reply to it may be monitored for security, legal, regulatory compliance and/or other appropriate reasons.

Reply via email to