Hi Derek,
Why does aggressive stemming worries you? You might have false
positives, but that is desired behavior in most cases. In your case
"iphone" documents will also be returned for "iphon" query. Is this
something that is not desired behavior? You can have more than one field
if you want to prefer matches with exact wording, but that is
unnecessary overhead in most cases.
Regards,
Emir
--
Monitoring * Alerting * Anomaly Detection * Centralized Log Management
Solr & Elasticsearch Support * http://sematext.com/
On 29.02.2016 10:40, Derek Poh wrote:
Hi
I am using EnglishMinimalStemFilterFactory to reducewords in plural
forms to singular forms.
The filter factory is not reducingthe plural formof 'es' to the
singular form correctly. It is reducing correctly for plural form of 's'.
"boxes" is reduced to "boxe" instead of "box"
"glasses" to "glasse" instead of "glass" etc.
I tried with PorterStemFilterFactory, itis able to reduce the plural
'es' formto singular form correctly. However itreduced "iphones" to
"iphon" instead.
Is there other filter factory that can reduce pluralto singular
correctly?
The field type definition of the field.
<fieldType class="solr.TextField" name="gs_keyword_exact"
positionIncrementGap="100">
<analyzer type="index">
<tokenizer class="solr.KeywordTokenizerFactory" />
<filter class="solr.LowerCaseFilterFactory" />
<filter class="solr.EnglishMinimalStemFilterFactory" />
</analyzer>
<analyzer type="query">
<tokenizer class="solr.KeywordTokenizerFactory" />
<filter class="solr.LowerCaseFilterFactory" />
<filter class="solr.EnglishMinimalStemFilterFactory" />
</analyzer>
</fieldType>
----------------------
CONFIDENTIALITY NOTICE
This e-mail (including any attachments) may contain confidential
and/or privileged information. If you are not the intended recipient
or have received this e-mail in error, please inform the sender
immediately and delete this e-mail (including any attachments) from
your computer, and you must not use, disclose to anyone else or copy
this e-mail (including any attachments), whether in whole or in part.
This e-mail and any reply to it may be monitored for security, legal,
regulatory compliance and/or other appropriate reasons.