Step created SOLR-9282:
--------------------------
Summary: greek accent insensitive search
Key: SOLR-9282
URL: https://issues.apache.org/jira/browse/SOLR-9282
Project: Solr
Issue Type: Bug
Security Level: Public (Default Security Level. Issues are Public)
Components: SearchComponents - other
Affects Versions: 5.0
Environment: Server Linux + Tomcat
Client Windows 7 + JVM 1.8
Reporter: Step
Priority: Critical
We created a new field type, this field type is used for a sentence that
contains text in latin and old greek language
the text can include greek words with accents
we want to be able to do an accent insensitive search so for example:
if i search the word βιβλος i want to find in the text the word βίβλος with
iota coronis accent.
Similarly if I search the word βίβλος with iota acute accent i again want to
find in the text the word βίβλος with iota coronis accent.
I looked for solutions and i found the filter ASCIIFoldingFilterFactory
i installed that filter but do not make the correct job for old greek language
<fieldType name="text_acs" class="solr.TextField" positionIncrementGap="1000">
<analyzer type="index">
<tokenizer class="solr.StandardTokenizerFactory" />
<filter class="solr.ASCIIFoldingFilterFactory" />
<filter class="solr.LowerCaseFilterFactory"/>
<filter class="solr.GreekStemFilterFactory"/>
</analyzer>
<analyzer type="query">
<tokenizer class="solr.StandardTokenizerFactory"/>
<filter class="solr.ASCIIFoldingFilterFactory"
/>
<filter class="solr.LowerCaseFilterFactory"/>
<filter class="solr.GreekStemFilterFactory"/>
</analyzer>
</fieldType>
Please we need a solution to make accent insensitive searches.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]