[ 
https://issues.apache.org/jira/browse/LUCENE-6053?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14201993#comment-14201993
 ] 

Robert Muir commented on LUCENE-6053:
-------------------------------------

Looks good (caveat: I am not intimately familiar with the normalizations of 
diacritics here).

Should we add a note to SerbianNormalizationFilter that it expects lowercase 
input?

> Serbian Analyzer
> ----------------
>
>                 Key: LUCENE-6053
>                 URL: https://issues.apache.org/jira/browse/LUCENE-6053
>             Project: Lucene - Core
>          Issue Type: Improvement
>          Components: modules/analysis
>            Reporter: Nikola Smolenski
>         Attachments: LUCENE-Serbian.patch
>
>
> This is analyzer for Serbian language, so far consisting only of a 
> normalizer. Serbian language uses both Cyrillic and Latin alphabet, so the 
> normalizer works with both alphabets.
> In the future, I'll see to add stopwords, stemmer and so on.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to