[ 
https://issues.apache.org/jira/browse/SOLR-4565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13662871#comment-13662871
 ] 

Erlend Garåsen edited comment on SOLR-4565 at 5/22/13 9:33 AM:
---------------------------------------------------------------

There are not so many differences between the stemming rules for these two 
languages. The only difference is that you must skip some rules for Nynorsk if 
you have configured the stemmer to only use Bokmål.

Both Nynorsk and Bokmål have endings with "-ene", for instance many feminine 
indefinite nouns in plural form such as "jentene" (same for both languages). 
For these nouns, you must only exclude stemming for words ending with "-ane" if 
you have configured it for Bokmål.

The same rules apply to masculine indefinite nouns in plural form for Nynorsk, 
i.e. endings with "-ar". The stemmer must skip those endings as long as only 
Bokmål is used.
                
      was (Author: erlendfg):
    There are not so many differences between the stemming rules for these two 
languages. The only difference is that you must skip some rules for Nynorsk if 
you have configuring the stemmer to only use Bokmål.

Both Nynorsk and Bokmål have endings with "-ene", for instance many feminine 
indefinite nouns in plural form such as "jentene" (same for both languages). 
For these nouns, you must only exclude stemming for words ending with "-ane" if 
you have configured it for Bokmål.

The same rules apply to masculine indefinite nouns in plural form for Nynorsk, 
i.e. endings with "-ar". The stemmer must skip those endings as long as only 
Bokmål is used.
                  
> Extend NorwegianMinimalStemFilter to handle "nynorsk"
> -----------------------------------------------------
>
>                 Key: SOLR-4565
>                 URL: https://issues.apache.org/jira/browse/SOLR-4565
>             Project: Solr
>          Issue Type: Improvement
>          Components: Schema and Analysis
>            Reporter: Jan Høydahl
>
> Norway has two official languages, both called "Norwegian", namely Bokmål 
> (nb_NO) and Nynorsk (nn_NO).
> The NorwegianMinimalStemFilter and NorwegianLightStemFilter today only works 
> with the largest of the two, namely Bokmål.
> Propose to incorporate "nn" support through a new "vaiant" config option:
> * variant="nb" or not configured -> Bokmål as today
> * variant="nn" -> Nynorsk only
> * variant="no" -> Remove stems for both nb and nn

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to