[ https://issues.apache.org/jira/browse/SOLR-4565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13662871#comment-13662871 ]
Erlend Garåsen edited comment on SOLR-4565 at 5/22/13 9:33 AM: --------------------------------------------------------------- There are not so many differences between the stemming rules for these two languages. The only difference is that you must skip some rules for Nynorsk if you have configured the stemmer to only use Bokmål. Both Nynorsk and Bokmål have endings with "-ene", for instance many feminine indefinite nouns in plural form such as "jentene" (same for both languages). For these nouns, you must only exclude stemming for words ending with "-ane" if you have configured it for Bokmål. The same rules apply to masculine indefinite nouns in plural form for Nynorsk, i.e. endings with "-ar". The stemmer must skip those endings as long as only Bokmål is used. was (Author: erlendfg): There are not so many differences between the stemming rules for these two languages. The only difference is that you must skip some rules for Nynorsk if you have configuring the stemmer to only use Bokmål. Both Nynorsk and Bokmål have endings with "-ene", for instance many feminine indefinite nouns in plural form such as "jentene" (same for both languages). For these nouns, you must only exclude stemming for words ending with "-ane" if you have configured it for Bokmål. The same rules apply to masculine indefinite nouns in plural form for Nynorsk, i.e. endings with "-ar". The stemmer must skip those endings as long as only Bokmål is used. > Extend NorwegianMinimalStemFilter to handle "nynorsk" > ----------------------------------------------------- > > Key: SOLR-4565 > URL: https://issues.apache.org/jira/browse/SOLR-4565 > Project: Solr > Issue Type: Improvement > Components: Schema and Analysis > Reporter: Jan Høydahl > > Norway has two official languages, both called "Norwegian", namely Bokmål > (nb_NO) and Nynorsk (nn_NO). > The NorwegianMinimalStemFilter and NorwegianLightStemFilter today only works > with the largest of the two, namely Bokmål. > Propose to incorporate "nn" support through a new "vaiant" config option: > * variant="nb" or not configured -> Bokmål as today > * variant="nn" -> Nynorsk only > * variant="no" -> Remove stems for both nb and nn -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org