[ 
https://issues.apache.org/jira/browse/LUCENE-9220?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17037669#comment-17037669
 ] 

ASF subversion and git services commented on LUCENE-9220:
---------------------------------------------------------

Commit fc229b170197e37ffcbdb330e7657939979a7def in lucene-solr's branch 
refs/heads/jira/LUCENE-9220 from Robert Muir
[ https://gitbox.apache.org/repos/asf?p=lucene-solr.git;h=fc229b1 ]

LUCENE-9220 regenerate all stemmers from snowball 2.0

Instead of patching them after-the-fact (both manually and
automatically over the years) we patch the generator.

This is easier to maintain than patches/changes against generated code.
See LUCENE-9220 for more information.

There is a remaining nocommit, test data. Also need to hook in and test
the new languages that are added here.


> Upgrade Snowball version to 2.0
> -------------------------------
>
>                 Key: LUCENE-9220
>                 URL: https://issues.apache.org/jira/browse/LUCENE-9220
>             Project: Lucene - Core
>          Issue Type: Wish
>            Reporter: Nguyen Minh Gia Huy
>            Priority: Major
>         Attachments: snowball_53739a805cfa6c.patch
>
>
> When working with Snowball-based stemmers, I realized that Lucene is 
> currently [using a pre-compiled version of 
> Snowball|https://lucene.apache.org/core/8_4_1/analyzers-common/org/apache/lucene/analysis/snowball/package-summary.html],
>  that seems from 12 years ago: 
> https://github.com/snowballstem/snowball/tree/e103b5c257383ee94a96e7fc58cab3c567bf079b
> Snowball has just released v2.0 in 10/2019 with many improvements, new 
> supported languages ( Arabic, Indonesian…) and new features ( stringdef 
> notation for Unicode codepoints…). Details of the changes could be found 
> here: https://github.com/snowballstem/snowball/blob/master/NEWS. I think 
> these changes of Snowball could give a promising positive impact on Lucene.
> I wonder when Lucene should upgrade Snowball to the latest version ( v2.0).



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

Reply via email to