[ 
https://issues.apache.org/jira/browse/SOLR-17346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17859906#comment-17859906
 ] 

Alastair Porter commented on SOLR-17346:
----------------------------------------

It appears that I cannot assign this task to myself, but I have already opened 
a PR at https://github.com/apache/solr/pull/2533

> Synchronise default configset stopwords to the same list as lucene
> ------------------------------------------------------------------
>
>                 Key: SOLR-17346
>                 URL: https://issues.apache.org/jira/browse/SOLR-17346
>             Project: Solr
>          Issue Type: Improvement
>      Security Level: Public(Default Security Level. Issues are Public) 
>            Reporter: Alastair Porter
>            Priority: Trivial
>
> Solr's default configset comes with a collection of sample stopwords from the 
> snowball project in solr/server/solr/configsets/_default/conf/lang 
> (https://github.com/apache/solr/tree/a42c605fb916439222a086356f368f02cf80304a/solr/server/solr/configsets/_default/conf/lang)
> There is a similar list of stopwords in the lucene repository, however these 
> have been updated to a more recent list of snowball 
> ([https://github.com/apache/lucene/tree/main/lucene/analysis/common/src/resources/org/apache/lucene/analysis/snowball)]
> Specifically, the most recent list of stopwords for the french language has 
> removed a number of words which are homonyms of other useful words which 
> shouldn't be skipped.
> In a discussion on the solr-users mailing list it was agreed that it would be 
> a good idea to sync the list of files in solr with the ones in lucene.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@solr.apache.org
For additional commands, e-mail: issues-h...@solr.apache.org

Reply via email to