[
https://issues.apache.org/jira/browse/LUCENE-1688?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Mark Miller resolved LUCENE-1688.
---------------------------------
Resolution: Fixed
Fix Version/s: (was: 3.0)
Lucene Fields: [New, Patch Available] (was: [New])
Thanks Simon!
> Deprecating StopAnalyzer ENGLISH_STOP_WORDS - General replacement with an
> immutable Set
> ---------------------------------------------------------------------------------------
>
> Key: LUCENE-1688
> URL: https://issues.apache.org/jira/browse/LUCENE-1688
> Project: Lucene - Java
> Issue Type: Improvement
> Reporter: Simon Willnauer
> Assignee: Mark Miller
> Priority: Minor
> Fix For: 2.9
>
> Attachments: LUCENE-1688.patch, LUCENE-1688.patch, StopWords.patch
>
>
> StopAnalyzer and StandartAnalyzer are using the static final array
> ENGLISH_STOP_WORDS by default in various places. Internally this array is
> converted into a mutable set which looks kind of weird to me.
> I think the way to go is to deprecate all use of the static final array and
> replace it with an immutable implementation of CharArraySet. Inside an
> analyzer it does not make sense to have a mutable set anyway and we could
> prevent set creation each time an analyzer is created. In the case of an
> immutable set we won't have multithreading issues either.
> in essence we get rid of a fair bit of "converting string array to set" code,
> do not have a PUBLIC static reference to an array (which is mutable) and
> reduce the overhead of analyzer creation.
> let me know what you think and I create a patch for it.
> simon
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]