[
https://issues.apache.org/jira/browse/LUCENE-7318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15481922#comment-15481922
]
Uwe Schindler commented on LUCENE-7318:
---------------------------------------
bq. Example: LowerCaseFilter and UpperCaseFilter are now in different packages
and different jars?!
I agree, that's not how it should look. The package structure of Lucene 4
was so great and now we are back at Lucene 1.0! Sorry, no-go for me.
bq. Steering people toward using StopFilter by default isn't necessarily a good
idea either.
I fully agree, as I said in my previous comment. Somebody who wants to use a
StopFilter can do this very easily using CustomAnalyzer. And that's all part of
analysis/common. No need to do this in core.
People that have no idea about "stop words or not" should not need to deal with
it. At least, do not provide English stop words by default for something that
is advertised as language neutral.
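To illustrate the point about defaults, here is a minimal, dependency-free
sketch in plain Java. It deliberately does not use Lucene's analysis API: the
class StopWordSketch, its analyze methods, and the whitespace tokenization are
all hypothetical stand-ins. It only shows the behavior argued for above, namely
that nothing is dropped unless the caller opts in.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.Set;

// Hypothetical sketch; this is NOT Lucene's analysis API.
class StopWordSketch {

    // Language-neutral default: no stop words at all.
    static final Set<String> NO_STOP_WORDS = Set.of();

    // Split on whitespace, lowercase, and drop tokens found in the stop set.
    static List<String> analyze(String text, Set<String> stopWords) {
        List<String> tokens = new ArrayList<>();
        for (String raw : text.trim().split("\\s+")) {
            if (raw.isEmpty()) {
                continue; // happens only for blank input
            }
            String token = raw.toLowerCase(Locale.ROOT);
            if (!stopWords.contains(token)) {
                tokens.add(token);
            }
        }
        return tokens;
    }

    // Default entry point: nothing is removed unless the caller opts in.
    static List<String> analyze(String text) {
        return analyze(text, NO_STOP_WORDS);
    }
}
```

With the empty default, the text "To be or not to be" keeps all six tokens.
Passing an English stop set such as {to, be, or, not} removes every one of
them, which is the kind of surprise a language-neutral default should avoid.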
P.S.: Stop words are empty by default in ES, too!
(https://www.elastic.co/guide/en/elasticsearch/reference/current/analysis-standard-analyzer.html)
> Graduate StandardAnalyzer out of analyzers module into core
> -----------------------------------------------------------
>
> Key: LUCENE-7318
> URL: https://issues.apache.org/jira/browse/LUCENE-7318
> Project: Lucene - Core
> Issue Type: Improvement
> Reporter: Michael McCandless
> Assignee: Michael McCandless
> Priority: Blocker
> Fix For: master (7.0), 6.2, 6.2.1
>
> Attachments: LUCENE-7318.patch
>
>
> Spinoff from LUCENE-7314:
> {{StandardAnalyzer}} has progressed substantially since we broke out the
> analyzers module ... it now follows a real Unicode standard (UAX #29 Unicode
> Text Segmentation). It's also much faster than it used to be, since it
> switched to JFlex a while back. Many bug fixes, etc.
> I think it would make a good default for most Lucene users, and we should
> graduate it from the analyzers module into core, and make it the default for
> {{IndexWriter}}.
> It's really quite crazy that users must go digging in the analyzers module to
> get started with Lucene ... we don't make them dig through the codecs module
> to find a good default codec ...