[jira] [Commented] (LUCENE-8948) Change "name" argument in ICU factories to "form"

2019-08-11 Thread Uwe Schindler (JIRA)


[ 
https://issues.apache.org/jira/browse/LUCENE-8948?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16904615#comment-16904615
 ] 

Uwe Schindler commented on LUCENE-8948:
---

Thanks for adding the nfkc test :-) Nice!

> Change "name" argument in ICU factories to "form"
> -
>
> Key: LUCENE-8948
> URL: https://issues.apache.org/jira/browse/LUCENE-8948
> Project: Lucene - Core
>  Issue Type: Improvement
>  Components: modules/analysis
>Reporter: Tomoko Uchida
>Assignee: Tomoko Uchida
>Priority: Minor
> Fix For: master (9.0)
>
> Attachments: LUCENE-8948.patch
>
>
> {{o.a.l.a.icu.ICUNormalizer2CharFilterFactory}} and 
> {{o.a.l.a.icu.ICUNormalizer2FilterFactory}} have "name" arguments to specify 
> Unicode Normalization Form. The "name" is vague and it causes problem with 
> SOLR-13593.
> "form" would be suitable here instead of "name".



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-8948) Change "name" argument in ICU factories to "form"

2019-08-10 Thread ASF subversion and git services (JIRA)


[ 
https://issues.apache.org/jira/browse/LUCENE-8948?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16904563#comment-16904563
 ] 

ASF subversion and git services commented on LUCENE-8948:
-

Commit 407ba89aad028a37cf5ec0f131b8773d394177c2 in lucene-solr's branch 
refs/heads/master from Tomoko Uchida
[ https://gitbox.apache.org/repos/asf?p=lucene-solr.git;h=407ba89 ]

LUCENE-8948: Change 'name' argument in ICU factories to 'form'.


> Change "name" argument in ICU factories to "form"
> -
>
> Key: LUCENE-8948
> URL: https://issues.apache.org/jira/browse/LUCENE-8948
> Project: Lucene - Core
>  Issue Type: Improvement
>  Components: modules/analysis
>Reporter: Tomoko Uchida
>Priority: Minor
> Attachments: LUCENE-8948.patch
>
>
> {{o.a.l.a.icu.ICUNormalizer2CharFilterFactory}} and 
> {{o.a.l.a.icu.ICUNormalizer2FilterFactory}} have "name" arguments to specify 
> Unicode Normalization Form. The "name" is vague and it causes problem with 
> SOLR-13593.
> "form" would be suitable here instead of "name".



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-8948) Change "name" argument in ICU factories to "form"

2019-08-10 Thread Tomoko Uchida (JIRA)


[ 
https://issues.apache.org/jira/browse/LUCENE-8948?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16904562#comment-16904562
 ] 

Tomoko Uchida commented on LUCENE-8948:
---

OK, in the [ICU factory 
documentation|https://lucene.apache.org/core/8_2_0/analyzers-icu/org/apache/lucene/analysis/icu/ICUNormalizer2FilterFactory.html],
 it's explicitly documented as follows:
{quote}name: A Unicode Normalization Form, one of 'nfc','nfkc', 'nfkc_cf'. 
Default is nfkc_cf.
{quote}
So seems there is no need to worry about changing the parameter to "form" :)

Here is the patch that also includes tests and Javadoc changes: 
[^LUCENE-8948.patch]

> Change "name" argument in ICU factories to "form"
> -
>
> Key: LUCENE-8948
> URL: https://issues.apache.org/jira/browse/LUCENE-8948
> Project: Lucene - Core
>  Issue Type: Improvement
>  Components: modules/analysis
>Reporter: Tomoko Uchida
>Priority: Minor
> Attachments: LUCENE-8948.patch
>
>
> {{o.a.l.a.icu.ICUNormalizer2CharFilterFactory}} and 
> {{o.a.l.a.icu.ICUNormalizer2FilterFactory}} have "name" arguments to specify 
> Unicode Normalization Form. The "name" is vague and it causes problem with 
> SOLR-13593.
> "form" would be suitable here instead of "name".



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-8948) Change "name" argument in ICU factories to "form"

2019-08-10 Thread Tomoko Uchida (JIRA)


[ 
https://issues.apache.org/jira/browse/LUCENE-8948?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16904548#comment-16904548
 ] 

Tomoko Uchida commented on LUCENE-8948:
---

I've searched a bit of details of the parameter naming.

The factories' "name" parameter should come from ICU4J Normalizer2 factory 
class method parameter.

[http://www.icu-project.org/apiref/icu4j/com/ibm/icu/text/Normalizer2.html#getInstance-java.io.InputStream-java.lang.String-com.ibm.icu.text.Normalizer2.Mode-]
{quote}data - the binary, big-endian normalization (.nrm file) data, or null 
for ICU data
 name - "nfc" or "nfkc" or "nfkc_cf" or name of custom data file
{quote}
Strictly speaking, the ICU4J normalizer's "name" seems not to be equal to the 
"Unicode normalization form" (it has wider meaning than "normalization form"). 
 Nonetheless "data" is always null when Lucene ICU factories instantiate it so 
it looks okay to me to change the parameter to "form" from the standpoint of 
understandability.

Just in case, [~thetaphi]: does that make sense to you?

> Change "name" argument in ICU factories to "form"
> -
>
> Key: LUCENE-8948
> URL: https://issues.apache.org/jira/browse/LUCENE-8948
> Project: Lucene - Core
>  Issue Type: Improvement
>  Components: modules/analysis
>Reporter: Tomoko Uchida
>Priority: Minor
>
> {{o.a.l.a.icu.ICUNormalizer2CharFilterFactory}} and 
> {{o.a.l.a.icu.ICUNormalizer2FilterFactory}} have "name" arguments to specify 
> Unicode Normalization Form. The "name" is vague and it causes problem with 
> SOLR-13593.
> "form" would be suitable here instead of "name".



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org