[ https://issues.apache.org/jira/browse/LUCENE-8366?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16881343#comment-16881343 ]
Mathieu Marie edited comment on LUCENE-8366 at 7/9/19 4:09 PM: --------------------------------------------------------------- sorry to comment on a closed issue. It seems to me that one file was not updated during the upgrade to 62-1. https://github.com/apache/lucene-solr/blob/branch_7_5/lucene/analysis/icu/src/tools/java/org/apache/lucene/analysis/icu/GenerateUTR30DataFiles.java#L66 With that update, running again the ant target `gennorm2`should also bring 3 new files : * nfc.txt * nfkc.txt * nfkc_cf.txt was (Author: matmarie): sorry to comment on a closed issue. It seems to me that one file was not updated during the upgrade to 62-1. [https://github.com/apache/lucene-solr/blob/branch_7_5/lucene/analysis/icu/src/tools/java/org/apache/lucene/analysis/icu/GenerateUTR30DataFiles.java#L66 ] With that update, running again the ant target `gennorm2`should also bring 3 new files : * nfc.txt * nfkc.txt * nfkc_cf.txt > upgrade to icu 62.1 > ------------------- > > Key: LUCENE-8366 > URL: https://issues.apache.org/jira/browse/LUCENE-8366 > Project: Lucene - Core > Issue Type: Improvement > Components: modules/analysis > Reporter: Robert Muir > Priority: Major > Fix For: trunk, 7.5 > > Attachments: LUCENE-8366.patch > > > This gives unicode 11 support. > Also emoji tokenization is simpler and it gives a way to have better > tokenization for emoji from the future. -- This message was sent by Atlassian JIRA (v7.6.3#76005) --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org