GitHub user cbismuth opened a pull request:
https://github.com/apache/lucene-solr/pull/505
LUCENE-8548: Reevaluate scripts boundary break in Nori's tokenizer
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/cbismuth/lucene-solr LUCENE-8548
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/lucene-solr/pull/505.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #505
----
commit 73e0e11807f2c62a7620e53648cb379b18a0d1ee
Author: Christophe Bismuth <christophe.bismuth@...>
Date:   2018-11-21T12:58:44Z

    LUCENE-8548: Add Cyrillic word test

commit 4c79ca6271537bb9ca347c6128a3a5ad016e5d97
Author: Christophe Bismuth <christophe.bismuth@...>
Date:   2018-11-23T16:16:05Z

    LUCENE-8548: Break on script boundaries and track character classes
----