[
https://issues.apache.org/jira/browse/LUCENE-7476?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Michael McCandless updated LUCENE-7476:
---------------------------------------
Attachment: LUCENE-7476.patch
OK thanks [~andyhind], I think this is in fact a real bug!
It happens in Eclipse but not under ant because, when {{TestFactories}} runs
from Eclipse, {{JapaneseNumberFilter}} is apparently on the {{CLASSPATH}},
whereas under ant it is not, which is annoying. (There is a TODO about this
in {{TestFactories}}.)
I'm attaching a patch (based on 6.x) that copies {{TestFactories.java}} into
the kuromoji module, and then I can reproduce this failure:
{noformat}
.TEST FAIL: useCharFilter=false text='alari/power.html 301'
ENOTE: reproduce with: ant test -Dtestcase=TestFactories -Dtests.method=test
-Dtests.seed=4139A5350B1C10BE -Dtests.slow=true
-Dtests.linedocsfile=/lucenedata/hudson.enwiki.random.lines.txt.fixed
-Dtests.locale=cs -Dtests.timezone=America/Paramaribo -Dtests.asserts=true
-Dtests.file.encoding=UTF-8
NOTE: test params are:
codec=FastCompressingStoredFields(storedFieldsFormat=CompressingStoredFieldsFormat(compressionMode=FAST,
chunkSize=6, maxDocsPerChunk=1004, blockSize=452),
termVectorsFormat=CompressingTermVectorsFormat(compressionMode=FAST,
chunkSize=6, blockSize=452)), sim=ClassicSimilarity, locale=cs,
timezone=America/Paramaribo
NOTE: Linux 4.4.0-38-generic amd64/Oracle Corporation 1.8.0_92
(64-bit)/cpus=8,threads=1,free=415218864,total=514850816
NOTE: All tests run in this JVM: [TestFactories]
Time: 2.298
There was 1 failure:
1) test(org.apache.lucene.analysis.ja.TestFactories)
java.lang.IllegalStateException: incrementToken() called while in wrong state: INCREMENT_FALSE
at __randomizedtesting.SeedInfo.seed([4139A5350B1C10BE:C96D9AEFA5E07D46]:0)
at org.apache.lucene.analysis.MockTokenizer.fail(MockTokenizer.java:125)
at org.apache.lucene.analysis.MockTokenizer.incrementToken(MockTokenizer.java:136)
at org.apache.lucene.analysis.ja.JapaneseNumberFilter.incrementToken(JapaneseNumberFilter.java:152)
at org.apache.lucene.analysis.BaseTokenStreamTestCase.checkAnalysisConsistency(BaseTokenStreamTestCase.java:716)
at org.apache.lucene.analysis.BaseTokenStreamTestCase.checkRandomData(BaseTokenStreamTestCase.java:627)
at org.apache.lucene.analysis.BaseTokenStreamTestCase.checkRandomData(BaseTokenStreamTestCase.java:525)
at org.apache.lucene.analysis.ja.TestFactories.doTestTokenFilter(TestFactories.java:104)
at org.apache.lucene.analysis.ja.TestFactories.test(TestFactories.java:57)
{noformat}
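For readers unfamiliar with the contract behind the {{INCREMENT_FALSE}} message: the trace shows {{MockTokenizer}} rejecting an {{incrementToken()}} call made after it had already reported end of stream. The sketch below is a simplified, Lucene-free model of that contract and of one common way a lookahead filter can respect it. Every name in it ({{Source}}, {{StrictSource}}, {{DigitJoiningFilter}}, {{inputDone}}) is hypothetical, and it is not taken from the attached patch or from {{JapaneseNumberFilter}}; the actual cause there may differ.

```java
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;

public class TokenStreamContractSketch {

    /** Hypothetical stand-in for a token stream; this is NOT Lucene's TokenStream API. */
    interface Source {
        /** Returns the next token, or null once the stream is exhausted. */
        String next();
    }

    /** Strict source: like a mock tokenizer, it rejects any call made after exhaustion. */
    static final class StrictSource implements Source {
        private final Iterator<String> tokens;
        private boolean exhausted = false;

        StrictSource(List<String> tokens) {
            this.tokens = tokens.iterator();
        }

        @Override
        public String next() {
            if (exhausted) {
                throw new IllegalStateException("next() called after the stream was exhausted");
            }
            if (tokens.hasNext()) {
                return tokens.next();
            }
            exhausted = true;
            return null;
        }
    }

    /** Lookahead filter that joins adjacent all-digit tokens into one token. */
    static final class DigitJoiningFilter implements Source {
        private final Source input;
        private String pending;     // token read ahead but not yet emitted
        private boolean inputDone;  // remembers that input already returned null

        DigitJoiningFilter(Source input) {
            this.input = input;
        }

        @Override
        public String next() {
            String current = (pending != null) ? pending : read();
            pending = null;
            if (current == null || !isDigits(current)) {
                return current;
            }
            StringBuilder joined = new StringBuilder(current);
            while (true) {
                String ahead = read();
                if (ahead == null) {
                    break;              // input exhausted while looking ahead
                }
                if (!isDigits(ahead)) {
                    pending = ahead;    // keep it for the following call
                    break;
                }
                joined.append(ahead);
            }
            return joined.toString();
        }

        /** Reads from the input, but never again once it has reported exhaustion. */
        private String read() {
            if (inputDone) {
                return null;
            }
            String token = input.next();
            if (token == null) {
                inputDone = true;
            }
            return token;
        }

        private static boolean isDigits(String s) {
            return !s.isEmpty() && s.chars().allMatch(Character::isDigit);
        }
    }

    public static void main(String[] args) {
        Source filter = new DigitJoiningFilter(new StrictSource(Arrays.asList("alari", "3", "01")));
        for (String t = filter.next(); t != null; t = filter.next()) {
            System.out.println(t);
        }
    }
}
```

In this model the lookahead loop consumes the end-of-stream signal while emitting the last token, so the next call on the filter would reach the exhausted source again if {{inputDone}} were not consulted first. That is the kind of state loss the issue description hints at, stated here only as an illustration.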
> Fix transient failure in JapaneseNumberFilter run from TestFactories
> --------------------------------------------------------------------
>
> Key: LUCENE-7476
> URL: https://issues.apache.org/jira/browse/LUCENE-7476
> Project: Lucene - Core
> Issue Type: Bug
> Components: core/other
> Affects Versions: 6.2.1
> Reporter: Andy Hind
> Priority: Trivial
> Attachments: LUCENE-7476.patch, LUCENE-7476.patch
>
>
> Repeatedly running TestFactories shows this test to fail ~10% of the time.
> I believe the fix is trivial and related to losing the state of the
> underlying input stream when testing some analyzer life cycle flows.