[ 
https://issues.apache.org/jira/browse/LUCENE-6019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Michael McCandless updated LUCENE-6019:
---------------------------------------
    Attachment: LUCENE-6019.patch

Patch with a new test case (failing before the fix), and a fix to throw an 
IllegalArgumentException when the invalid document is added, rather than at 
merge time after the index is already corrupted.
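
To illustrate the idea of rejecting the document at add time, here is a 
minimal, self-contained sketch. It is not the code from the attached patch and 
does not use Lucene's classes: DocValuesTypeCheckSketch, Registry, DvType and 
check() are hypothetical names chosen for this example. It assumes the writer 
remembers the first doc values type seen for each field name and compares every 
later add against it. The check is an explicit if/throw rather than an assert, 
so it still runs when the JVM is started without -ea.

```java
import java.util.HashMap;
import java.util.Map;

/** Hypothetical sketch of an add-time doc values type check; not Lucene's actual classes. */
public class DocValuesTypeCheckSketch {

    /** Illustrative subset: only the two types named in the issue description. */
    enum DvType { BINARY, SORTED_SET }

    /** Remembers the first doc values type seen for each field name. */
    static final class Registry {
        private final Map<String, DvType> typeByField = new HashMap<>();

        /**
         * Records the type for a field on first use, and throws
         * IllegalArgumentException if the field was already seen with a different type.
         */
        void check(String field, DvType type) {
            // putIfAbsent returns null on first registration, else the existing type.
            DvType previous = typeByField.putIfAbsent(field, type);
            if (previous != null && previous != type) {
                throw new IllegalArgumentException(
                    "cannot change doc values type from " + previous + " to " + type
                        + " for field \"" + field + "\"");
            }
        }
    }

    public static void main(String[] args) {
        Registry registry = new Registry();
        registry.check("foo", DvType.BINARY);
        registry.check("foo", DvType.BINARY); // same type again is accepted
        try {
            registry.check("foo", DvType.SORTED_SET); // conflicting type is rejected
            System.out.println("accepted (unexpected)");
        } catch (IllegalArgumentException e) {
            System.out.println("rejected: " + e.getMessage());
        }
    }
}
```

Because the conflicting add is rejected before anything is recorded for it, the 
first registered type for "foo" stays in place and later adds with the original 
type continue to work.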

I also added some missing fail() calls in TestDocValuesIndexing.

I also turned off test assertions and fixed the tests that relied on them, 
e.g. by upgrading IndexWriter test points to run without asserts, and by 
removing some test points.  I put in one nocommit about whether the assert we 
use to verify that TokenStream is final should be a "real" check (I think not?) ...

If the asserts fix is too much here, I can open a separate issue...

> IndexWriter allows to add same field with different docvalues type
> -------------------------------------------------------------------
>
>                 Key: LUCENE-6019
>                 URL: https://issues.apache.org/jira/browse/LUCENE-6019
>             Project: Lucene - Core
>          Issue Type: Bug
>          Components: core/index
>    Affects Versions: 4.10.1
>            Reporter: Simon Willnauer
>            Priority: Critical
>             Fix For: 4.10.2, 5.0
>
>         Attachments: LUCENE-6019.patch
>
>
> IndexWriter checks that the DV types are consistent in multiple places, but 
> due to some problems in Elasticsearch, users were able to add the same field 
> with different DV types, causing merges to fail. I was able to reduce this 
> to a Lucene test case, but I was puzzled since it always failed with 
> assertions enabled. I had to run it without assertions, and that caused the 
> bug to happen. I can add field foo with BINARY and SORTED_SET, causing a 
> merge to fail. Here is a gist: 
> https://gist.github.com/s1monw/8707f924b76ba40ee5f3 / 
> https://github.com/elasticsearch/elasticsearch/issues/8009 
> While this is certainly a problem in Elasticsearch, Lucene also allows user 
> error to corrupt an index, which I think should be prevented. NOTE: this only 
> fails if you run without assertions, which I think Lucene should also do in 
> CI once in a while.



