[ 
https://issues.apache.org/jira/browse/LUCENE-10272?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17451331#comment-17451331
 ] 

ASF subversion and git services commented on LUCENE-10272:
----------------------------------------------------------

Commit c89c78cee0fbcd88b23d5772ea02f785865add7a in lucene's branch 
refs/heads/branch_9x from Robert Muir
[ https://gitbox.apache.org/repos/asf?p=lucene.git;h=c89c78c ]

LUCENE-10272: cross-check norms with postings in checkindex (#493)

Previously, CheckIndex would iterate norms and validate each one. But if norms 
that should be there were missing, nothing would fail. Now it computes an 
expected count of norms and ensures it saw them all.

> checkindex: cross-check norms against docCount
> ----------------------------------------------
>
>                 Key: LUCENE-10272
>                 URL: https://issues.apache.org/jira/browse/LUCENE-10272
>             Project: Lucene - Core
>          Issue Type: Bug
>            Reporter: Robert Muir
>            Priority: Major
>         Attachments: LUCENE-10272.patch
>
>          Time Spent: 20m
>  Remaining Estimate: 0h
>
> Spinoff from LUCENE-10271
> Despite the current crazy ghost values / elasticsearch abuse of norms, we 
> should still cross-check the data in checkindex.
> Terms.getDocCount should equal the number of norms (we just ignore the bogus 
> ghost values for now).
> Currently checkindex is missing really any check of norms: and it turns out 
> that non-aborting exceptions create inconsistencies.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

Reply via email to