[ https://issues.apache.org/jira/browse/LUCENE-5969?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Robert Muir updated LUCENE-5969: -------------------------------- Attachment: LUCENE-5969.patch Here is a difference between trunk and branch as a patch. * fixed .si to be immutable again, so copySegmentAsIs doesn't rewrite it. Instead we strip segment prefixes just like .CFS (LUCENE-5412) * disabled write-access to all old .SI writers, its no longer needed. * fixed segment/commit unique ID generation (bugs with sign bits). Also changed this to be a byte[] so it can be efficiently encoded. * Add CodecUtil.write/checkSegmentHeader, which is a regular header, plus the ID of the segment. This gives us mismatched files detection. * beefed up assertingcodec more, with assertinglivedocs * add lots of safety to .si/.fnm/.del * moved out cruft to backwards-codecs. I think this is a good point to merge, and then i will continue on with the other parts of the index. > Add Lucene50Codec > ----------------- > > Key: LUCENE-5969 > URL: https://issues.apache.org/jira/browse/LUCENE-5969 > Project: Lucene - Core > Issue Type: Improvement > Reporter: Michael McCandless > Fix For: 5.0, 6.0 > > Attachments: LUCENE-5969.patch, LUCENE-5969.patch > > > Spinoff from LUCENE-5952: > * Fix .si to write Version as 3 ints, not a String that requires parsing at > read time. > * Lucene42TermVectorsFormat should not use the same codecName as > Lucene41StoredFieldsFormat > It would also be nice if we had a "bumpCodecVersion" script so rolling a new > codec is not so daunting. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org