[ 
https://issues.apache.org/jira/browse/LUCENE-5969?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Robert Muir updated LUCENE-5969:
--------------------------------
    Attachment: LUCENE-5969.patch

Here is a difference between trunk and branch as a patch.
* fixed .si to be immutable again, so copySegmentAsIs doesn't rewrite it. 
Instead we strip segment prefixes just like .CFS (LUCENE-5412)
* disabled write-access to all old .SI writers, its no longer needed.
* fixed segment/commit unique ID generation (bugs with sign bits). Also changed 
this to be a byte[] so it can be efficiently encoded.
* Add CodecUtil.write/checkSegmentHeader, which is a regular header, plus the 
ID of the segment. This gives us mismatched files detection.
* beefed up assertingcodec more, with assertinglivedocs
* add lots of safety to .si/.fnm/.del
* moved out cruft to backwards-codecs.

I think this is a good point to merge, and then i will continue on with the 
other parts of the index.

> Add Lucene50Codec
> -----------------
>
>                 Key: LUCENE-5969
>                 URL: https://issues.apache.org/jira/browse/LUCENE-5969
>             Project: Lucene - Core
>          Issue Type: Improvement
>            Reporter: Michael McCandless
>             Fix For: 5.0, 6.0
>
>         Attachments: LUCENE-5969.patch, LUCENE-5969.patch
>
>
> Spinoff from LUCENE-5952:
>   * Fix .si to write Version as 3 ints, not a String that requires parsing at 
> read time.
>   * Lucene42TermVectorsFormat should not use the same codecName as 
> Lucene41StoredFieldsFormat
> It would also be nice if we had a "bumpCodecVersion" script so rolling a new 
> codec is not so daunting.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to