[ 
https://issues.apache.org/jira/browse/AVRO-135?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Philip Zeyliger updated AVRO-135:
---------------------------------

    Attachment: AVRO-135.patch.txt

Attaching a patch which ditches the switch statements in exchange for "Codec" 
classes.  I also switched to deflate (the rfc1951 variant).

I ended up having a package-private Codec abstract class (with two extant 
implementations) and a public CodecOption class.  The latter is necessary for 
users to set what codec they're using.  One thing that doesn't currently work 
is setting compression options for append.  We don't recover what compression 
strength you used when you re-open a file.

I spent a good 30+ minutes scratching my head when I switched from 
GZIPInput/OutputFormat to DeflaterInput/OutputFormat.  Whereas the GZIP classes 
are named Input/Outputformat, the Deflate classes are named 
DeflaterOutputFormat and InflaterInputFormat.

Scott, thanks for the clear explanation of rfc1950 vs rfc1951.  Do you know 
what Java library implements 1950?  Java's docs are confusing as ever here (at 
least to me).

> add compression to data files
> -----------------------------
>
>                 Key: AVRO-135
>                 URL: https://issues.apache.org/jira/browse/AVRO-135
>             Project: Avro
>          Issue Type: New Feature
>          Components: java, spec
>            Reporter: Doug Cutting
>            Assignee: Philip Zeyliger
>            Priority: Blocker
>             Fix For: 1.3.0
>
>         Attachments: AVRO-135.patch.txt, AVRO-135.patch.txt
>
>
> We should add support for at least one compression codec to data files.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to