[ https://issues.apache.org/jira/browse/AVRO-135?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Philip Zeyliger updated AVRO-135: --------------------------------- Attachment: AVRO-135.patch.txt Attaching a patch which ditches the switch statements in exchange for "Codec" classes. I also switched to deflate (the rfc1951 variant). I ended up having a package-private Codec abstract class (with two extant implementations) and a public CodecOption class. The latter is necessary for users to set what codec they're using. One thing that doesn't currently work is setting compression options for append. We don't recover what compression strength you used when you re-open a file. I spent a good 30+ minutes scratching my head when I switched from GZIPInput/OutputFormat to DeflaterInput/OutputFormat. Whereas the GZIP classes are named Input/Outputformat, the Deflate classes are named DeflaterOutputFormat and InflaterInputFormat. Scott, thanks for the clear explanation of rfc1950 vs rfc1951. Do you know what Java library implements 1950? Java's docs are confusing as ever here (at least to me). > add compression to data files > ----------------------------- > > Key: AVRO-135 > URL: https://issues.apache.org/jira/browse/AVRO-135 > Project: Avro > Issue Type: New Feature > Components: java, spec > Reporter: Doug Cutting > Assignee: Philip Zeyliger > Priority: Blocker > Fix For: 1.3.0 > > Attachments: AVRO-135.patch.txt, AVRO-135.patch.txt > > > We should add support for at least one compression codec to data files. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.