[ https://issues.apache.org/jira/browse/APEXMALHAR-2487?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15999009#comment-15999009 ]
ASF GitHub Bot commented on APEXMALHAR-2487:
--------------------------------------------

GitHub user ilganeli opened a pull request:

    https://github.com/apache/apex-malhar/pull/616

    APEXMALHAR-2487 Added support for Snappy compression in FilterStreamProvider

    * Based on existing code to output Gzip or CipherText this patch adds support for writing data out as Hadoop-readable Snappy format
    * Added unit tests which validate both the provider and the simpler SnappyStream functionality.
    * This patch reuses some code from existing tests where possible.

    https://issues.apache.org/jira/browse/APEXMALHAR-2487

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/ilganeli/incubator-apex-malhar APEXMALHAR-2487

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/apex-malhar/pull/616.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #616

----
commit e756309c676778917ae0464c1ec44a9cf84fbe2b
Author: Ilya Ganelin <ilya.gane...@capitalone.com>
Date:   2017-04-29T04:54:06Z

    Added support for Snappy compression in FilterStreamProvider, which in turn enables Snappy output.

commit 9ed1e7e3e0bb357f4cdad0b3cfc34cc40404d9f3
Author: Ilya Ganelin <ilya.gane...@capitalone.com>
Date:   2017-04-29T04:58:55Z

    Added additional check for presence of native Snappy libraries.

commit 9e442c8f41284d13fef1f2ede99a17f7e105c06d
Author: Ilya Ganelin <ilya.gane...@capitalone.com>
Date:   2017-04-29T05:18:46Z

    Checkstyle fixes.

commit 4f19d45d98f88d17dcde2e3b040b68f9f8a7f1b8
Author: Ilya Ganelin <ilya.gane...@capitalone.com>
Date:   2017-04-29T05:41:10Z

    Fixed header.

commit 70a30e6ef855afb5cb5399a84c271fa93ef23c05
Author: Ilya Ganelin <ilya.gane...@capitalone.com>
Date:   2017-05-05T21:48:14Z

    Adressed PR comments.
----

> Malhar should support outputting data in Snappy compression
> -----------------------------------------------------------
>
>                 Key: APEXMALHAR-2487
>                 URL: https://issues.apache.org/jira/browse/APEXMALHAR-2487
>             Project: Apache Apex Malhar
>          Issue Type: Improvement
>            Reporter: Ilya Ganelin
>            Assignee: Ilya Ganelin
>
> At present, the default file output operator (AbstractFileOutputOperator)
> supports compression by setting the FilterStreamProvider. However, Malhar
> presently only includes two FilterStreamProvider - one to Cipher data, and
> one for Gzip.
> Snappy offers substantially improved performance over Gzip in terms of
> compression and decompression speed at the expense of compression ratio. In
> certain applications this is useful. Thus, it would be helpful to add a
> Snappy FilterStreamProvider.

--
This message was sent by Atlassian JIRA
(v6.3.15#6346)