[ https://issues.apache.org/jira/browse/HADOOP-7206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13048851#comment-13048851 ]
Alejandro Abdelnur commented on HADOOP-7206:
--------------------------------------------

Jake,

1. Snappy-Java bundles the native libraries in the JAR itself. While that is a convenient and clever packaging technique, it differs from how Hadoop handles native libraries (loading them from lib/native/${OS_ARCH}/).

2. The motivation for keeping hadoop-snappy independent of Hadoop was that we could use it right away in other projects (HBase has already integrated it).

I would strongly argue that native libraries should be handled in a consistent manner in Hadoop.

And if the preference of the Hadoop folks is to bundle snappy in Hadoop (dismissing #2), then I'd advocate for bringing Hadoop-Snappy into Hadoop, as this JIRA originally proposed. By doing this we would have 1 external dependency (snappy) instead of 2 (snappy-java and snappy, with the side effect that if we need a new version of snappy we would have to wait for snappy-java to do a release with it).

Thoughts?

> Integrate Snappy compression
> ----------------------------
>
> Key: HADOOP-7206
> URL: https://issues.apache.org/jira/browse/HADOOP-7206
> Project: Hadoop Common
> Issue Type: New Feature
> Affects Versions: 0.21.0
> Reporter: Eli Collins
> Attachments: HADOOP-7206.patch
>
>
> Google released Zippy as an open source (APLv2) project called Snappy
> (http://code.google.com/p/snappy). This tracks integrating it into Hadoop.
> {quote}
> Snappy is a compression/decompression library. It does not aim for maximum
> compression, or compatibility with any other compression library; instead, it
> aims for very high speeds and reasonable compression. For instance, compared
> to the fastest mode of zlib, Snappy is an order of magnitude faster for most
> inputs, but the resulting compressed files are anywhere from 20% to 100%
> bigger. On a single core of a Core i7 processor in 64-bit mode, Snappy
> compresses at about 250 MB/sec or more and decompresses at about 500 MB/sec
> or more.
> {quote}