[ 
https://issues.apache.org/jira/browse/BEAM-8564?focusedWorklogId=357405&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-357405
 ]

ASF GitHub Bot logged work on BEAM-8564:
----------------------------------------

                Author: ASF GitHub Bot
            Created on: 10/Dec/19 21:01
            Start Date: 10/Dec/19 21:01
    Worklog Time Spent: 10m 
      Work Description: amoght commented on issue #10254: [BEAM-8564] Add LZO 
compression and decompression support
URL: https://github.com/apache/beam/pull/10254#issuecomment-564256222
 
 
   While studying the code, we found that the airlift/ aircompressor library 
only requires some classes which are also present in apache hadoop common 
package(~3.9MB). Therefore, we are now thinking that if we make changes in the 
airlift/ aircompressor package, replacing the 
   com.facebook.presto.hadoop with org.apache.hadoop.common and remove other 
compression mechanisms(like zstd, gzip etc) while only keeping the required LZO 
package.
   But if we go ahead with this approach, we will have to manually update this 
library whenever any changes are made to the airlift/aircompressor's LZO 
package.
   @lukecwik @gsteelman please provide your thoughts on this.
 
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
-------------------

    Worklog Id:     (was: 357405)
    Time Spent: 4h  (was: 3h 50m)

> Add LZO compression and decompression support
> ---------------------------------------------
>
>                 Key: BEAM-8564
>                 URL: https://issues.apache.org/jira/browse/BEAM-8564
>             Project: Beam
>          Issue Type: New Feature
>          Components: sdk-java-core
>            Reporter: Amogh Tiwari
>            Assignee: Amogh Tiwari
>            Priority: Minor
>          Time Spent: 4h
>  Remaining Estimate: 0h
>
> LZO is a lossless data compression algorithm which is focused on compression 
> and decompression speeds.
> This will enable Apache Beam sdk to compress/decompress files using LZO 
> compression algorithm. 
> This will include the following functionalities:
>  # compress() : for compressing files into an LZO archive
>  # decompress() : for decompressing files archived using LZO compression
> Appropriate Input and Output stream will also be added to enable working with 
> LZO files.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to