[ https://issues.apache.org/jira/browse/BEAM-8564?focusedWorklogId=369351&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-369351 ]
ASF GitHub Bot logged work on BEAM-8564:
----------------------------------------

                Author: ASF GitHub Bot
            Created on: 09/Jan/20 20:53
            Start Date: 09/Jan/20 20:53
    Worklog Time Spent: 10m
      Work Description: lukecwik commented on pull request #10254: [BEAM-8564] Add LZO compression and decompression support
URL: https://github.com/apache/beam/pull/10254#discussion_r364953576

 ##########
 File path: sdks/java/core/src/test/java/org/apache/beam/sdk/io/CompressedSourceTest.java
 ##########
 @@ -235,6 +315,30 @@ public void testReadConcatenatedGzip() throws IOException {
     assertEquals(Bytes.asList(expected), actual);
   }

+  /**
+   * Using Lzo Codec Test a concatenation of lzo files is correctly decompressed.
+   *
+   * <p>A concatenation of lzo files as one file is a valid lzo file and should decompress to be the
+   * concatenation of those individual files.
+   */
+  @Test
+  public void testReadConcatenatedLzo() throws IOException {

 Review comment:
   Can we either add support for multistream or throw an exception if the stream isn't finished? It would be dangerous for users to have part of their data silently dropped in this scenario.

   We should also add to the comment that concatenated streams aren't supported.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Issue Time Tracking
-------------------

    Worklog Id:     (was: 369351)
    Time Spent: 6h 50m  (was: 6h 40m)

> Add LZO compression and decompression support
> ---------------------------------------------
>
>                 Key: BEAM-8564
>                 URL: https://issues.apache.org/jira/browse/BEAM-8564
>             Project: Beam
>          Issue Type: New Feature
>          Components: sdk-java-core
>            Reporter: Amogh Tiwari
>            Assignee: Amogh Tiwari
>            Priority: Minor
>          Time Spent: 6h 50m
>  Remaining Estimate: 0h
>
> LZO is a lossless data compression algorithm which is focused on compression
> and decompression speeds.
> This will enable Apache Beam sdk to compress/decompress files using LZO
> compression algorithm.
> This will include the following functionalities:
> # compress() : for compressing files into an LZO archive
> # decompress() : for decompressing files archived using LZO compression
> Appropriate Input and Output stream will also be added to enable working with
> LZO files.

--
This message was sent by Atlassian Jira
(v8.3.4#803005)
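
On the review comment's second option (throw an exception if the stream isn't finished), the following is a minimal sketch of the idea, not the code from the pull request. It assumes a decoder that stops at the end of the first compressed stream; since the JDK has no LZO codec, java.util.zip's InflaterInputStream (zlib) is used as a stand-in with that same behaviour. The class name StrictSingleStreamDemo and the method readSingleStreamOrFail are hypothetical names chosen for illustration.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.zip.DeflaterOutputStream;
import java.util.zip.Inflater;
import java.util.zip.InflaterInputStream;

public class StrictSingleStreamDemo {

  /** Compresses text into a single zlib stream (stand-in for one LZO file). */
  public static byte[] deflate(String text) throws IOException {
    ByteArrayOutputStream bytes = new ByteArrayOutputStream();
    try (DeflaterOutputStream out = new DeflaterOutputStream(bytes)) {
      out.write(text.getBytes(StandardCharsets.UTF_8));
    }
    return bytes.toByteArray();
  }

  /**
   * Decompresses exactly one stream and fails loudly if any input is left over,
   * instead of silently dropping whatever follows the first stream.
   */
  public static byte[] readSingleStreamOrFail(InputStream in) throws IOException {
    Inflater inflater = new Inflater();
    ByteArrayOutputStream out = new ByteArrayOutputStream();
    try (InflaterInputStream decoder = new InflaterInputStream(in, inflater)) {
      byte[] buf = new byte[4096];
      int n;
      while ((n = decoder.read(buf)) != -1) {
        out.write(buf, 0, n);
      }
      // Leftover input can sit in two places: bytes the decoder already
      // buffered but did not consume, and bytes never pulled from the source.
      if (inflater.getRemaining() > 0 || in.read() != -1) {
        throw new IOException(
            "Trailing data after end of compressed stream; "
                + "concatenated streams are not supported");
      }
    } finally {
      // We supplied our own Inflater, so we must release it ourselves.
      inflater.end();
    }
    return out.toByteArray();
  }

  public static void main(String[] args) throws IOException {
    byte[] first = deflate("hello");
    byte[] second = deflate("world");

    byte[] ok = readSingleStreamOrFail(new ByteArrayInputStream(first));
    System.out.println(new String(ok, StandardCharsets.UTF_8));

    // Concatenate two independent streams into one input.
    ByteArrayOutputStream concatenated = new ByteArrayOutputStream();
    concatenated.write(first, 0, first.length);
    concatenated.write(second, 0, second.length);
    try {
      readSingleStreamOrFail(new ByteArrayInputStream(concatenated.toByteArray()));
    } catch (IOException e) {
      System.out.println("rejected: " + e.getMessage());
    }
  }
}
```

Without the trailing-data check, the second call would return only the contents of the first stream, which is the silent data loss the review comment warns about. Whether an LZO decoder exposes an equivalent of getRemaining() depends on the library used, so a real implementation may need a different way to detect unread input.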