[ https://issues.apache.org/jira/browse/COMPRESS-592?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17435881#comment-17435881 ]
Peter Lee commented on COMPRESS-592: ------------------------------------ Hi [~rolandkreuzer] Thank you for your reporting! I think I have located this problem. Will try to fix this soon. > Checksum verification failed reading 7z archive with more than 65536 entries > ---------------------------------------------------------------------------- > > Key: COMPRESS-592 > URL: https://issues.apache.org/jira/browse/COMPRESS-592 > Project: Commons Compress > Issue Type: Bug > Components: Compressors > Affects Versions: 1.21 > Environment: Compress 1.21 and XZ 1.9 on JDK 11; reproduced on both > Windows and Ubuntu Linux > Reporter: Roland Kreuzer > Priority: Major > Attachments: 0000_DOC.7z > > > I have a use-case where I have to decompress Sevenzip archives from an > external source which may have a large number of entries. > I found decompression fails when trying to extract entry 65536 (zero-based > index) with a checksum failure. > > I was able to reproduce the issue with a simple 7Zip file containing 70.001 > entries with random MD5 checksum textfiles (attached). > The sample Archive was created using the 7Zip Windows client and uses > LZMA2:3m. > > My code is a simple sequential read of all contents of the file like > {code:java} > @Test > void readBigSevenZipFile() throws IOException > { > try (SevenZFile sevenZFile = new SevenZFile(new > File("E:\\Temp\\0000_DOC.7z"))) > { > SevenZArchiveEntry entry = sevenZFile.getNextEntry(); > while (entry != null) > { > if (entry.hasStream()) > { > byte[] content = new byte[(int) entry.getSize()]; > sevenZFile.read(content); > System.out.println(entry.getName()); > } > entry = sevenZFile.getNextEntry(); > } > } > } > {code} > which fails consistently after file65535.txt with > {code:java} > java.io.IOException: Checksum verification failed > at > org.apache.commons.compress.utils.ChecksumVerifyingInputStream.read(ChecksumVerifyingInputStream.java:94) > ~[commons-compress-1.21.jar!/:1.21] > at > org.apache.commons.compress.archivers.sevenz.SevenZFile.read(SevenZFile.java:1905) > ~[commons-compress-1.21.jar!/:1.21] > at > org.apache.commons.compress.archivers.sevenz.SevenZFile.read(SevenZFile.java:1888) > ~[commons-compress-1.21.jar!/:1.21] > {code} > > It is noticeable that the value is 2 to the 16th power, which could suggest > an overflow error of some sorts. > > While the minimal sample contains only small txt files, I originally found > the issue with larger archives containing also Image and PDF files. The > archive's contents or size in byte does not seem to have direct influence on > the issue, only the number of files contained within. > > I did not find any workaround yet. -- This message was sent by Atlassian Jira (v8.3.4#803005)