Errors while reading archives from java.util.zip via ZipInputStream #279

TalgatAkhm · 2021-01-22T16:27:33Z

Reading a zip archive created by the standard java.util.zip classes fails with zip4j's ZipInputStream: it successfully reads only the first header in the archive and then cannot find any further entries.

Here is a full code example (it generates the zip and reads it back through the zip4j stream), with my reasoning below:

   
    @Test
    public void test() throws IOException {
        // Arrange
        String archivePath = "C:/Users/Public/test.zipByJavaUtilZip";
        File zipArchive = new File(archivePath);

        File folderToZip = new File("C:/Users/Public/FolderToZip");
        int expectedNumberOfEntries;
        try (java.util.zip.ZipOutputStream zos = new ZipOutputStream(new FileOutputStream(zipArchive))) {
            expectedNumberOfEntries = zipByJavaUtilZip(zos, folderToZip, folderToZip.listFiles());
            zos.flush();
        }

        // Act
        List<String> elements = new ArrayList<>();
        try(InputStream is = new FileInputStream(zipArchive);
                net.lingala.zip4j.io.inputstream.ZipInputStream zis = new ZipInputStream(is)) {

            LocalFileHeader fh;
            while((fh = zis.getNextEntry()) != null)
                elements.add(fh.getFileName());
        }

        // Act-assert
        // Prove via ZipFile that the archive really contains five entries
        net.lingala.zip4j.ZipFile zf = new net.lingala.zip4j.ZipFile(zipArchive);
        Assertions.assertEquals(expectedNumberOfEntries, zf.getFileHeaders().size());
        Assertions.assertEquals(5, zf.getFileHeaders().size());

        // Assert
        // The zip4j stream should have seen the same number of entries
        Assertions.assertEquals(expectedNumberOfEntries, elements.size()); // fails: elements.size() == 1
    }

    private static int zipByJavaUtilZip(ZipOutputStream zos, File rootFolder, File[] files) throws IOException {
        int entriesNumber = 0;
        for (File f : files) {
            if (f.isDirectory())
                entriesNumber += zipByJavaUtilZip(zos, rootFolder, f.listFiles());
            else {
                entriesNumber++;
                String path = rootFolder.toPath().relativize(f.toPath()).toString().replaceAll("\\\\", "/");
                ZipEntry entry = new ZipEntry(path);
                zos.putNextEntry(entry);
                try (InputStream fis = new FileInputStream(f)) {
                    IOUtils.copy(fis, zos);
                }
                zos.closeEntry();
            }
        }
        return entriesNumber;
    }

However, comparing the archives produced by zip4j and java.util.zip in a hex viewer, I found few differences between them. One is the general purpose flag, but bit 3 is set in both archives. That means the compressed size in the LFH is 0 and the actual size is written in the data descriptor (the descriptors are identical). So it is normal not to know the actual size while reading the LFH. The problem is that when I call zis.getNextEntry() a second time, the result is null, because zis.getNextEntry() calls readUntilEndOfEntry(), which contains this code:

    if (localFileHeader.isDirectory() || localFileHeader.getCompressedSize() == 0) {
      return;
    }
.....

This condition makes the function return early, because localFileHeader.getCompressedSize() is zero. Yet when an archive with the same content is created via zip4j's ZipFile or ZipOutputStream, the test above passes, which I find strange.

Moreover, if I work around it in my code as follows, problems appear when reading the LFHs of directories:
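
The header fields discussed above can be inspected without a hex viewer. The following is a minimal sketch, not part of the original report: it builds a one-entry archive in memory with java.util.zip and reads the general purpose flag (offset 6) and the compressed size (offset 18) of the first local file header. The class and method names are hypothetical, and it assumes the first LFH starts at byte 0 of the archive, which holds for an archive written this way.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.nio.ByteBuffer;
import java.nio.ByteOrder;
import java.nio.charset.StandardCharsets;
import java.util.zip.ZipEntry;
import java.util.zip.ZipOutputStream;

public class LocalHeaderProbe {

    /** Builds a one-entry archive in memory with java.util.zip (default DEFLATED method). */
    public static byte[] buildArchive() throws IOException {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (ZipOutputStream zos = new ZipOutputStream(bytes)) {
            // Sizes and CRC are not set on the entry, so they are unknown when the LFH is written.
            zos.putNextEntry(new ZipEntry("a.txt"));
            zos.write("hello hello hello".getBytes(StandardCharsets.UTF_8));
            zos.closeEntry();
        }
        return bytes.toByteArray();
    }

    /** General purpose bit flag: 2 little-endian bytes at offset 6 of the local file header. */
    public static int generalPurposeFlag(byte[] zip) {
        return ByteBuffer.wrap(zip).order(ByteOrder.LITTLE_ENDIAN).getShort(6) & 0xFFFF;
    }

    /** Compressed size: 4 little-endian bytes at offset 18 of the local file header. */
    public static long compressedSizeInHeader(byte[] zip) {
        return ByteBuffer.wrap(zip).order(ByteOrder.LITTLE_ENDIAN).getInt(18) & 0xFFFFFFFFL;
    }

    /** Bit 3 (mask 0x08) signals that sizes and CRC follow the data in a data descriptor. */
    public static boolean hasDataDescriptor(byte[] zip) {
        return (generalPurposeFlag(zip) & 0x08) != 0;
    }

    public static void main(String[] args) throws IOException {
        byte[] zip = buildArchive();
        System.out.println("data descriptor flag set: " + hasDataDescriptor(zip));
        System.out.println("compressed size in LFH:   " + compressedSizeInHeader(zip));
    }
}
```

If the flag is set and the header size is 0, a reader that relies only on the LFH compressed size has no way to know how many bytes to skip before the next header.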

if (!lfh.isDirectory())
   zis.readAllBytes();

I don't know exactly why this is happening or how to fix it (maybe it doesn't need to be fixed at all).
The test archive is attached. Thank you!
test.zip


TalgatAkhm · 2021-01-26T10:28:42Z

A quick fix is to read the content of every element (file or directory) after each call to ZipInputStream.getNextEntry(), even if nobody is going to use the data, for example:

zis.getNextEntry();
zis.readAllBytes();
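
InputStream.readAllBytes() needs Java 9 or later and buffers the whole entry in memory. A minimal sketch of the same workaround that discards the bytes instead is shown here; the helper name drain is hypothetical, and it assumes the stream's read returns -1 at the end of the current entry, which is what readAllBytes() relies on as well. The main method uses a ByteArrayInputStream only as a stand-in so the sketch runs without zip4j.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;

public class EntryDrainer {

    /** Reads and discards everything left in the stream; returns the number of bytes skipped. */
    public static long drain(InputStream in) throws IOException {
        byte[] buffer = new byte[8192];
        long total = 0;
        int read;
        while ((read = in.read(buffer)) != -1) {
            total += read;
        }
        return total;
    }

    public static void main(String[] args) throws IOException {
        // Stand-in for an entry's content. With zip4j the argument would be the
        // ZipInputStream itself, right after getNextEntry().
        InputStream entry = new ByteArrayInputStream(new byte[20000]);
        System.out.println("bytes discarded: " + drain(entry));
    }
}
```

In the reading loop this would be called as drain(zis) after each getNextEntry(), in place of zis.readAllBytes().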

The reason:
Bit 3 of the general purpose flag is set to 1, so compressedSize in the LFH is zero, and on the second call to getNextEntry() readUntilEndOfEntry() does nothing because compressedSize = 0. That is why zip4j tries to read the second LFH from the first element's content. So a quick fix in the library code is:

private void readUntilEndOfEntry() throws IOException {
    if (localFileHeader.isDirectory() || (localFileHeader.getCompressedSize() == 0 && !localFileHeader.isDataDescriptorExists())) {
      return;
    }
....

But I think this only addresses the consequence, not the cause.
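
The proposed early-return condition can be checked in isolation. This is a minimal sketch that models only the boolean expression from the snippet above as a pure function; the class and method names are hypothetical and it is not the code that was eventually committed to zip4j.

```java
public class SkipCondition {

    /**
     * Mirrors the condition proposed above: skip draining for directories, or for
     * entries whose header reports size 0 and that carry no data descriptor.
     */
    public static boolean canSkipDraining(boolean isDirectory,
                                          long compressedSize,
                                          boolean dataDescriptorExists) {
        return isDirectory || (compressedSize == 0 && !dataDescriptorExists);
    }

    public static void main(String[] args) {
        // File written by java.util.zip: header size 0, real size in the data descriptor.
        System.out.println(canSkipDraining(false, 0, true));
        // Genuinely empty file without a data descriptor.
        System.out.println(canSkipDraining(false, 0, false));
        // Directory entry.
        System.out.println(canSkipDraining(true, 0, false));
    }
}
```

The first case is the one from this report: with the extra check, a zero size in the header no longer causes an early return when a data descriptor is present.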

srikanth-lingala · 2021-01-28T06:37:53Z

Thanks for the detailed explanation and your analysis. Appreciate it. I have fixed the issue and will include the fix in the next release.


srikanth-lingala · 2021-02-15T04:06:34Z

Issue fixed in v2.7.0 which was released today.

srikanth-lingala self-assigned this Jan 28, 2021

srikanth-lingala added the in-progress label Jan 28, 2021

srikanth-lingala added a commit that referenced this issue Jan 28, 2021

#279 Check if data descriptor exists when reading until end of entry

13c1706

srikanth-lingala added the bug and resolved labels and removed the in-progress label Jan 28, 2021

srikanth-lingala added a commit that referenced this issue Jan 28, 2021

#279 Skip reading until end of entry only for files when data descriptor is set

d568afc

srikanth-lingala closed this as completed Feb 15, 2021

EvolveWorx mentioned this issue Feb 20, 2021

How to list zip entries using ZipInputStream for Android 11 SAF? #285

Closed

dependabot bot mentioned this issue Mar 18, 2021

Bump zip4j from 1.3.2 to 2.7.0 allure-framework/allure-maven#153

Closed
