TAR in ustar format with long file names: TarEntry name is incomplete #121

alexmbaker · 2016-07-06T08:06:10Z

Steps to reproduce

Obtain a TAR with long file names - eg http://registry.npmjs.org/npm/-/npm-3.10.3.tgz
Follow the directions to extract from the archive provided in the project WiKi https://github.com/icsharpcode/SharpZipLib/wiki/GZip-and-Tar-Samples#-simple-full-extract-from-a-tgz-targz
Run the extract

Expected behavior

The files should be extracted in the same folder structure as it would be using 7Zip. The extracted should look something like

.-- package
     |-- bin
     +-- node_modules
     |    |-- abbrev
     |    |--   
     |    +-- read-package-json
     |    +-- etc etc

Actual behavior

Where the file names including the relative path is long the files are extracted in the wrong place

.-- package
|    |-- bin
|    +-- node_modules
|    |    |-- abbrev
|    |    |--    
|
|-- read-package-json
|-- etc etc

Additional Information

The code in TarInputStream.GetNextEntry appears to only try to read the additional information if the header has a typeFlag equivelent to L. Looking at this documentation https://www.gnu.org/software/tar/manual/html_chapter/tar_14.html it appears different versions of the format do different things. The implementation in SharpZipLib appears to follow the 'old gnu' way but does have support for the new / current way which appears to be to read the field and see what is there.

Version of SharpZipLib

SharpZipLib.0.86.0

Obtained from (place an x between the brackets for all that apply)

The text was updated successfully, but these errors were encountered:

siegfriedpammer · 2017-08-18T10:04:53Z

I tried reproducing this bug. I extracted npm-3.10.3.tgz using 7zip and TarArchive.ExtractContents and both directory structures are identical. Can you provide a dump of the directory structure you are getting? What code are you using to extract the files?

Fixes icsharpcode#121 Only "path" keyword supported as it's used for non-GNU long file names.

Fixes #121 Only "path" keyword supported as it's used for non-GNU long file names.

McNeight added bug tar Related to TAR file format labels Aug 6, 2016

McNeight added this to the 1.0 milestone Aug 6, 2016

McNeight self-assigned this Aug 6, 2016

piksel mentioned this issue Jul 1, 2018

writing correct Tar UTF8 filenames #182

Closed

piksel added a commit to piksel/SharpZipLib that referenced this issue Jul 1, 2018

Add repro for icsharpcode#121 (and icsharpcode#182)

ec138b9

piksel added a commit to piksel/SharpZipLib that referenced this issue Jul 1, 2018

Add support for POSIX Extended Headers

376e8b3

Fixes icsharpcode#121 Only "path" keyword supported as it's used for non-GNU long file names.

piksel mentioned this issue Jul 1, 2018

Add support for POSIX Extended Headers #240

Merged

piksel self-assigned this Jul 1, 2018

piksel closed this as completed in #240 Jul 12, 2018

piksel added a commit that referenced this issue Jul 12, 2018

Merge PR #240, Add support for POSIX Extended Headers

4ee3b24

Fixes #121 Only "path" keyword supported as it's used for non-GNU long file names.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

TAR in ustar format with long file names: TarEntry name is incomplete #121

TAR in ustar format with long file names: TarEntry name is incomplete #121

alexmbaker commented Jul 6, 2016

siegfriedpammer commented Aug 18, 2017

TAR in ustar format with long file names: TarEntry name is incomplete #121

TAR in ustar format with long file names: TarEntry name is incomplete #121

Comments

alexmbaker commented Jul 6, 2016

Steps to reproduce

Expected behavior

Actual behavior

Additional Information

Version of SharpZipLib

Obtained from (place an x between the brackets for all that apply)

siegfriedpammer commented Aug 18, 2017