Documentation: better explanations on compressor behaviour, compression levels and parameters are welcome #3698
Comments
In your test it seems level 6 is most compressed
For reference,
Compression performance is still all over the place across most of the range,
documentation updated
following recommendations by @zougloub at facebook#3698
When my kids need to compress stuff, I will tell them to use zstd of course, and I will probably tell them to RTFM.
Now I wanted to double-check the zstd man page so as to be sure that the documentation will be straightforward, and I have some comments. Below I will sometimes use the word "should" but keep in mind it's all suggestions.
In the zstd executable man page, the introductory DESCRIPTION is missing some high-level information:
Later the man page first mentions compression levels:
One may find this a bit terse. I mean, there's not even a terminating period on this line.
And this bullet point in the manual, I think, should be augmented with a concise sentence or paragraph, mentioning:
Then, later, the manual has ADVANCED COMPRESSION OPTIONS, which currently says:
There must be a typo here, and I think that:
the --zstd=options item should be the first entry in the section; it's arguably more important than -B
"\n".join(str(i) for i in range(1<<20)).encode()
(7277497 bytes uncompressed):
We can see that the compression time may decrease, and/or the compression ratio may decrease, as the compression level is raised.
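To make the observation above reproducible, here is a minimal sketch that rebuilds the same test data and pipes it through the zstd executable at a few levels. The helper names make_sample and compress_with_cli are made up for illustration, the choice of levels is arbitrary, and the sketch assumes a zstd binary is on PATH (it skips the measurements otherwise). Timings include process start-up, so treat them as rough.

```python
import shutil
import subprocess
import time


def make_sample() -> bytes:
    """Decimal integers 0 .. 2**20 - 1, one per line, no trailing newline."""
    return "\n".join(str(i) for i in range(1 << 20)).encode()


def compress_with_cli(data: bytes, level: int) -> tuple[int, float]:
    """Pipe data through `zstd -<level> -q -c`; return (compressed size, seconds)."""
    start = time.perf_counter()
    result = subprocess.run(
        ["zstd", f"-{level}", "-q", "-c"],  # read stdin, write stdout
        input=data,
        stdout=subprocess.PIPE,
        check=True,
    )
    return len(result.stdout), time.perf_counter() - start


if __name__ == "__main__":
    sample = make_sample()
    print(len(sample), "bytes uncompressed")  # 7277497
    if shutil.which("zstd") is None:
        print("zstd executable not found; skipping the measurements")
    else:
        # Arbitrary subset of levels, chosen only to keep the run short.
        for level in (1, 3, 6, 9):
            size, seconds = compress_with_cli(sample, level)
            print(f"level {level:2d}: {size:8d} bytes, {seconds:.2f} s")
```

Sizes and times depend on the zstd version and the machine, so no expected figures are given here.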
The DICTIONARY BUILDER and BENCHMARK sections should be moved after the compression options.
The BENCHMARK section should feature an introductory statement, such as "The zstd CLI provides a benchmarking mode that can be used to easily find suitable compression parameters, or alternatively to benchmark a computer's performance." It could also be useful to state that benchmarking to find compression options should be performed on a representative data set.
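As an illustration of what such an introduction could point to, the sketch below only assembles a benchmark command line from the CLI's -b# (first level), -e# (last level) and -i# (minimum evaluation time in seconds) options. The helper name benchmark_command and the file name sample.dat are hypothetical; the file stands in for a representative data set.

```python
def benchmark_command(files, first_level=1, last_level=19, min_seconds=1):
    """Build a `zstd` benchmark invocation covering first_level..last_level."""
    return [
        "zstd",
        f"-b{first_level}",   # benchmark starting at this level
        f"-e{last_level}",    # ... up to and including this level
        f"-i{min_seconds}",   # minimum evaluation time per level, in seconds
        *files,
    ]


if __name__ == "__main__":
    # sample.dat is a placeholder for a representative data set.
    print(" ".join(benchmark_command(["sample.dat"], 1, 9)))
```

The resulting list can be passed to subprocess.run once a real file is substituted.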
SEE ALSO should point to the zstd manual, which should be installed with zstd, and maybe to the website, since later some other websites are mentioned.