Add different dict modes to compression ratio regression test, update results.csv #2559

senhuang42 · 2021-03-25T17:47:43Z

On the row-hash PR, DMS was silently broken until recently fixed/discovered - our coverage of tests on different dict modes (DMS, DDS, Copy, Load) is not particularly high. This should be a good first step in ensuring we detect such things. (The row-hash PR adds a DMS unit test in fuzzer.c as well, to check for regressions)

This PR:

Adds ratio regression tests to clevels >= 1 for each dictMode when testing using advanced API that can honor requested parameters.
Updates results.csv
- New rows with new dictmode tests
- Adjustments for improvements introduced with block splitter and levels 16, 19
- Minor spelling fix
Fixes compiler warnings when building this regression test

Testing: I tested this on row-hash with a broken DMS, and indeed the compression results were bad.

…piler warnings

senhuang42 · 2021-03-25T18:00:52Z

tests/regression/config.c

+        .cli_args = "-" #x,                                                       \
+        .param_values = PARAM_VALUES(level_##x##_param_values_dictload),          \
+        .use_dictionary = 1,                                                      \
+        .advanced_api_only = 1,                                                   \


So I added the new configs to LEVEL() so we automatically get testing for all the levels we try.

An alternative approach is to have each of these just be their own config, with a fixed compression level (and if we wanted to test more clevels/strategies, we could just add more configs). But that does have the downside of not automatically hitting all the tested clevels.

I do prefer having these run on all the compression levels, in case in the future, strategies get changed in a way that might affect the dictionary strategies.

makes sense

terrelln · 2021-03-25T19:30:20Z

Thanks for adding this Sen! Really glad to see increased coverage of dictionary compression.

senhuang42 added 2 commits March 25, 2021 10:39

Add tests to regression tests for dict

1cadf86

Restrict dictmode regression tests only to advanced API, fix some com…

f27e326

…piler warnings

facebook-github-bot added the CLA Signed label Mar 25, 2021

senhuang42 commented Mar 25, 2021

View reviewed changes

senhuang42 force-pushed the add_dict_regression_tests_backup branch from d980071 to bbbd578 Compare March 25, 2021 18:11

Update results.csv

bbbd578

Cyan4973 approved these changes Mar 25, 2021

View reviewed changes

terrelln approved these changes Mar 25, 2021

View reviewed changes

senhuang42 merged commit ab216bc into facebook:dev Mar 25, 2021

senhuang42 mentioned this pull request May 11, 2021

🎉 Zstd 1.5.0 Release 🎉 #2636

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add different dict modes to compression ratio regression test, update results.csv #2559

Add different dict modes to compression ratio regression test, update results.csv #2559

senhuang42 commented Mar 25, 2021 •

edited

Loading

senhuang42 Mar 25, 2021

Cyan4973 Mar 25, 2021

terrelln commented Mar 25, 2021

Add different dict modes to compression ratio regression test, update results.csv #2559

Add different dict modes to compression ratio regression test, update results.csv #2559

Conversation

senhuang42 commented Mar 25, 2021 • edited Loading

senhuang42 Mar 25, 2021

Choose a reason for hiding this comment

Cyan4973 Mar 25, 2021

Choose a reason for hiding this comment

terrelln commented Mar 25, 2021

senhuang42 commented Mar 25, 2021 •

edited

Loading