This patch release includes the implementation for the SC'24 compression tutorial. Specifically:
- Updates cuSZp.h and cuSZp.cpp to increase compatibility.
- Updates the float32 plain-mode compression kernel with a partial re-execution design that reduces register usage.