Skip to content
Konstantin Nosov edited this page Feb 20, 2017 · 27 revisions

Test results

###Compared versions

For comparison we used Win32 assembly-based optimized version (Asm), Win32 C-based version (C), Win32 DLL-based version (DLL) and Win32 C-based version built from original sources (Orig). "DLL based version" utilized prebuilt zlib 1.2.3 obtained from Winimage zlib page. These DLL files were built from zlib with official Assembly patches.

Tests were built using Visual Studio 2010 with speed optimization. Test system is Windows 8.1 64-bit, Intel Core i7-4702MQ @ 2.2 GHz.

###Used data sets

For testing we used 3 data sets.

  1. Unreal engine 4 source code (Engine/Source)
  2. Unreal engine 4 binaries (Engine/Binaries/Win64)
  3. Graphical files containing geometry and textures in tif and png formats.

All data sets contains 256 Mb of data.

###Win64 notes

For 64-bit target, original C implementation ("Orig") is 6-10% slower than 32-bit build. The same is for "DLL" build. However optimized C version ("C") is faster on 64 bit than on 32-bit, so code gets additional performance boost just because of its 64-bitness.

For 32-bit target, optimized assembly version is ~5% faster than optimized C version, so 64-bit "C" version performance is somewhere between 32-bit "Asm" and 32-bit "C".

###Linux notes

I've tested library on 32-bit Ubuntu, compiled with GCC 5.4.0. Optimized code performs 1-4% slower than Win32 version. Original zlib implementation performs 7-8% slower.

###Results

Test mode is name of compared version, mentioned in Compared versions paragraph. Number is parenthesis is compression level. Each table cell contains data in following format: <elapsed time> / <compressed size> / <compression speed>.

As you may see, Asm version is just a little bit faster than C version. Optimized version performs 2-8 times faster than original C version.

Test mode Source code Binaries Geometry data
Asm (9) 11.8s / 51136013 b / 21.65 Mb/s 15.4s / 54456154 b / 16.64 Mb/s 25.4s / 114917505 b / 10.07 Mb/s
C (9) 12.3s / 51136013 b / 20.81 Mb/s 16.1s / 54456154 b / 15.88 Mb/s 26.6s / 114917505 b / 9.63 Mb/s
Dll (9) 22.6s / 51145811 b / 11.35 Mb/s 42.0s / 54485836 b / 6.10 Mb/s 150.0s / 114942962 b / 1.71 Mb/s
Orig (9) 26.3s / 51145811 b / 9.75 Mb/s 52.6s / 54485836 b / 4.87 Mb/s 184.4s / 114942962 b / 1.39 Mb/s
Clone this wiki locally