use numba rather than C++ for fast polymer calculations #103
Conversation
If you merge this, you should consider rewriting reflmodule with numba as well, in light of #100.
Thanks for pinging me on this!
refl1d/polymer.py
Outdated
LAMBDA_ARRAY = np.array([LAMBDA_1, LAMBDA_0, LAMBDA_1])
MINLAT = 25
MINBULK = 5
SQRT_PI = sqrt(pi)

def correlate_same(a, b):
    return np.correlate(a, b, 'same')
Looking up this correlation mode was absurdly expensive in the pure Python mode back in the day; that's why I was selecting it with the weird internal int. See numpy/numpy#4999 and consider using multiarray.correlate based on timings.
I didn't notice significant timing differences when I chose one versus the other, so I went with the newer interface.
Yes, the multiarray.correlate function is faster when I test it alone:
In [187]: %timeit np.convolve(x, y, 'same')
5.4 µs ± 32.3 ns per loop (mean ± std. dev. of 7 runs, 100000 loops each)
In [188]: %timeit np.correlate(x, y, 'same')
3.05 µs ± 132 ns per loop (mean ± std. dev. of 7 runs, 100000 loops each)
In [189]: %timeit np.core.multiarray.correlate(x, y, 1)
1.5 µs ± 27 ns per loop (mean ± std. dev. of 7 runs, 1000000 loops each)
Trying again, I now see the times dropping significantly (5.7s down to 4.0s), so I'll revert to the deprecated function.
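For reference, a small guard can keep the faster entry point without giving up a working fallback. This is only a sketch: it assumes an older NumPy where numpy.core.multiarray.correlate is still importable, and correlate_same mirrors the helper in the diff above.

import numpy as np

try:
    # Deprecated low-level routine that skips the mode-string lookup;
    # the integer mode 1 corresponds to 'same'.
    from numpy.core.multiarray import correlate as _fast_correlate

    def correlate_same(a, b):
        return _fast_correlate(a, b, 1)
except (ImportError, AttributeError):
    # Newer NumPy may not expose numpy.core.multiarray, so fall back
    # to the public API.
    def correlate_same(a, b):
        return np.correlate(a, b, 'same')

Callers only ever see correlate_same, so the choice of backend stays an implementation detail.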
refl1d/polymer.py
Outdated
except ImportError:
    USE_NUMBA = False

USE_NUMBA = True  # Uncomment when doing timing tests
I guess this line should be commented out?
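For clarity, the suggestion amounts to something like the following, with the override left commented out so the flag only reflects whether numba actually imported (the try block is paraphrased from the diff rather than quoted):

try:
    from numba import njit
    USE_NUMBA = True
except ImportError:
    USE_NUMBA = False
# USE_NUMBA = True  # Uncomment when doing timing tests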
refl1d/polymer.py
Outdated
for r in range(1, segments):
    g_zs[:, r] = pg_zs = correlate(pg_zs, LAMBDA_ARRAY, 1) * g_z

try:
    from numba import njit, prange
I'm interested to know if prange helps for large numbers of segments. It seems like it could lead to cache contention in most cases, though.
prange made no significant difference in my tests, so I stopped using it.
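For context, a sketch of the variant that was tried: only the per-layer update over lattice sites is a prange candidate, because the loop over segments carries the recurrence and must stay serial. Names and transition weights below are illustrative, not the refl1d code.

import numpy as np
from numba import njit, prange

LAMBDA_0 = 4.0 / 6.0  # illustrative lattice transition weights
LAMBDA_1 = 1.0 / 6.0

@njit(parallel=True)
def propagate_prange(g_z, segments):
    # The segment loop is serial (each layer depends on the previous one);
    # prange parallelizes the update across lattice sites within a layer.
    n = g_z.size
    g_zs = np.empty((n, segments))
    g_zs[:, 0] = g_z
    for r in range(1, segments):
        for i in prange(n):
            # 3-point stencil equivalent to a 'same'-mode correlation with
            # [LAMBDA_1, LAMBDA_0, LAMBDA_1], zero-padded at the edges.
            acc = LAMBDA_0 * g_zs[i, r - 1]
            if i > 0:
                acc += LAMBDA_1 * g_zs[i - 1, r - 1]
            if i < n - 1:
                acc += LAMBDA_1 * g_zs[i + 1, r - 1]
            g_zs[i, r] = acc * g_z[i]
    return g_zs

With a stencil this short, the per-iteration work is tiny, which is consistent with prange showing no measurable benefit except perhaps for very wide lattices.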
Note: similar to code from https://github.com/richardsheridan/sf_nscf/blob/3e72e5f3777928e8eb844f93bd48549bca8c9041/util.py#L185
The numba version of compose made no difference to the speed of the tests, so I did not include it.
This version drops the C++ code entirely, falling back to the numpy version if numba is not available.
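As a rough sketch of that structure (names and weights are illustrative, not the refl1d API): the module picks an implementation at import time, and the numba path trims a full convolution, since numba's np.convolve/np.correlate take no mode argument and the kernel is symmetric anyway.

import numpy as np

try:
    from numba import njit
    USE_NUMBA = True
except ImportError:
    USE_NUMBA = False

LAMBDA_0 = 4.0 / 6.0  # illustrative transition weights
LAMBDA_1 = 1.0 / 6.0
LAMBDA_ARRAY = np.array([LAMBDA_1, LAMBDA_0, LAMBDA_1])

def _propagate_numpy(g_z, segments):
    # Pure-numpy fallback: 'same'-mode correlation with the 3-point kernel.
    g_zs = np.empty((g_z.size, segments))
    g_zs[:, 0] = pg_zs = g_z
    for r in range(1, segments):
        g_zs[:, r] = pg_zs = np.correlate(pg_zs, LAMBDA_ARRAY, 'same') * g_z
    return g_zs

if USE_NUMBA:
    @njit(cache=True)
    def propagate(g_z, segments):
        # Same recurrence compiled by numba; trimming one element from each
        # end of the full convolution reproduces 'same' mode for a length-3
        # kernel.
        g_zs = np.empty((g_z.size, segments))
        pg_zs = g_z
        g_zs[:, 0] = pg_zs
        for r in range(1, segments):
            pg_zs = np.convolve(pg_zs, LAMBDA_ARRAY)[1:-1] * g_z
            g_zs[:, r] = pg_zs
        return g_zs
else:
    propagate = _propagate_numpy

Callers import propagate and never need to know which backend was selected, which is what lets the C++ code go away without changing the calling code.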