
support aborting on cache miss #828

Merged: 9 commits merged into main from abort-on-cache-miss on Feb 12, 2025
Conversation

matthiasdiener (Collaborator) commented on Feb 28, 2024

What do you think, @inducer? This would not only make debugging cache misses easier, it could also be used to automate determinism and more general caching tests (by setting LOOPY_ABORT_ON_CACHE_MISS to something true-ish and rerunning the tests).

TODO:

  • add for other cache loads?

Please squash
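The mechanism proposed here (consult an environment variable on a cache miss and raise instead of regenerating) could be sketched roughly as follows. This is a hypothetical illustration, not loopy's actual code: the function and exception names and the true-ish parsing are assumptions.

```python
import os


class CacheMissError(RuntimeError):
    """Raised on a cache miss when aborting has been requested."""


def fetch_or_generate(cache, key, generate):
    """Return cache[key]; on a miss, either abort or generate and store.

    Hypothetical sketch: any LOOPY_ABORT_ON_CACHE_MISS value other than
    "", "0", "no", or "false" is treated as true-ish here.
    """
    try:
        return cache[key]
    except KeyError:
        if os.environ.get("LOOPY_ABORT_ON_CACHE_MISS", "").lower() \
                not in ("", "0", "no", "false"):
            raise CacheMissError(f"cache miss for key {key!r}")
        value = generate()
        cache[key] = value
        return value
```

With the variable unset, a miss falls through to code generation as usual; with it set, the second run of a test suite on a warm cache fails loudly on any unexpected miss.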

@matthiasdiener matthiasdiener self-assigned this Feb 28, 2024
inducer (Owner) commented on Feb 29, 2024

I'm not wildly opposed, but I feel that it's also easy to just hack a 1/0 into that code path when needed. And for automated determinism tests, there are probably simpler approaches than to ask the codegen cache.

Also, "abort on cache miss" sounds like it would include all caches, but this applies to only one of many caches in loopy.

matthiasdiener (Collaborator, Author) commented on Feb 4, 2025

@inducer If possible, I would like to revisit a way to make cache misses automatically testable.

I'm not wildly opposed, but I feel that it's also easy to just hack a 1/0 into that code path when needed. And for automated determinism tests, there are probably simpler approaches than to ask the codegen cache.

Do you have a suggestion on another approach? I thought of perhaps adding a way for persistent_dict itself to abort on a miss, but I'm not sure whether that would be better.
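The persistent_dict variant mentioned above might look something like the following wrapper. This is a hedged sketch of the idea only; the class, method, exception, and environment-variable names are illustrative assumptions and do not reflect pytools' actual persistent_dict API.

```python
import os


class NoSuchEntryError(KeyError):
    """Raised on an ordinary (non-aborting) cache miss."""


class AbortingDict:
    """Hypothetical persistent-dict-like store that can abort on a miss."""

    def __init__(self, identifier):
        self.identifier = identifier
        self._store = {}  # a real persistent dict would use disk storage

    def store(self, key, value):
        self._store[key] = value

    def fetch(self, key):
        try:
            return self._store[key]
        except KeyError:
            # If aborting is requested, fail hard instead of signaling
            # a recoverable miss to the caller.
            if os.environ.get("ABORT_ON_PDICT_MISS"):
                raise RuntimeError(
                    f"abort: miss in {self.identifier!r} for key {key!r}")
            raise NoSuchEntryError(key)
```

Putting the check inside the dict would make every cache backed by it testable at once, rather than instrumenting each caching call site separately.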

inducer (Owner) commented on Feb 4, 2025

Do you have a suggestion on another approach?

Um, "hack a 1/0 into that code path when needed." 😁

matthiasdiener (Collaborator, Author) commented:
Do you have a suggestion on another approach?

Um, "hack a 1/0 into that code path when needed." 😁

That doesn't work for automated testing ;-)

inducer (Owner) commented on Feb 4, 2025

Ah, fair. 🤷 Let's do this then?

inducer (Owner) commented on Feb 4, 2025

Maybe add a comment to explain that the env var is for automated tests?

@@ -111,6 +111,7 @@ jobs:
  . ./ci-support-v0
  build_py_project_in_conda_env
  ( test_py_project )
+ export LOOPY_ABORT_ON_CACHE_MISS=1
  ( test_py_project )
matthiasdiener (Collaborator, Author) commented on Feb 4, 2025:
@inducer do you remember what the purpose of this pytest_twice CI check was/is? Is it just about seeing the time difference between the first pytest run and the second run?

inducer (Owner):

No, it's supposed to test that things still work with data pulled out of a cache, since the first round of tests runs on a cold cache by default.

matthiasdiener (Collaborator, Author):

Is it expected that running pytest like this will lead to cache misses on the second run (demonstrated by the CI in this PR)? I've tried to go back to earlier loopy versions, and it seems like these cache misses are not caused by recent changes.

inducer (Owner):

Huh, I would not expect there to be cache misses on the second go.

matthiasdiener (Collaborator, Author):

I suspect code like this (from test_c_execution.py) is responsible for the cache misses:

import numpy as np

def __get_kernel(order="C"):
    indices = ["i", "j", "k"]
    # np.random is unseeded here, so the sizes differ between runs,
    # which changes the generated code and defeats the cache
    sizes = tuple(np.random.randint(1, 11, size=len(indices)))
    # ...

Would you prefer me to make these sizes constant across runs, or should we just ignore the cache miss?

inducer (Owner):

Setting a seed (and converting to the new-style numpy RNG interface) seems like the way to go.
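The seeded, new-style RNG fix suggested above could look roughly like this. The helper name and the seed value are illustrative assumptions, not the actual change in the PR:

```python
import numpy as np


def get_sizes(n_indices=3, seed=17):
    # A seeded new-style Generator (np.random.default_rng) produces the
    # same "random" sizes on every run, so the generated kernel, and
    # hence its cache key, is identical across test invocations.
    # (the seed value 17 is an arbitrary illustrative choice)
    rng = np.random.default_rng(seed)
    return tuple(int(s) for s in rng.integers(1, 11, size=n_indices))
```

Unlike the legacy `np.random.randint` call, this keeps the randomness local to a Generator object instead of mutating global state, which also makes tests independent of execution order.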

matthiasdiener (Collaborator, Author):

Considering the multitude of issues encountered, it is probably not realistic to enable this in CI in the near future. I'll disable the export for now.

@matthiasdiener matthiasdiener marked this pull request as ready for review February 7, 2025 00:04
@inducer inducer merged commit a46bd53 into main Feb 12, 2025
18 checks passed
@inducer inducer deleted the abort-on-cache-miss branch February 12, 2025 18:55