
CUDA/HIP: fix tests/test-backend-ops #8896

Merged
merged 1 commit into ggerganov:master on Aug 7, 2024

Conversation

JohannesGaessler (Collaborator)

Fixes #8864.
The problem is simply that I forgot to adapt the HIP test logic at some point.

Fixes #8863.
The problem is that some test cases are (according to the corresponding supports_op functions) supported by the CUDA backend but not by the CPU backend. As a consequence, when running the tests normally these cases are skipped because there are no CPU results to compare the CUDA results against. In performance mode they are executed, however, and trigger asserts. Frankly, without a way to assert that the produced results are actually correct, such performance numbers would be useless anyway. So I edited the test code in such a way that performance is only evaluated for those ops where correctness can also be tested.
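
As a minimal sketch of the gating described above (illustrative, not the exact PR diff; the helper name is made up, but `ggml_backend_supports_op` is the public ggml API):

```cpp
#include "ggml-backend.h"

// Run the perf measurement for an op only if both the backend under test
// and the reference (CPU) backend claim to support it, mirroring the check
// already done in correctness mode.
static bool perf_testable(ggml_backend_t backend1, ggml_backend_t backend2,
                          const struct ggml_tensor * op) {
    return ggml_backend_supports_op(backend1, op) &&
           ggml_backend_supports_op(backend2, op);
}
```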

github-actions bot added the testing (Everything test related) label on Aug 6, 2024
```diff
@@ -539,7 +539,7 @@ struct test_case {
         return false;
     }

-    bool eval_perf(ggml_backend_t backend, const char * op_name) {
+    bool eval_perf(ggml_backend_t backend1, ggml_backend_t backend2, const char * op_name) {
```
slaren (Collaborator) commented on the diff:
I don't understand the purpose of this change; perf mode only uses one backend.

slaren (Collaborator) commented on Aug 6, 2024

> So I edited the test code in such a way that performance is only evaluated for those ops where correctness can also be tested.

This should not be necessary.

slaren (Collaborator) commented on Aug 6, 2024

It seems that the issue is that the CUDA backend does not correctly report in its supports_op function which kinds of flash attention and mul_mat operations it can handle. The solution is to fix that, not to also check the CPU backend.
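
As a hypothetical sketch of that direction (the function name and conditions below are made up, not the actual ggml-cuda code): the backend's supports_op callback would encode which flash attention and mul_mat configurations actually have kernels, so the test harness could rely on it instead of also consulting the CPU backend.

```cpp
#include "ggml.h"

// Hypothetical, stricter supports_op: report support only for configurations
// that are assumed to have a dedicated kernel. The specific conditions are
// illustrative, not the real CUDA backend's checks.
static bool example_supports_op(const struct ggml_tensor * op) {
    switch (op->op) {
        case GGML_OP_FLASH_ATTN_EXT:
            // assume kernels exist only for head sizes 64 and 128
            return op->src[0]->ne[0] == 64 || op->src[0]->ne[0] == 128;
        case GGML_OP_MUL_MAT:
            // assume only F32 sources are supported (illustrative)
            return op->src[0]->type == GGML_TYPE_F32 &&
                   op->src[1]->type == GGML_TYPE_F32;
        default:
            return true; // remaining checks omitted for brevity
    }
}
```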

JohannesGaessler (Collaborator, Author)

In my opinion, both the inaccurate reporting of supported ops and the execution of performance tests for cases where correctness cannot be asserted are issues. But since the whole point of the tests is to reduce the time developers need to spend on quality assurance, I don't want to discuss this at length.

JohannesGaessler merged commit a8dbc6f into ggerganov:master on Aug 7, 2024
53 checks passed
arthw pushed a commit to arthw/llama.cpp that referenced this pull request on Aug 7, 2024