vulkan: Fix newly added tests for permuted mul_mat and 1D im2col #10226
Conversation
```diff
@@ -3630,9 +3630,19 @@ static void ggml_vk_mul_mat_vec_nc_f16_f32(ggml_backend_vk_context * ctx, vk_con
 static void ggml_vk_mul_mat(ggml_backend_vk_context * ctx, vk_context& subctx, const ggml_tensor * src0, const ggml_tensor * src1, ggml_tensor * dst, bool dryrun = false) {
     VK_LOG_DEBUG("ggml_vk_mul_mat(" << src0 << ", " << src1 << ", " << dst << ")");
-    if (src0->type == GGML_TYPE_F16 && ggml_is_permuted(src0) && ggml_is_permuted(src1) && dst->ne[1] == 1) {
+    if (src0->type == GGML_TYPE_F16 && ggml_is_permuted(src0) && ggml_is_permuted(src1) && dst->ne[1] == 1 &&
+        // detect 0213 permutation, and batch size of 1
```
Is this correct, or is there a better way to detect this permutation?
There isn't a better way currently. We can add a helper function in ggml.h to check permutations if needed.
But I'm wondering how come the CUDA backend does not do these checks and still passes the tests?
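For illustration, a minimal sketch of what such a ggml.h helper could look like; `ggml_is_permuted_0213` is a hypothetical name, not an existing ggml function:

```cpp
#include "ggml.h"

// Hypothetical helper: detect a tensor that is a 0213 permutation of a
// contiguous tensor by inspecting its byte strides. ggml_permute(ctx, t, 0, 2, 1, 3)
// swaps the strides of dims 1 and 2, so a contiguous source ends up with
// nb[0] <= nb[2] <= nb[1] <= nb[3].
static bool ggml_is_permuted_0213(const struct ggml_tensor * t) {
    return t->nb[0] <= t->nb[2] &&
           t->nb[2] <= t->nb[1] &&
           t->nb[1] <= t->nb[3];
}
```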
Because CUDA only uses this function if `any_gpus_with_slow_fp16` is true; otherwise it falls back to another function, in this case `ggml_cuda_op_mul_mat_cublas`, according to my debugger.
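To make that gating concrete, here is a rough, hypothetical sketch of the dispatch condition described in this thread; it combines the Vulkan condition from the diff above with the `any_gpus_with_slow_fp16` gate, and is not a verbatim excerpt from ggml-cuda.cu:

```cpp
#include "ggml.h"

// Hypothetical predicate mirroring the behavior described above: the permuted
// f16 fast path is only considered when at least one GPU has slow fp16;
// otherwise CUDA dispatches to ggml_cuda_op_mul_mat_cublas, which handles the
// permuted inputs correctly.
static bool would_use_permuted_f16_fast_path(const struct ggml_tensor * src0,
                                             const struct ggml_tensor * src1,
                                             const struct ggml_tensor * dst,
                                             bool any_gpus_with_slow_fp16) {
    return any_gpus_with_slow_fp16 &&
           src0->type == GGML_TYPE_F16 &&
           ggml_is_permuted(src0) &&
           ggml_is_permuted(src1) &&
           dst->ne[1] == 1;
}
```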
If I run the tests with a Tesla P40 (which does have slow fp16) the CUDA tests do in fact fail in the same way, so I guess this fix should also be applied to CUDA.
Thank you for this. I ran the tests again on my GPUs and it fixes the issues.
I think the permutation check should go into ggml.h to make it easier to understand and so we can reuse it in other backends. @ggerganov Should that be done in this PR or separately with the fix for CUDA?
Can be done in a separate PR. Maybe add a function
This fixes the recently added tests for permuted mul_mat and 1D im2col in the Vulkan backend. The new tests were added by c39665f and 80273a3.
The im2col fix simply applies the same code change as 80273a3. The mul_mat fix is a combination of changes: disabling fast paths for input combinations they don't support, and reporting some of the new cases as unsupported. I verified that the preexisting tests did not change which code path they use and that no preexisting tests became unsupported.
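For readers unfamiliar with what "permuted mul_mat" refers to here, below is a minimal sketch of how a 0213-permuted mul_mat can be constructed with the ggml API. The shapes are made up for illustration and are not the exact cases from test-backend-ops:

```cpp
#include "ggml.h"
#include <stddef.h>

int main(void) {
    struct ggml_init_params params = { 16u * 1024 * 1024, NULL, false };
    struct ggml_context * ctx = ggml_init(params);

    // src0 (f16) and src1 (f32) as used by mul_mat
    struct ggml_tensor * a = ggml_new_tensor_4d(ctx, GGML_TYPE_F16, 64, 8, 16, 1);
    struct ggml_tensor * b = ggml_new_tensor_4d(ctx, GGML_TYPE_F32, 64, 8, 1, 1);

    // swap dims 1 and 2: this is the "0213" permutation discussed above
    struct ggml_tensor * ap = ggml_permute(ctx, a, 0, 2, 1, 3);
    struct ggml_tensor * bp = ggml_permute(ctx, b, 0, 2, 1, 3);

    // permuted mul_mat; dst->ne[1] == 1 here, matching the batch-size-1 case
    // that the permuted-f16 fast path targets
    struct ggml_tensor * dst = ggml_mul_mat(ctx, ap, bp);
    (void) dst;

    ggml_free(ctx);
    return 0;
}
```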