Feature: dynamic shared mem moe_align_block_size_kernel #3376
Conversation
@pcmoritz Can you take a look? The PR looks good to me.
Thanks, the PR looks good to me, feel free to merge @WoosukKwon!
@akhoroshev Thanks for submitting the PR! Left minor comments.
Also, just curious: which MoE model are you using? Is there any public model with more than 128 experts?
This is not a public model.
@WoosukKwon Comments taken into account.
@akhoroshev LGTM! Thanks for the PR!
…size_kernel (vllm-project#3376)" This reverts commit 78b6c48.
…n_block_size_kernel (vllm-project#3376)"" This reverts commit fe983cc.
I encountered compilation errors related to insufficient shared memory size when I tried to increase NUM_MAX_EXPERTS to 128.
@zwd003
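For context, the PR title and the comment above suggest that the kernel's fixed-size shared-memory arrays (bounded by a compile-time NUM_MAX_EXPERTS constant) were replaced with dynamically sized shared memory whose byte count is supplied at launch time, so the expert count no longer needs a compile-time cap. Below is a minimal, hypothetical CUDA sketch of that general pattern; it is not the actual vLLM moe_align_block_size_kernel, and names such as count_tokens_per_expert and num_experts are illustrative assumptions.

```cuda
#include <cstdint>
#include <cstdio>
#include <cuda_runtime.h>

// Sketch: a histogram-style kernel that counts tokens per expert.
// Instead of a fixed `__shared__ int32_t counts[NUM_MAX_EXPERTS]` array,
// it declares an unsized extern shared buffer; the host passes the
// required byte count as the third launch parameter.
__global__ void count_tokens_per_expert(const int32_t* expert_ids,
                                        int32_t* counts_out,
                                        int num_tokens,
                                        int num_experts) {
  // Dynamically sized shared memory; the actual size is set at launch time.
  extern __shared__ int32_t shared_counts[];

  // Zero the per-block counters.
  for (int e = threadIdx.x; e < num_experts; e += blockDim.x) {
    shared_counts[e] = 0;
  }
  __syncthreads();

  // Each thread accumulates counts for a strided slice of the tokens.
  for (int i = threadIdx.x; i < num_tokens; i += blockDim.x) {
    atomicAdd(&shared_counts[expert_ids[i]], 1);
  }
  __syncthreads();

  // Write the block-local counts to global memory.
  for (int e = threadIdx.x; e < num_experts; e += blockDim.x) {
    counts_out[e] = shared_counts[e];
  }
}

int main() {
  const int num_tokens = 1024;
  const int num_experts = 256;  // no longer capped by a compile-time constant

  int32_t *expert_ids, *counts;
  cudaMallocManaged(&expert_ids, num_tokens * sizeof(int32_t));
  cudaMallocManaged(&counts, num_experts * sizeof(int32_t));
  for (int i = 0; i < num_tokens; ++i) expert_ids[i] = i % num_experts;

  // The dynamic shared-memory size is computed from the runtime expert count.
  size_t shared_bytes = num_experts * sizeof(int32_t);
  count_tokens_per_expert<<<1, 256, shared_bytes>>>(expert_ids, counts,
                                                    num_tokens, num_experts);
  cudaDeviceSynchronize();

  printf("tokens routed to expert 0: %d\n", counts[0]);
  cudaFree(expert_ids);
  cudaFree(counts);
  return 0;
}
```

Note that very large per-block requests (beyond the default 48 KB on most GPUs) would additionally require raising the kernel's dynamic shared-memory limit via cudaFuncSetAttribute; the sketch above stays well under that limit.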