[MetaSchedule] Update scripts for subgraph tuning #10501

junrushao · 2022-03-06T02:11:16Z

No description provided.

junrushao · 2022-03-06T02:12:37Z

src/target/tag.cc

 #define TVM_REGISTER_CUDA_TAG(Name, Arch, SharedMem, RegPerBlock) \
  TVM_REGISTER_TARGET_TAG(Name).set_config({                      \
      {"kind", String("cuda")},                                   \
      {"arch", String(Arch)},                                     \
-      {"shared_memory_per_block", Integer(SharedMem)},            \
-      {"registers_per_block", Integer(RegPerBlock)},              \
+      {"max_shared_memory_per_block", Integer(SharedMem)},        \


Per discussion with @masahi, we updated shared_memory_per_block to max_shared_memory_per_block to be consistent with Vulkan settings. I'm not fully convinced but would love to follow the convention as Masa suggested.

junrushao · 2022-03-06T02:13:14Z

src/meta_schedule/postproc/verify_gpu_code.cc

@@ -105,7 +105,6 @@ class VerifyGPUCodeNode : public PostprocNode {
    Target target = context->target.value();
    this->target_constraints_ = Map<String, PrimExpr>{
        {"max_shared_memory_per_block", Extract(target, "shared_memory_per_block")},
-        {"max_local_memory_per_block", Extract(target, "registers_per_block")},


The local memory restriction is removed to be consistent with AutoScheduler, as discussed with @masahi. CC: @Hzfengsy

Hzfengsy

LGTM

junrushao requested review from vinx13, tqchen, kparzysz-quic, ZihengJiang, masahi, merrymercy, jcf94, comaniac, Hzfengsy, jroesch, yzhliu, icemelon and zhiics as code owners March 6, 2022 02:11

junrushao commented Mar 6, 2022

View reviewed changes

junrushao force-pushed the feature/2022-03-05/meta-schedule-scripts-for-op branch from ab6743d to df45fe7 Compare March 6, 2022 02:20

junrushao requested a review from areusch as a code owner March 6, 2022 02:20

[MetaSchedule] Update scripts for tuning subgraphs

30af4b9

junrushao force-pushed the feature/2022-03-05/meta-schedule-scripts-for-op branch from df45fe7 to 30af4b9 Compare March 6, 2022 02:23

junrushao added 2 commits March 5, 2022 18:39

Fix mypy

5c5555b

fix

5452a16

Hzfengsy approved these changes Mar 6, 2022

View reviewed changes

junrushao merged commit 8729f6b into apache:main Mar 6, 2022

ziqiangxu8457 pushed a commit to ziqiangxu8457/tvm that referenced this pull request Mar 6, 2022

[MetaSchedule] Update scripts for subgraph tuning (apache#10501)

71bee5e

pfk-beta pushed a commit to pfk-beta/tvm that referenced this pull request Apr 11, 2022

[MetaSchedule] Update scripts for subgraph tuning (apache#10501)

284bcd7

driazati mentioned this pull request Jul 14, 2022

TVM v0.9.0.rc0 Release Candidate Notes #12102

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[MetaSchedule] Update scripts for subgraph tuning #10501

[MetaSchedule] Update scripts for subgraph tuning #10501

junrushao commented Mar 6, 2022

junrushao Mar 6, 2022

junrushao Mar 6, 2022

Hzfengsy left a comment

[MetaSchedule] Update scripts for subgraph tuning #10501

[MetaSchedule] Update scripts for subgraph tuning #10501

Conversation

junrushao commented Mar 6, 2022

junrushao Mar 6, 2022

Choose a reason for hiding this comment

junrushao Mar 6, 2022

Choose a reason for hiding this comment

Hzfengsy left a comment

Choose a reason for hiding this comment