[Injective Schedule] make injective ops's opt schedule applied to every output tensor #11820

crazydemo · 2022-06-22T05:51:44Z

Modify the schedule for injective ops. All outputs of injective ops should share the same schedule optimization, e.g. outputs of OP split are all supposed to be lowered into parallel for.

new ir:
IRModule({GlobalVar(tvmgen_default_fused_split): PrimFunc([placeholder, T_split_sections, T_split_sections]) attrs={"from_legacy_te_schedule": (bool)1, "global_symbol": "tvmgen_default_fused_split", "tir.noalias": (bool)1} {
  buffer_realize T_split_sections([0, 1], [0, 64], [0, 56], [0, 56]) {
    parallel (ax0.ax1.fused, 0, 64) {
      for (ax2, 0, 56) {
        for (ax3.outer, 0, 4) {
          vectorized (ax3.inner, 0, 16) {
            if (tir.likely(((ax3.inner + (ax3.outer*16)) < 56))) {
              T_split_sections[floordiv(ax0.ax1.fused, 64), floormod(ax0.ax1.fused, 64), ax2, (ax3.inner + (ax3.outer*16))] = placeholder[floordiv(ax0.ax1.fused, 64), floormod(ax0.ax1.fused, 64), ax2, (ax3.inner + (ax3.outer*16))]
            }
          }
        }
      }
    }
    buffer_realize T_split_sections([0, 1], [0, 64], [0, 56], [0, 56]) {
      parallel (ax0.ax1.fused, 0, 64) {
        for (ax2, 0, 56) {
          for (ax3.outer, 0, 4) {
            vectorized (ax3.inner, 0, 16) {
              if (tir.likely(((ax3.inner + (ax3.outer*16)) < 56))) {
                T_split_sections[floordiv(ax0.ax1.fused, 64), floormod(ax0.ax1.fused, 64), ax2, (ax3.inner + (ax3.outer*16))] = placeholder[floordiv(ax0.ax1.fused, 64), (floormod(ax0.ax1.fused, 64) + 64), ax2, (ax3.inner + (ax3.outer*16))]
              }
            }
          }
        }
      }
    }
  }
}
})

old ir:
IRModule({GlobalVar(tvmgen_default_fused_split): PrimFunc([placeholder, T_split_sections, T_split_sections]) attrs={"from_legacy_te_schedule": (bool)1, "global_symbol": "tvmgen_default_fused_split", "tir.noalias": (bool)1} {
  buffer_realize T_split_sections([0, 1], [0, 64], [0, 56], [0, 56]) {
    parallel (ax0.ax1.fused, 0, 64) {
      for (ax2, 0, 56) {
        for (ax3.outer, 0, 4) {
          vectorized (ax3.inner, 0, 16) {
            if (tir.likely(((ax3.inner + (ax3.outer*16)) < 56))) {
              T_split_sections[floordiv(ax0.ax1.fused, 64), floormod(ax0.ax1.fused, 64), ax2, (ax3.inner + (ax3.outer*16))] = placeholder[floordiv(ax0.ax1.fused, 64), floormod(ax0.ax1.fused, 64), ax2, (ax3.inner + (ax3.outer*16))]
            }
          }
        }
      }
    }
    buffer_realize T_split_sections([0, 1], [0, 64], [0, 56], [0, 56]) {
      for (ax1, 0, 64) {
        for (ax2, 0, 56) {
          for (ax3, 0, 56) {
            T_split_sections[0, ax1, ax2, ax3] = placeholder[0, (ax1 + 64), ax2, ax3]
          }
        }
      }
    }
  }
}
})

crazydemo · 2022-06-27T01:41:29Z

@masahi Could you please help review this PR?

…che#11820)

make injective ops's opt schedule applied to every output tensor

fd26c20

masahi approved these changes Jun 27, 2022

View reviewed changes

masahi merged commit 1115fd9 into apache:main Jun 27, 2022

blackkker pushed a commit to blackkker/tvm that referenced this pull request Jul 7, 2022

make injective ops's opt schedule applied to every output tensor (apa…

d4f7044

…che#11820)

mikeseven pushed a commit to mikeseven/tvm that referenced this pull request Sep 27, 2023

make injective ops's opt schedule applied to every output tensor (apa…

ecb03f6

…che#11820)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Injective Schedule] make injective ops's opt schedule applied to every output tensor #11820

[Injective Schedule] make injective ops's opt schedule applied to every output tensor #11820

crazydemo commented Jun 22, 2022

crazydemo commented Jun 27, 2022

[Injective Schedule] make injective ops's opt schedule applied to every output tensor #11820

[Injective Schedule] make injective ops's opt schedule applied to every output tensor #11820

Conversation

crazydemo commented Jun 22, 2022

crazydemo commented Jun 27, 2022