Give tests parameterized by shapes better names #406
Conversation
this looks nice, thank you!
Note that this adds perf tests for batched gemm.
Do we actually want all the shapes to be in a shared file? It allows sharing when you have multiple test files using the same shapes, but the only case of that is attention, so I'm not sure whether it's worth the extra layer of indirection. WDYT? Also, do we always want perf tests to have the same shapes as regular tests? I've actually been testing with much smaller shapes while developing, which has the advantage of being faster and usually easier to debug.
To get this landed with the fix (so we don't have oodles of extra test cases), I'm going to narrow the scope and just improve the names for the tests already using `get_test_shapes`.
Old PR description: This is an example of what I think the change should be, but I didn't want to go through and do it for everything until checking in with others. I found it helpful to have more descriptive test names when working on kernels, especially as you can select by them. This incidentally also appears to fix a bug that was creating duplicate perf tests. I printed out `get_test_shapes("chained_gemm")` and got a pile of duplicated, nested `ParameterSet` entries (the full dump and the before/after test names are in the PR description below). Looks like each time shapes.py was imported, the loop adding perf tests got run again.
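For context, here is a minimal sketch (not the actual shapes.py; the registry and helper names are made up for illustration) of how a loop that wraps shared shape entries in perf-marked `pytest.param`s can produce exactly this kind of duplicated, nested `ParameterSet` output when it ends up running more than once over the same list:

```
import pytest

# Hypothetical shared registry, standing in for what shapes.py provides.
_TEST_SHAPES = {
    "chained_gemm": [
        (8, 128, 128, 64, 256),
        (40, 1024, 64, 64, 1024),
    ],
}


def _add_perf_params():
    # Wrap every existing entry in a perf-marked pytest.param and append
    # it to the same shared list.
    for shapes in _TEST_SHAPES.values():
        shapes.extend(
            pytest.param(s, marks=pytest.mark.perf_only) for s in list(shapes)
        )


def get_test_shapes(name):
    return _TEST_SHAPES[name]


# Run once: each shape appears twice (plain + perf-marked), as intended.
_add_perf_params()
# Run again (e.g. the marking loop executes once per importing test module,
# or the module body gets re-executed): the already-marked ParameterSets are
# wrapped and appended again, giving nested
# ParameterSet(values=(ParameterSet(...),), ...) entries like those below.
_add_perf_params()

for entry in get_test_shapes("chained_gemm"):
    print(entry)
```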
IMO, it's also a bit weird that the perf test param piggy-backs on shapes and is then separately controlled by a command line arg. Could it just be a separate parameter instead, with the appropriate mark, so the test can read from there rather than directly accessing the command line arg? One disadvantage is that in the current proposal there'd then be a "no_perf" suffix on all the non-perf tests; I couldn't figure out a way to make pytest use no id at all for a test param value (you can use the empty string, but it still gets joined on the `-` separator). WDYT?
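For illustration, a rough sketch of that suggestion, assuming the project's `perf_only` mark (the `run_perf` parameter name and the ids are hypothetical): perf becomes its own parameter carrying the mark, and the test reads it instead of a command line arg. Per the comment above, an empty id still gets joined into the name, which is why the non-perf case ends up with a `no_perf` suffix:

```
import pytest

perf_param = pytest.mark.parametrize(
    "run_perf",
    [
        # An empty id would still get joined with "-", so the non-perf
        # case gets an explicit "no_perf" id rather than no suffix.
        pytest.param(False, id="no_perf"),
        pytest.param(True, id="perf", marks=pytest.mark.perf_only),
    ],
)


@perf_param
@pytest.mark.parametrize(
    "shape", [(8, 128, 128, 64, 256), (40, 1024, 64, 64, 1024)]
)
def testChainedGemm(shape, run_perf):
    # The test reads run_perf here instead of inspecting a command line
    # option; perf-only runs are selected via the perf_only mark.
    if run_perf:
        ...  # benchmark this shape
    else:
        ...  # run the correctness check
```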
Yes, sharing shapes makes sense. I think it's okay to have that layer of indirection. Good point - in general we want the test shapes to be smaller but representative, and have perf cover the final shapes of interest. The interesting thing here is that we also have iree-kernel-benchmark for more comprehensive performance testing, so think of the performance tests here as a smaller subset that we can use to identify perf regressions. This was the original idea, but I don't think it was implemented well (or at all).
That makes sense. Please remove the command line arg and keep it as a separate parameter. The `no_perf` suffix is alright.
Ok, I can continue fiddling with it. Can we land this one as-is, though, so we can get rid of all the duplicate tests? (Sorry, if I'd realized that problem to begin with I would have just created a PR for that.)
Ok, given that we want different shapes for perf and non-perf, I think maybe what would make the most sense would be to have a parametrization that controls two parameters (shape and perf), which is a common thing in pytest, and we can have an API along those lines.
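A possible shape for that API, sketched with a hypothetical `param_shapes` helper (not an existing function in this repo): a single parametrization supplies both `shape` and `perf`, the perf shapes can differ from the correctness shapes, and no `no_perf` suffix is needed:

```
import pytest


def param_shapes(test_shapes, perf_shapes):
    """Build one parametrization covering both shape and perf."""
    params = [
        pytest.param(s, False, id="x".join(map(str, s))) for s in test_shapes
    ]
    params += [
        pytest.param(
            s,
            True,
            id="x".join(map(str, s)) + "-perf",
            marks=pytest.mark.perf_only,
        )
        for s in perf_shapes
    ]
    return pytest.mark.parametrize("shape,perf", params)


# Smaller shapes for correctness, larger ones only for perf runs.
@param_shapes(
    test_shapes=[(8, 128, 128, 64, 256)],
    perf_shapes=[(40, 1024, 64, 64, 1024)],
)
def testChainedGemm(shape, perf):
    ...
```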
(I don't have merge permissions)
I found it helpful to have more descriptive test names when working on kernels, especially as you can select by them. This just changes the tests already using the shared `get_test_shapes` function. I think we could expand it to others, but there are some decisions around test organization that I don't want to get into just now.

This incidentally also appears to fix a bug that was creating duplicate perf tests. I printed out `get_test_shapes("chained_gemm")` and got the following:
```
(8, 128, 128, 64, 256)
(40, 1024, 64, 64, 1024)
ParameterSet(values=((8, 128, 128, 64, 256),), marks=(MarkDecorator(mark=Mark(name='perf_only', args=(), kwargs={})),), id=None)
ParameterSet(values=((40, 1024, 64, 64, 1024),), marks=(MarkDecorator(mark=Mark(name='perf_only', args=(), kwargs={})),), id=None)
ParameterSet(values=((8, 128, 128, 64, 256),), marks=(MarkDecorator(mark=Mark(name='perf_only', args=(), kwargs={})),), id=None)
ParameterSet(values=((40, 1024, 64, 64, 1024),), marks=(MarkDecorator(mark=Mark(name='perf_only', args=(), kwargs={})),), id=None)
ParameterSet(values=(ParameterSet(values=((8, 128, 128, 64, 256),), marks=(MarkDecorator(mark=Mark(name='perf_only', args=(), kwargs={})),), id=None),), marks=(MarkDecorator(mark=Mark(name='perf_only', args=(), kwargs={})),), id=None)
ParameterSet(values=(ParameterSet(values=((40, 1024, 64, 64, 1024),), marks=(MarkDecorator(mark=Mark(name='perf_only', args=(), kwargs={})),), id=None),), marks=(MarkDecorator(mark=Mark(name='perf_only', args=(), kwargs={})),), id=None)
ParameterSet(values=((8, 128, 128, 64, 256),), marks=(MarkDecorator(mark=Mark(name='perf_only', args=(), kwargs={})),), id=None)
ParameterSet(values=((40, 1024, 64, 64, 1024),), marks=(MarkDecorator(mark=Mark(name='perf_only', args=(), kwargs={})),), id=None)
ParameterSet(values=(ParameterSet(values=((8, 128, 128, 64, 256),), marks=(MarkDecorator(mark=Mark(name='perf_only', args=(), kwargs={})),), id=None),), marks=(MarkDecorator(mark=Mark(name='perf_only', args=(), kwargs={})),), id=None)
ParameterSet(values=(ParameterSet(values=((40, 1024, 64, 64, 1024),), marks=(MarkDecorator(mark=Mark(name='perf_only', args=(), kwargs={})),), id=None),), marks=(MarkDecorator(mark=Mark(name='perf_only', args=(), kwargs={})),), id=None)
ParameterSet(values=(ParameterSet(values=((8, 128, 128, 64, 256),), marks=(MarkDecorator(mark=Mark(name='perf_only', args=(), kwargs={})),), id=None),), marks=(MarkDecorator(mark=Mark(name='perf_only', args=(), kwargs={})),), id=None)
ParameterSet(values=(ParameterSet(values=((40, 1024, 64, 64, 1024),), marks=(MarkDecorator(mark=Mark(name='perf_only', args=(), kwargs={})),), id=None),), marks=(MarkDecorator(mark=Mark(name='perf_only', args=(), kwargs={})),), id=None)
ParameterSet(values=(ParameterSet(values=(ParameterSet(values=((8, 128, 128, 64, 256),), marks=(MarkDecorator(mark=Mark(name='perf_only', args=(), kwargs={})),), id=None),), marks=(MarkDecorator(mark=Mark(name='perf_only', args=(), kwargs={})),), id=None),), marks=(MarkDecorator(mark=Mark(name='perf_only', args=(), kwargs={})),), id=None)
ParameterSet(values=(ParameterSet(values=(ParameterSet(values=((40, 1024, 64, 64, 1024),), marks=(MarkDecorator(mark=Mark(name='perf_only', args=(), kwargs={})),), id=None),), marks=(MarkDecorator(mark=Mark(name='perf_only', args=(), kwargs={})),), id=None),), marks=(MarkDecorator(mark=Mark(name='perf_only', args=(), kwargs={})),), id=None)
```
Looks like each time shapes.py was imported, the loop adding perf tests got run again.
Before:
```
testChainedGemm[MMAType.F32_32x32x8_F16-False-shape15]
testChainedGemm[MMAType.F32_32x32x8_F16-False-shape14]
testChainedGemm[MMAType.F32_32x32x8_F16-False-shape13]
testChainedGemm[MMAType.F32_32x32x8_F16-False-shape12]
testChainedGemm[MMAType.F32_32x32x8_F16-False-shape11]
testChainedGemm[MMAType.F32_32x32x8_F16-False-shape10]
testChainedGemm[MMAType.F32_32x32x8_F16-False-shape9]
testChainedGemm[MMAType.F32_32x32x8_F16-False-shape8]
testChainedGemm[MMAType.F32_32x32x8_F16-False-shape7]
testChainedGemm[MMAType.F32_32x32x8_F16-False-shape6]
testChainedGemm[MMAType.F32_32x32x8_F16-False-shape5]
testChainedGemm[MMAType.F32_32x32x8_F16-False-shape4]
testChainedGemm[MMAType.F32_32x32x8_F16-False-shape3]
testChainedGemm[MMAType.F32_32x32x8_F16-False-shape2]
testChainedGemm[MMAType.F32_32x32x8_F16-False-shape1]
testChainedGemm[MMAType.F32_32x32x8_F16-False-shape0]
testChainedGemm[MMAType.F32_16x16x16_F16-False-shape15]
testChainedGemm[MMAType.F32_16x16x16_F16-False-shape14]
testChainedGemm[MMAType.F32_16x16x16_F16-False-shape13]
testChainedGemm[MMAType.F32_16x16x16_F16-False-shape12]
testChainedGemm[MMAType.F32_16x16x16_F16-False-shape11]
testChainedGemm[MMAType.F32_16x16x16_F16-False-shape10]
testChainedGemm[MMAType.F32_16x16x16_F16-False-shape9]
testChainedGemm[MMAType.F32_16x16x16_F16-False-shape8]
testChainedGemm[MMAType.F32_16x16x16_F16-False-shape7]
testChainedGemm[MMAType.F32_16x16x16_F16-False-shape6]
testChainedGemm[MMAType.F32_16x16x16_F16-False-shape5]
testChainedGemm[MMAType.F32_16x16x16_F16-False-shape4]
testChainedGemm[MMAType.F32_16x16x16_F16-False-shape3]
testChainedGemm[MMAType.F32_16x16x16_F16-False-shape2]
testChainedGemm[MMAType.F32_16x16x16_F16-False-shape1]
testChainedGemm[MMAType.F32_16x16x16_F16-False-shape0]
```
After:
```
testChainedGemm[MMAType.F32_32x32x8_F16-False-40x1024x64x64x1024-perf]
testChainedGemm[MMAType.F32_32x32x8_F16-False-8x128x128x64x256-perf]
testChainedGemm[MMAType.F32_32x32x8_F16-False-40x1024x64x64x1024]
testChainedGemm[MMAType.F32_32x32x8_F16-False-8x128x128x64x256]
testChainedGemm[MMAType.F32_16x16x16_F16-False-40x1024x64x64x1024-perf]
testChainedGemm[MMAType.F32_16x16x16_F16-False-8x128x128x64x256-perf]
testChainedGemm[MMAType.F32_16x16x16_F16-False-40x1024x64x64x1024]
testChainedGemm[MMAType.F32_16x16x16_F16-False-8x128x128x64x256]
```
Signed-off-by: xintin <vermagaurav9408@gmail.com>
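For reference, descriptive ids like the ones above can be generated from the shape tuples with a parametrize `ids` callable; a minimal sketch (the helper name is illustrative, and this is not necessarily how the PR implements it):

```
import pytest


def shape_id(shape):
    # (8, 128, 128, 64, 256) -> "8x128x128x64x256"
    return "x".join(str(dim) for dim in shape)


@pytest.mark.parametrize(
    "shape",
    [(8, 128, 128, 64, 256), (40, 1024, 64, 64, 1024)],
    ids=shape_id,
)
def testChainedGemm(shape):
    # Collected as e.g. testChainedGemm[8x128x128x64x256], so a single
    # shape can be selected with `pytest -k 8x128x128x64x256`.
    ...
```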