You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The waves_per_block argument to tkw.HardwareConstraint currently has to be of length 3. Wave supports logical workgroup distribution along more than 3 dimensions and then figures out the mapping later. It should do the same with waves_per_block. Related to this, there are several sharp edges with the current setup:
If you have a workgroup dimension with block size other than 1 at index 3 or greater, the Wave compiler tries to look up waves_per_block at that index and fails with an IndexError
If you try to make waves_per_block longer than 3, the program crashes (looks like a segfault on the C side or something).
You have to specify waves_per_block for dimensions that you're not distributing along.
The
waves_per_block
argument totkw.HardwareConstraint
currently has to be of length 3. Wave supports logical workgroup distribution along more than 3 dimensions and then figures out the mapping later. It should do the same withwaves_per_block
. Related to this, there are several sharp edges with the current setup:waves_per_block
at that index and fails with an IndexErrorwaves_per_block
longer than 3, the program crashes (looks like a segfault on the C side or something).waves_per_block
for dimensions that you're not distributing along.FYI @harsh-nod
The text was updated successfully, but these errors were encountered: