-
Notifications
You must be signed in to change notification settings - Fork 448
Support reduce and scan for more than 2^31 items #584
Comments
I'd be happy to work on a patch for this if the CUB developers think the proposed change would be acceptable. |
Hello, @milthorpe! Thank you for offering help! |
Does anyone have plans on tackling this change? |
@wmaxey unless @milthorpe would like to contribute 64-bit support for scan, we'll create a separate issue for that. |
I don't actually have a current use-case for scan (only reduce, which @senior-zero kindly fixed already), which is why I've been so slow to respond - sorry! I think it is better to create a separate issue. |
@milthorpe I'm closing this issue then |
cub::DeviceReduce
andcub::DeviceScan
specify the input size asint num_items
, which limits reductions/scans to 2^31-1 items. Other CUB functions have the type of the number of items as a template parameter e.g.cub::DeviceRadixSort
acceptsNumItemsT num_items
.The
num_items
parameter should be replaced with a templated parameterNumItemsT num_items
for bothcub::DeviceReduce
andcub::DeviceScan
, to allow reductions and scans of more than 2^31 items.A simple test code for
DeviceReduce
withsize_t num_items
would be as follows:The text was updated successfully, but these errors were encountered: