Reduction

This project develops a high-performance multi-GPU reduction kernel using CUDA cooperative groups. If the array is large enough, the synchronization overhead is negligible compared with the time spent on the reduction itself.
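To make the idea concrete, here is a minimal sketch of a sum reduction that relies on a cooperative-groups grid-wide barrier. It is not this project's kernel: the names `reduce_sum_kernel` and `reduce_sum` are hypothetical, it runs on a single GPU only (the multi-GPU part is not shown), it assumes a device that supports cooperative launch, and it omits CUDA error checking for brevity.

```cuda
#include <cassert>
#include <cooperative_groups.h>
#include <cuda_runtime.h>
#include <vector>

namespace cg = cooperative_groups;

// Hypothetical kernel: sums n floats from `in` into *out.
// `partial` must hold one float per block; block size must be a power of two.
__global__ void reduce_sum_kernel(const float* in, float* partial, float* out, size_t n) {
    cg::grid_group grid = cg::this_grid();
    cg::thread_block block = cg::this_thread_block();
    extern __shared__ float smem[];

    // Grid-stride loop: each thread accumulates a private partial sum.
    float sum = 0.0f;
    for (size_t i = grid.thread_rank(); i < n; i += grid.size())
        sum += in[i];
    smem[block.thread_rank()] = sum;
    block.sync();

    // Tree reduction in shared memory within the block.
    for (unsigned s = block.size() / 2; s > 0; s >>= 1) {
        if (block.thread_rank() < s)
            smem[block.thread_rank()] += smem[block.thread_rank() + s];
        block.sync();
    }
    if (block.thread_rank() == 0)
        partial[blockIdx.x] = smem[0];

    // One grid-wide barrier: every block's partial sum is visible afterwards.
    grid.sync();

    // A single thread combines the per-block partial sums.
    if (grid.thread_rank() == 0) {
        float total = 0.0f;
        for (unsigned b = 0; b < gridDim.x; ++b)
            total += partial[b];
        *out = total;
    }
}

// Hypothetical host wrapper: copies the data to the current device,
// launches the kernel cooperatively, and returns the sum.
float reduce_sum(const std::vector<float>& host) {
    const int threads = 256;
    const size_t smem_bytes = threads * sizeof(float);
    int dev = 0, coop = 0, sm_count = 0, blocks_per_sm = 0;
    cudaGetDevice(&dev);
    cudaDeviceGetAttribute(&coop, cudaDevAttrCooperativeLaunch, dev);
    assert(coop && "device must support cooperative launch");

    // A cooperative grid must be fully resident, so size it from occupancy.
    cudaDeviceGetAttribute(&sm_count, cudaDevAttrMultiProcessorCount, dev);
    cudaOccupancyMaxActiveBlocksPerMultiprocessor(&blocks_per_sm, reduce_sum_kernel,
                                                  threads, smem_bytes);
    const int blocks = sm_count * blocks_per_sm;

    size_t n = host.size();
    float *d_in = nullptr, *d_partial = nullptr, *d_out = nullptr;
    cudaMalloc(&d_in, n * sizeof(float));
    cudaMalloc(&d_partial, blocks * sizeof(float));
    cudaMalloc(&d_out, sizeof(float));
    cudaMemcpy(d_in, host.data(), n * sizeof(float), cudaMemcpyHostToDevice);

    void* args[] = {&d_in, &d_partial, &d_out, &n};
    cudaLaunchCooperativeKernel((void*)reduce_sum_kernel, dim3(blocks), dim3(threads),
                                args, smem_bytes, 0);

    float result = 0.0f;
    cudaMemcpy(&result, d_out, sizeof(float), cudaMemcpyDeviceToHost);
    cudaFree(d_in);
    cudaFree(d_partial);
    cudaFree(d_out);
    return result;
}
```

The kernel crosses the grid-wide barrier once regardless of `n`, while the grid-stride loop grows with `n`, which is why the barrier cost matters less as the array gets larger. The sketch should build with `nvcc`; older toolkits may additionally need `-rdc=true` for grid synchronization.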