Matrix Inverse CUDA generation - no kernels loaded #39

leratojeffrey · 2014-06-11T08:28:21Z

Hi,
I am trying to generate a CUDA code for DenseMatrix.inv statement in OptiML and I realized that in the DenseMatrixOps the DenseMatrixInverse case class inherits/extends a DeliteOpSingleWithManifest, which according to my experience will allow emitting only a sequential/Scala code as it is not a parallel op. I maybe wrong about this as I am still getting to understand some of these things.

If I am right about this, do you think it's possible to try implementing DenseMatrixInverse using DeliteOpIndexedLoop or DeliteOpForEach with the concept of Guass-Jordan elimination algorithm. I have tried this with pure CUDA and it seems to work fine although I have not compared any speed-ups with the sequential pure C version. My plan was to try it first but my advisors suggested I find out first before any attempt. Please advice.

Here is the code I tried on OptiML and my new DSL (OptiSDR), which currently adotpts/inherits most functionality from OptiLA.

        val m1 = DenseMatrix.rand(10000,4250)
        val invm1 = m1.inv

The text was updated successfully, but these errors were encountered:

hyouklee · 2014-06-12T19:21:32Z

Hi Lerato,

You're right that the current implementation of the matrix inverse is sequential, and therefore CUDA kernel will not be generated.
I think it's worth trying to implement using parallel ops as you mentioned if it's not too complicated.
One thing to note is that there are existing CUDA libraries you can use to calculate the matrix inverse, and I'm not sure if using the Delite parallel ops would perform poorly compared to those implementations.

leratojeffrey · 2014-06-13T13:40:05Z

Thanks Lee, I will try it using Delite parallel ops and let you Guys know soon what I came up with.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Matrix Inverse CUDA generation - no kernels loaded #39

Matrix Inverse CUDA generation - no kernels loaded #39

leratojeffrey commented Jun 11, 2014

hyouklee commented Jun 12, 2014

leratojeffrey commented Jun 13, 2014

Matrix Inverse CUDA generation - no kernels loaded #39

Matrix Inverse CUDA generation - no kernels loaded #39

Comments

leratojeffrey commented Jun 11, 2014

hyouklee commented Jun 12, 2014

leratojeffrey commented Jun 13, 2014