Increased performance of the discrete adjoint solver by using Templates for Linear Solvers #653

pcarruscag · 2019-02-16T00:31:07Z

Proposed Changes

This is the continuation of #650, so please compare these changes to the ones therein.

The linear systems solved in the discrete adjoint now use passivedouble, which speeds up the discrete adjoint by a factor of about 1.5 (less for low CFL, more for stiff Jacobians like those in structural FEM). This is possible because we only carry the derivatives of the residual.

I tried to keep the design simple. CSysSolve and all related classes (vector, matrix, preconditioner, and matrix-vector product) have a single template parameter, the data type, which can be passivedouble or su2double. There are no provisions for "mixed" arithmetic, except in CSysSolve, where through ::Solve (and only through ::Solve) one can ask for the solution of a system with a passive Jacobian and an active RHS and solution; this comes at the expense of temporaries that are allocated only once. Passive-Passive and Active-Active work without temporaries. Active-Passive is not supported as it does not make sense (see the end of the previous paragraph).
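To illustrate the dispatch described above, here is a minimal sketch, not the actual CSysSolve code. It assumes a diagonal system so the "solve" is a single division, and every name in it (`ActiveDouble`, `SketchSolver`, `toPassive`, `fromPassive`, `NumAllocations`) is hypothetical; `ActiveDouble` is only a stand-in for su2double in an AD build.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for an AD type (su2double in an AD build); it only
// carries a value here, no tape or derivative information.
struct ActiveDouble {
  double value;
  explicit ActiveDouble(double v = 0.0) : value(v) {}
};
inline ActiveDouble operator/(const ActiveDouble& a, const ActiveDouble& b) {
  return ActiveDouble(a.value / b.value);
}

// Small helpers that provide the type compatibility, instead of specializing
// the solver itself.
inline double toPassive(const ActiveDouble& x) { return x.value; }
inline void fromPassive(ActiveDouble& dst, double src) { dst = ActiveDouble(src); }

// Illustrative solver for a diagonal system, with the data type as its only
// template parameter.
template <class Scalar>
class SketchSolver {
 public:
  // Matching types (Passive-Passive or Active-Active): no temporaries.
  void Solve(const std::vector<Scalar>& diag, const std::vector<Scalar>& rhs,
             std::vector<Scalar>& sol) {
    SolveSameType(diag, rhs, sol);
  }

  // Mixed types (passive matrix, active RHS and solution): copy through
  // temporaries of type Scalar that are sized on first use and then reused.
  template <class Other>
  void Solve(const std::vector<Scalar>& diag, const std::vector<Other>& rhs,
             std::vector<Other>& sol) {
    if (rhsTmp_.size() != rhs.size()) {
      rhsTmp_.resize(rhs.size());
      solTmp_.resize(rhs.size());
      ++numAllocations_;
    }
    for (std::size_t i = 0; i < rhs.size(); ++i) rhsTmp_[i] = toPassive(rhs[i]);
    SolveSameType(diag, rhsTmp_, solTmp_);
    sol.resize(rhs.size());
    for (std::size_t i = 0; i < rhs.size(); ++i) fromPassive(sol[i], solTmp_[i]);
  }

  // Number of times the temporaries were (re)sized, for inspection only.
  int NumAllocations() const { return numAllocations_; }

 private:
  void SolveSameType(const std::vector<Scalar>& diag, const std::vector<Scalar>& rhs,
                     std::vector<Scalar>& sol) const {
    assert(diag.size() == rhs.size());
    sol.resize(rhs.size());
    for (std::size_t i = 0; i < rhs.size(); ++i) sol[i] = rhs[i] / diag[i];
  }

  std::vector<Scalar> rhsTmp_, solTmp_;  // temporaries for the mixed case
  int numAllocations_ = 0;
};
```

In this sketch, `SketchSolver<double>` accepts either `std::vector<double>` or `std::vector<ActiveDouble>` for the RHS and solution, while an active solver called with passive vectors fails to compile because no helper exists for that direction.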

This keeps the need for template specialization to a minimum. Wherever mixing types was necessary, small helper methods were defined to provide the compatibility instead of specializing larger methods; I think this keeps the code readable.

The place where passive and active types mix the most is CSysMatrix. This happens because the blocks are prepared by the numerics in **su2double and are then "Set", "Add", or "Subtract" on a CSysMatrix. The solution was to inline those routines and also template them on the data type of the block (or diagonal).
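A rough sketch of what templating the block routines on the block's data type can look like is given below. It is an illustration under assumptions, not the CSysMatrix implementation: the storage is a plain array of dense blocks, and `ActiveDouble`, `castTo`, and `SketchBlockMatrix` with its methods are hypothetical names.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for su2double in an AD build (value only, no tape).
struct ActiveDouble {
  double value;
  explicit ActiveDouble(double v = 0.0) : value(v) {}
  ActiveDouble& operator+=(const ActiveDouble& o) { value += o.value; return *this; }
  ActiveDouble& operator-=(const ActiveDouble& o) { value -= o.value; return *this; }
};

// Conversion helpers selected by overload on (destination, source) types.
inline void castTo(double& dst, double src) { dst = src; }
inline void castTo(double& dst, const ActiveDouble& src) { dst = src.value; }
inline void castTo(ActiveDouble& dst, const ActiveDouble& src) { dst = src; }

// Illustrative storage of nBlocks dense blocks of size blockSize x blockSize.
template <class Scalar>
class SketchBlockMatrix {
 public:
  SketchBlockMatrix(std::size_t nBlocks, std::size_t blockSize)
      : n_(blockSize), data_(nBlocks * blockSize * blockSize) {}

  // Each routine is templated on the block's type, so an active block can be
  // written into a passive matrix without an extra copy by the caller.
  template <class Other>
  inline void SetBlock(std::size_t iBlock, const Other* const* block) {
    for (std::size_t i = 0; i < n_; ++i)
      for (std::size_t j = 0; j < n_; ++j) castTo(At(iBlock, i, j), block[i][j]);
  }

  template <class Other>
  inline void AddBlock(std::size_t iBlock, const Other* const* block) {
    Scalar tmp = Scalar();
    for (std::size_t i = 0; i < n_; ++i)
      for (std::size_t j = 0; j < n_; ++j) {
        castTo(tmp, block[i][j]);
        At(iBlock, i, j) += tmp;
      }
  }

  template <class Other>
  inline void SubtractBlock(std::size_t iBlock, const Other* const* block) {
    Scalar tmp = Scalar();
    for (std::size_t i = 0; i < n_; ++i)
      for (std::size_t j = 0; j < n_; ++j) {
        castTo(tmp, block[i][j]);
        At(iBlock, i, j) -= tmp;
      }
  }

  const Scalar& Get(std::size_t iBlock, std::size_t i, std::size_t j) const {
    return data_[Index(iBlock, i, j)];
  }

 private:
  std::size_t Index(std::size_t iBlock, std::size_t i, std::size_t j) const {
    assert(i < n_ && j < n_ && (iBlock * n_ + i) * n_ + j < data_.size());
    return (iBlock * n_ + i) * n_ + j;
  }
  Scalar& At(std::size_t iBlock, std::size_t i, std::size_t j) {
    return data_[Index(iBlock, i, j)];
  }

  std::size_t n_;
  std::vector<Scalar> data_;
};
```

Because the conversion lives in the small `castTo` overloads, the three block routines stay generic and no specialization of the matrix class is needed in this sketch.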

I only tested on one fluid adjoint and one FSI adjoint case (fingers crossed that not too many tests fail).

Related Work

#594 Does not help with memory much but helps with speed.
#648 Makes it easier to interface with an external solver and still work with the discrete adjoint.
#650 Builds on top of what is proposed there.
#543 These MKL optimizations will now be possible for the discrete adjoint but I have not made them available yet.
Branch feature_template_linear_solver

PR Checklist

I am submitting my contribution to the develop branch.
My contribution generates no new compiler warnings (try with the '-Wall -Wextra -Wno-unused-parameter -Wno-empty-body' compiler flags).
My contribution is commented and consistent with SU2 style.
I have added a test case that demonstrates my contribution, if necessary.

talbring · 2019-02-19T09:28:47Z

@pcarruscag Thanks for taking a lead on that and picking that up! I wanted to do that for a very long time, but never had enough time 👍 I will have a look into it soon.

rsanfer · 2019-04-16T06:59:57Z

This has been sitting here for way too long; my apologies, @pcarruscag . Thanks for taking the lead and making the effort. The changes look good to me, but I'm no expert in templating myself, so I'd rather have somebody else with more experience in the topic approving it. Any comments, @economon @talbring ? However, I think this should go in soon, so if there are no further comments I'll be merging in the changes by next week.

economon

LGTM. Nice work @pcarruscag - looks to be a clean and logical introduction of templating for the linear solvers. Code is still pretty easy to follow in the end.

I have made a few comments in the review about places where we will need to sync up this and PR #652.

Also, it may be time to consider some additional regression tests for the linear solvers in the different modes. Perhaps even some unit tests at some point. I do not expect that we have full coverage yet of all permutations of the linear solvers, preconditioners, smoothers, etc. in both base and AD mode.

I will leave it to @talbring to comment if there are any more considerations relative to the initial work for templates in these classes.

Common/src/linear_solvers_structure.cpp

Common/src/matrix_structure.cpp

pcarruscag · 2019-04-18T09:56:26Z

Thanks guys.
@economon, regarding the comments, I am happy to remove them if that is ok with whoever put them there.
Regarding the conflicts, I noticed them when first reading through #652. The first two you mention are easy to solve; the one about moving the comms will probably require templating the associated method. I can help with the merge.
Cheers,
Pedro

economon · 2019-04-19T23:43:09Z

Looks like the commented code was added here (b5db893) but never activated. The Matrix* routines are not being used anywhere at the moment. Do you see some value in testing it out? Otherwise, might be best to remove so we aren't carrying around dead code.

pcarruscag · 2019-04-20T11:54:43Z

I see. I do not know what the quickest way to invert a 5x5 matrix is; the most robust would probably be LU with pivoting (for which we have the code in the RBF class). Since that relates to how we handle small dense matrices, I would say it relates to #643, so it would be good if the community reached a conclusion there.
In any case I want this PR to be only about templating, I can do that kind of cleanup when I:

Try to activate the MKL optimizations for the discrete adjoint.
Move the row/col elimination tasks from the structural solver and mesh deformation to CSysMatrix (as you suggested in Fix given displacement BC's of FEA solver and CElasticityMovement #658).

economon · 2019-04-23T18:42:15Z

Sounds good to me. Then, I suggest we leave the comments for now, and you can come back to it when considering #643 further (or when some performance issues are considered) in a later PR.

pcarruscag

@economon I fixed the conflicts without templating CGeometry: the MPI buffers stay in su2double and the translation is done by CSysMatrix in InitiateComms and CompleteComms. Please see if everything still looks ok, and also see my comment below.
Thanks,
Pedro

Common/src/linear_solvers_structure.cpp

economon · 2019-04-30T18:20:36Z

W.r.t. the translation in InitiateComms() and CompleteComms(): it appears to be a pretty straightforward typecast to take care of the templating. LGTM.
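For readers following along, the translation discussed here can be pictured with the sketch below. It is only an illustration under assumptions: there is no MPI in it (a local copy stands in for the exchange), the two routines are free functions rather than CSysMatrix members, and `ActiveDouble`, `castTo`, `SketchGeometry`, `InitiateCommsSketch`, and `CompleteCommsSketch` are hypothetical names.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for su2double in an AD build (value only, no tape).
struct ActiveDouble {
  double value;
  explicit ActiveDouble(double v = 0.0) : value(v) {}
};

// Casts chosen by the destination type, so the same packing and unpacking
// code works for passive and active data.
inline void castTo(double& dst, double src) { dst = src; }
inline void castTo(double& dst, const ActiveDouble& src) { dst = src.value; }
inline void castTo(ActiveDouble& dst, double src) { dst = ActiveDouble(src); }
inline void castTo(ActiveDouble& dst, const ActiveDouble& src) { dst = src; }

// Hypothetical owner of the communication buffers. It is not templated: the
// buffers always hold the active type.
struct SketchGeometry {
  std::vector<ActiveDouble> sendBuffer;
  std::vector<ActiveDouble> recvBuffer;
  // Stand-in for the real point-to-point exchange: a local copy.
  void Exchange() { recvBuffer = sendBuffer; }
};

// Pack values of any supported type into the fixed-type send buffer.
template <class Scalar>
void InitiateCommsSketch(SketchGeometry& geo, const std::vector<Scalar>& values) {
  geo.sendBuffer.resize(values.size());
  for (std::size_t i = 0; i < values.size(); ++i) castTo(geo.sendBuffer[i], values[i]);
}

// Unpack the fixed-type receive buffer into values of the destination type.
template <class Scalar>
void CompleteCommsSketch(const SketchGeometry& geo, std::vector<Scalar>& values) {
  assert(!geo.recvBuffer.empty() || geo.sendBuffer.empty());
  values.resize(geo.recvBuffer.size());
  for (std::size_t i = 0; i < values.size(); ++i) castTo(values[i], geo.recvBuffer[i]);
}
```

The point of the sketch is that only the pack and unpack steps know about the matrix data type, which is why the buffer owner does not need a template parameter.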

pcarruscag · 2019-05-01T13:01:42Z

Thank you, I am putting this to rest now then.

pcarruscag added 12 commits February 14, 2019 19:34

mpi mechanism to select appropriate wrapper based on type

9c68d6f

templated CSysVector

f71f35d

templated CSysMatrix, does not compile in AD

5430012

CSysMatrix compiles for AD

265bc7c

CSysSolve templated and compiles

131b722

CSysSolve_b templated and compiles

cd73010

Explained the templated types of CSysSolve and removed instanciation …

f46cf8a

…of CJacobiTransposedPreconditioner (not implemented)

SU2_CFD and _AD compile and run with templated classes instantiated f…

6b8460e

…or su2double

ElasticityMovement uses passive matrix and lin solver in AD

0a96888

CSolver uses passive matrix and lin solver in AD

ee740fd

missing "inline" keyword in some places

7ee8a72

Merge branch 'feature_refactor_lin_solvers' into feature_template_lin…

6fcaa35

…_solvers

pcarruscag and others added 2 commits February 19, 2019 12:09

Merge branch 'feature_refactor_lin_solvers' into feature_template_lin…

d601068

…_solvers

Merge branch 'develop' into feature_template_lin_solvers

c042cc0

talbring mentioned this pull request Mar 15, 2019

Update of CoDiPack and MeDiPack versions #660

Merged

4 tasks

pcarruscag and others added 2 commits March 20, 2019 10:52

Merge branch 'develop' into feature_template_lin_solvers

541cad9

Merge branch 'develop' into feature_template_lin_solvers

38fced7

pcarruscag mentioned this pull request Apr 4, 2019

Revert "Update of codi and medi" #667

Closed

4 tasks

pcarruscag added 2 commits April 4, 2019 22:16

resolve conflicts with develop

34b9b91

fix compiler warning

a6cd75f

rsanfer requested review from economon and talbring and removed request for economon April 16, 2019 06:59

economon approved these changes Apr 17, 2019

economon mentioned this pull request Apr 24, 2019

MPI Point-to-Point Refactoring + New Periodic BC Implementation #652

Merged

4 tasks

Merge branch 'develop' into feature_template_lin_solvers

6573bce

pcarruscag commented Apr 29, 2019

pcarruscag added 3 commits April 29, 2019 17:23

allow compilation of BASE without c++11 flag

4919075

type cast mechanism of CSysMatrix changed to depend on destination ty…

9f94644

…pe instead of CSysMatrix template parameter

forgot to handle directdiff in previous commit

fee8f14

add FULL_COMMS check to FGMRES

dfce6e3

pcarruscag merged commit 28e634a into su2code:develop May 1, 2019

pcarruscag mentioned this pull request Jun 5, 2019

CSysMatrix cleanup and performance improvements #700

Merged

4 tasks

talbring added the changelog:chore label Nov 7, 2019

talbring changed the title ~~Templated Linear Solvers~~ Templated Linear Solvers to increase performance of discrete adjoint solver Nov 8, 2019

pr-triage bot added the PR: merged label Nov 8, 2019

talbring changed the title ~~Templated Linear Solvers to increase performance of discrete adjoint solver~~ Increase performance of discrete adjoint solver by using Templates for Linear Solvers Nov 8, 2019

talbring changed the title ~~Increase performance of discrete adjoint solver by using Templates for Linear Solvers~~ Increased performance of the discrete adjoint solver by using Templates for Linear Solvers Nov 8, 2019
