-
Notifications
You must be signed in to change notification settings - Fork 849
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Always use vectorization when the numerical scheme supports it #1752
Merged
Merged
Changes from 1 commit
Commits
Show all changes
24 commits
Select commit
Hold shift + click to select a range
6c53381
use the non physical counter in SIMD numerics
pcarruscag 2edca53
automatically use vectorized Roe when possible
pcarruscag 5f0e319
fix build and some doxygen cleanup
pcarruscag c841509
proper fix
pcarruscag 3f6af89
this is what "using namespace std" does
pcarruscag 004b67b
fix leak
pcarruscag b8515c7
MG segfault
pcarruscag ef4de7d
Merge remote-tracking branch 'upstream/develop' into vectorize_when_p…
pcarruscag 41aa1d8
prevent nans with wall functions
pcarruscag ac58780
update hybrid regression
pcarruscag c808e8d
update parallel regression
pcarruscag 1166497
serial and v&v tests
pcarruscag 6a680ec
work around some forward mode issues
pcarruscag c0a8d7c
Merge remote-tracking branch 'upstream/develop' into vectorize_when_p…
pcarruscag 4357145
regression updates and SIMD size 1 for reverse AD
pcarruscag 37b4f58
updates after simd size of 1 for reverse AD
pcarruscag 87f451e
Update SU2_CFD/include/numerics_simd/flow/convection/common.hpp
pcarruscag 92fdc3d
Merge branch 'develop' into vectorize_when_possible
pcarruscag 750def1
CoDiPack update.
jblueh 59b5041
fix min/max problems? remove USE_VECTORIZATION from testcases
pcarruscag 0b5991a
Merge remote-tracking branch 'upstream/vectorize_when_possible' into …
pcarruscag fedec9e
Revert "CoDiPack update."
pcarruscag f963ffd
convert the result of comparissons to passive type
pcarruscag c9af050
Revert "Revert "CoDiPack update.""
pcarruscag File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
What happened exactly?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The result of the one line version was 1e-310, but I could not reproduce in a unit test, and also didnt get anything from valgrind. So i'm not sure, might have something to do with the expression types interfeering with each other?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Instead of 0 or 1.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think I managed to reproduce it, I'll look into it a little further.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Awesome thanks
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The
std
is not the point, just that writingrequires the
cmath
version offmax
to be accessible like this, too. https://en.cppreference.com/w/cpp/numeric/math/fmax looks to me as if they are instd
, though?There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
True, the suggested solution with
passivedouble
andfmin
/fmax
is specific to thesu2double
instantiation of the vector expressions. But maybe we could introduce some small traits that choose what I suggested forsu2double
and default to the previous solution for everything else?There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Usually fmin and fmax work without namespace specification.
We can use fmin/max as the implementation for now and leave a note that they will not be efficient for integers.
I dont think we use that at the moment, Ill double check when I have some time and make the changes, thanks.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Argument-dependent lookup should select the CoDiPack overloads automatically. Alright.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think the fix covers everything, active to passive, and suitable overloads.
Thank you for getting to the root cause.