Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Remove call to cudaDeviceSetCacheConfig #1536

Merged
merged 1 commit into from
Nov 12, 2020

Conversation

WeiqunZhang
Copy link
Member

Summary

No longer set CUDA device cache configuration to prefer L1 cache. This does
not appear to affect any kernels in a negative way. With more shared memory
available, reduction functions are faster in some cases.

Additional background

Checklist

The proposed changes:

  • fix a bug or incorrect behavior in AMReX
  • add new capabilities to AMReX
  • changes answers in the test suite to more than roundoff level
  • are likely to significantly affect the results of downstream AMReX users
  • are described in the proposed changes to the AMReX documentation, if appropriate

No longer set CUDA device cache configuration to prefer L1 cache.  This does
not appear to affect any kernels in a negative way.  With more shared memory
available, reduction functions are faster in some cases.
@WeiqunZhang WeiqunZhang requested a review from atmyers November 12, 2020 16:23
@WeiqunZhang WeiqunZhang merged commit cdb236e into AMReX-Codes:development Nov 12, 2020
@WeiqunZhang WeiqunZhang deleted the rm_setcacheconfig branch November 12, 2020 16:50
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants