Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

For pm-cpu, remove libfabric work-around of setting FI_CXI_RX_MATCH_MODE=software #6703

Merged

Conversation

ndkeen
Copy link
Contributor

@ndkeen ndkeen commented Oct 21, 2024

We had been using FI_CXI_RX_MATCH_MODE=software on pm-cpu to avoid some issues when Perlmutter was young.
There was no measurable performance difference, so did not change this setting.
Now let's try removing this and letting it use the default which is FI_CXI_RX_MATCH_MODE=hybrid.

[BFB]

@ndkeen ndkeen self-assigned this Oct 21, 2024
@ndkeen ndkeen added Machine Files BFB PR leaves answers BFB pm-cpu Perlmutter at NERSC (CPU-only nodes) labels Oct 21, 2024
@ndkeen ndkeen requested a review from rljacob October 21, 2024 16:12
Copy link

PR Preview Action v1.4.8
🚀 Deployed preview to https://E3SM-Project.github.io/E3SM/pr-preview/pr-6703/
on branch gh-pages at 2024-10-21 16:13 UTC

@rljacob
Copy link
Member

rljacob commented Oct 21, 2024

what kind of testing have you done?

@ndkeen ndkeen changed the title For pm-cpu, remove libfabric work-around of setting FI_CXI_RX_MATCH_MODE-software For pm-cpu, remove libfabric work-around of setting FI_CXI_RX_MATCH_MODE=software Oct 28, 2024
@ndkeen
Copy link
Contributor Author

ndkeen commented Oct 28, 2024

In general, I've done quite a bit of testing with this over the years -- with various cases looking for performance changes (there were not any). I'm sure I've run integration before and I can run again, but don't expect any surprises.

@ndkeen
Copy link
Contributor Author

ndkeen commented Oct 28, 2024

merged to next

@ndkeen
Copy link
Contributor Author

ndkeen commented Oct 29, 2024

Tests all green as expected

@ndkeen ndkeen merged commit 96ef265 into master Oct 29, 2024
9 checks passed
@ndkeen ndkeen deleted the ndk/machinefiles/nersc-remove-FI_CXI_RX_MATCH_MODE-software branch October 29, 2024 18:07
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
BFB PR leaves answers BFB Machine Files pm-cpu Perlmutter at NERSC (CPU-only nodes)
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants