shared-cache - enable shared-cache support in SCR #481
Comments
SCR_CACHE_BASE
to make the name node unique
@adammoody, I think that this change is required in order for SCR to support a shared cache. Do you agree? If so, should a shared cache be a mode that SCR is configured in? Or, should we simply change the naming scheme in general so that it works in both a shared and non-shared cache?
To start with, let's only claim to support I don't know whether we can easily enforce that one only uses Instead, we can document that it is on the user to mark any shared storage as
SCR can direct the application to write dataset files to subdirectories within a cache directory. SCR also stores its redundancy data in these subdirectories.
Question: Should it be considered an error to configure redundancy schemes when the cache is shared?
To construct the full path of a cache directory, SCR currently concatenates the cache base directory (SCR_CACHE_BASE), the name of the user running the application, and the job scheduler resource allocation id. This presents a name collision problem when the cache is on a shared file system, because every rank in the job derives the same directory name.
This ticket proposes that the cache directory name should also have the MPI rank number appended to the name above.
Question: Should we just append this in general? Or only when the cache is on a shared file system, which raises the question of how SCR can determine whether the file system is shared? My vote is to simply append the rank number as a general rule after the session id.
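To make the collision concrete, here is a minimal sketch of the two naming schemes. It is not SCR's actual code: the `cache_dir` helper, the example values, and the choice to add the rank as a trailing path component are all assumptions made for illustration.

```python
import posixpath


def cache_dir(cache_base, user, alloc_id, rank=None):
    """Build a cache directory path from its parts.

    Hypothetical helper, not part of SCR. With rank=None it mimics the
    current scheme (base + user + allocation id). With a rank it mimics
    the proposed scheme, assuming the rank becomes a trailing component.
    """
    parts = [cache_base, user, str(alloc_id)]
    if rank is not None:
        parts.append(str(rank))
    return posixpath.join(*parts)


# Current scheme: every rank in the allocation computes the same path,
# which collides when cache_base lives on a shared file system.
current = {cache_dir("/shared/cache", "alice", 1234) for _ in range(4)}

# Proposed scheme: appending the rank yields one directory per rank.
proposed = {cache_dir("/shared/cache", "alice", 1234, rank=r) for r in range(4)}

print(len(current), len(proposed))
```

Appending the rank unconditionally, as voted for above, would avoid having to detect whether the file system is shared, at the cost of one extra directory level per rank on node-local caches.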