Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Common utility code for dumping to a distributed filesystem #10892

Open
jlowe opened this issue May 24, 2024 · 0 comments
Open

Common utility code for dumping to a distributed filesystem #10892

jlowe opened this issue May 24, 2024 · 0 comments
Labels
feature request New feature or request

Comments

@jlowe
Copy link
Contributor

jlowe commented May 24, 2024

Is your feature request related to a problem? Please describe.
There are a number of places in the code where data can be side-band written to a distributed filesystem, e.g.: file scan input dumps, GPU core dumps, profiling data, etc. A lot of this code has to solve the problem of how to get the proper Hadoop configuration and credentials on the executor to access the filesystem to create the new file, and that code is often replicated across subsystems.

Describe the solution you'd like
There should be a common API in the executor plugin that subsystems on the executor can use to create a file on the distributed filesystem. The API should allow specifying a specific filename or generating a temporary name based on a filename template.

@jlowe jlowe added feature request New feature or request ? - Needs Triage Need team to review and classify labels May 24, 2024
@mattahrens mattahrens removed the ? - Needs Triage Need team to review and classify label May 28, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feature request New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants