Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Python HDFS: add Kerberos authentication support #20719

Open
damccorm opened this issue Jun 4, 2022 · 2 comments
Open

Python HDFS: add Kerberos authentication support #20719

damccorm opened this issue Jun 4, 2022 · 2 comments

Comments

@damccorm
Copy link
Contributor

damccorm commented Jun 4, 2022

The HDFS client used by Beam supports Kerberos.

Initial idea: add a flag --hdfs_client that defaults to "INSECURE" and also accepts "KERBEROS". This flag will control initialization of self._hdfs_client.

HDFS client docs:
https://hdfscli.readthedocs.io/en/latest/api.html#module-hdfs.ext.kerberos

The HDFS client seems to use this Kerberos library:
https://pypi.org/project/requests-kerberos/

Imported from Jira BEAM-11750. Original Jira may contain additional context.
Reported by: udim.

@blazingbhavneek
Copy link

Hey there! 👋 I'm new to this repository and eager to contribute! 🌟 Could you kindly suggest some entry point or files to look into?

@damccorm
Copy link
Contributor Author

Hey, saw you added this comment several places. I'd recommend focusing on a single issue at first (I answered the underlying question here - #20298 (comment))

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants