-
Notifications
You must be signed in to change notification settings - Fork 3.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Python][FS][Azure] Minimal python bindings for AzureFilesystem
#39968
Comments
AzureFilesystem
AzureFilesystem
take |
I want to enable the python side tests for Azure but there are a couple of failures.
I'm hoping its ok to skip these for Azure initially. I can create Github issues to get these ironed out. |
UPDATE: I ended up not implementing |
…thub.com:Tom-Newton/arrow into tomnewton/minimal_python_bindings/apacheGH-39968
…ystem` (#40021) ### Rationale for this change We want to use the new `AzureFileSystem` in `pyarrow`. ### What changes are included in this PR? - Add minimal python bindings for `AzureFileSystem`. This includes just enough to run the python tests against azurite plus default credential auth to enable real use of this once this PR merges. - Adding additional configuration options and remaining authentication options can be done as a follow up. - I tried to copy the existing pybinds for GCS and S3 - Explicitly set `ARROW_AZURE=OFF` rather than relying on defaults. The defaults are different for builds vs tests so this was causing tests to be enabled while Azure was disabled during the build. ### Are these changes tested? Enabled the the python filesystem tests for the new filesystem. I had to skip azure in a couple of the tests though because they are not yet working on the C++ side. I created Github issues to resolve these #40025 and #40026 and added TODO comments where relevant, that reference these Github issues. ### Are there any user-facing changes? `pyarrow` users can now use the native `AzureFileSystem` to get much better reliability and performance compared to `adlfs` based options. * Closes: #39968 * GitHub Issue: #39968 Lead-authored-by: Thomas Newton <thomas.w.newton@gmail.com> Co-authored-by: Sutou Kouhei <kou@cozmixng.org> Co-authored-by: Joris Van den Bossche <jorisvandenbossche@gmail.com> Signed-off-by: Joris Van den Bossche <jorisvandenbossche@gmail.com>
Describe the enhancement requested
Child issue of: #39317
Blocked by: #39352
Create minimal pybindings such that the Azure filesystem is useable. Probably this will have no configuration options exposed and will always just use default credential auth.
I think it makes sense to circle back later to fill out all the configuration options that exist and may be added to
AzureFilesystem
Component(s)
Python
The text was updated successfully, but these errors were encountered: