IO Error: AzureBlobStorageFileSystem open file error reading Iceberg written by Spark and Nessie catalog #71

Open
krsma33 opened this issue Aug 13, 2024 · 2 comments


krsma33 commented Aug 13, 2024

Azure file connection format parsing issue.

I have the following use case: I write Iceberg tables using Spark and Nessie over the Iceberg REST protocol. Nessie supports the Azure file access format with @ notation (e.g. abfss://warehouse@<storage_account>.dfs.core.windows.net), and hence all paths in the metadata.json files are in that format.

DuckDB, on the other hand, supports path notation (e.g. abfss://<storage_account>.dfs.core.windows.net/warehouse), and I think that is why my query fails with the following error: IO Error: AzureBlobStorageFileSystem open file 'abfss://warehouse@<storage_account>.dfs.core.windows.net/some-path/some-file.avro'

I think the problem is that DuckDB doesn't handle the @ notation for the Azure URI scheme.
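
To make the difference between the two notations concrete, here is a minimal Python sketch that rewrites an @-notation URL into the path notation. The function name at_to_path_notation and the account name myaccount are hypothetical, chosen only for illustration. This shows the string mapping only; it is not a confirmed workaround, because the paths DuckDB opens come from the metadata.json files rather than from the query.

```python
from urllib.parse import urlsplit


def at_to_path_notation(url: str) -> str:
    """Rewrite abfss://<container>@<host>/<path> as abfss://<host>/<container>/<path>.

    Hypothetical helper: URLs that are not abfs/abfss or that have no
    '@' in the authority part are returned unchanged.
    """
    parts = urlsplit(url)
    if parts.scheme not in ("abfs", "abfss") or "@" not in parts.netloc:
        return url

    # In @ notation the container precedes the '@', the account host follows it.
    container, host = parts.netloc.split("@", 1)
    path = parts.path.lstrip("/")

    rewritten = f"{parts.scheme}://{host}/{container}"
    return f"{rewritten}/{path}" if path else rewritten


if __name__ == "__main__":
    # 'myaccount' is a placeholder storage account name.
    print(at_to_path_notation(
        "abfss://warehouse@myaccount.dfs.core.windows.net/some-path/some-file.avro"
    ))
```

Query strings and fragments are ignored here for brevity; a real rewrite would need to carry them over.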

Please let me know if there is a workaround.


gdubya commented Aug 14, 2024

I wonder if this delta issue is also related? duckdb/duckdb_delta#71


gdubya commented Sep 11, 2024

@krsma33 can you test if this issue is fixed in 1.1.0?
