Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fix(python): reuse state in to_pyarrow_dataset #2485

Merged

Conversation

ion-elgreco
Copy link
Collaborator

Description

Reuse the state so we save time instead of reconstructing and verifying the state.

@github-actions github-actions bot added the binding/python Issues for the Python package label May 7, 2024
@ion-elgreco ion-elgreco force-pushed the fix/reuse_table_state_pyarrow_dataset branch from 08a3a35 to 8617c80 Compare May 7, 2024 06:54
@ion-elgreco ion-elgreco marked this pull request as draft May 7, 2024 07:12
@ion-elgreco ion-elgreco marked this pull request as ready for review May 7, 2024 08:58
@adriangb
Copy link
Contributor

adriangb commented May 7, 2024

Yup this fixes the issue 🥳

@ion-elgreco ion-elgreco force-pushed the fix/reuse_table_state_pyarrow_dataset branch from 0b80b53 to 2695081 Compare May 7, 2024 13:59
@ion-elgreco ion-elgreco enabled auto-merge (rebase) May 7, 2024 14:06
@ion-elgreco ion-elgreco disabled auto-merge May 7, 2024 14:06
@ion-elgreco ion-elgreco enabled auto-merge (squash) May 7, 2024 14:06
@adriangb
Copy link
Contributor

adriangb commented May 7, 2024

@ion-elgreco looks like it needs a rebase. Also just curious if you'll be making a release with this fix since it's such a huge performance win for an essential feature?

@ion-elgreco ion-elgreco force-pushed the fix/reuse_table_state_pyarrow_dataset branch from bdcbfe2 to b53643e Compare May 7, 2024 14:37
@ion-elgreco
Copy link
Collaborator Author

@ion-elgreco looks like it needs a rebase. Also just curious if you'll be making a release with this fix since it's such a huge performance win for an essential feature?

@wjones127 could you take a look at the PR?

python/tests/test_fs.py Outdated Show resolved Hide resolved
python/src/filesystem.rs Outdated Show resolved Hide resolved
@ion-elgreco ion-elgreco force-pushed the fix/reuse_table_state_pyarrow_dataset branch from f5294a9 to a5071b0 Compare May 10, 2024 06:40
@ion-elgreco ion-elgreco force-pushed the fix/reuse_table_state_pyarrow_dataset branch from abe79d9 to 6cbcf01 Compare May 10, 2024 19:02
Copy link
Collaborator

@wjones127 wjones127 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks better now. Thanks.

@ion-elgreco ion-elgreco merged commit c734926 into delta-io:main May 10, 2024
23 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
binding/python Issues for the Python package
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants