-
Notifications
You must be signed in to change notification settings - Fork 4.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support Pandas==2.x in Apache Beam. #27221
Support Pandas==2.x in Apache Beam. #27221
Comments
Yes, Apache Beam doesn't yet support Pandas 2.x since Pandas 2.x removed |
Moving this to P2 since its a support feature rather than a bug. |
Thanks for reporting! This is a known issue, but not worked on to my knowledge. Contributions to fix the issue or further investigation/breakdown on what it would take to support Pandas 2.x are welcome. |
I started working on this and could use some opinions on the best way to proceed.
|
apache-beam has no compatibility with pandas>=2 see apache/beam#27221
.take-issue Cleaning this up. The goal is that the Pandas API stays consistent with Pandas. So I'm gating removals on Pandas version. |
Thanks a lot for stepping in to help with this effort, @caneff . |
What happened?
Beam doesn't work with pandas>=2.0.0
Reproducible Example
Issue Description
Pipeline fails with a runtime error: `AttributeError: type object 'Series' has no attribute 'append'`. Sample stacktrace:
Expected Behavior
the code must be the same result that version 1.5.3 from pandas
Installed Versions
Replace this line with the output of pd.show_versions()
version that fails Pandas 2.0.2
version that working ok Pandas 1.5.3
Response when I reported the issue on the panda's project
pandas-dev/pandas#53799 (comment)
Issue Priority
Priority: 1 (data loss / total loss of function)
Issue Components
The text was updated successfully, but these errors were encountered: