We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
StatisticsConverter
Thanks to #10453, we now have the very nice StatisticsConverter API (code link) that extracts statistics from parquet row groups and is well tested.
However, there is a different code path that extracts and summarizes statistics in ListingTable used for pruning files
ListingTable
datafusion/datafusion/core/src/datasource/file_format/parquet.rs
Lines 310 to 373 in 9ab597b
In addition to being more code, this also seems like it doesn't properly convert the data types
Which is used
Lines 511 to 535 in ea21b08
I would like to update the code to use StatisticsConverter and rather than converting row/column at a time
No response
The text was updated successfully, but these errors were encountered:
I did not notice there was a PR already. Feel free to close mine.
Sorry, something went wrong.
alamb
Successfully merging a pull request may close this issue.
Is your feature request related to a problem or challenge?
Thanks to #10453, we now have the very nice
StatisticsConverter
API (code link) that extracts statistics from parquet row groups and is well tested.However, there is a different code path that extracts and summarizes statistics in
ListingTable
used for pruning filesdatafusion/datafusion/core/src/datasource/file_format/parquet.rs
Lines 310 to 373 in 9ab597b
In addition to being more code, this also seems like it doesn't properly convert the data types
Which is used
datafusion/datafusion/core/src/datasource/file_format/parquet.rs
Lines 511 to 535 in ea21b08
Describe the solution you'd like
I would like to update the code to use
StatisticsConverter
and rather than converting row/column at a timeDescribe alternatives you've considered
No response
Additional context
No response
The text was updated successfully, but these errors were encountered: