[FEATURE] Projection Pushdown (Parquet) #196

joocer · 2022-06-14T20:30:36Z

Push down the projection to the read step.

This should improve performance by handling less data in the processing steps.

joocer · 2022-06-14T22:58:01Z

only implement for the external readers (blob, nosql and sql readers)

joocer · 2022-06-18T16:40:27Z

This is probably going to be harder than just collecting all the tokens which are labelled as identifiers, especially when there's joins or sub queries, or aliases.

joocer · 2022-06-20T23:18:17Z

Implement hint NO_PUSH_PROJECTION at the same time

joocer · 2022-08-19T18:28:04Z

Do this as a first page to gather all the fields, then intersect with the selected fields and use that on future page reads.

This will mean no benefit on small datasets (single page), but that's not what would benefit from this anyway.

A '*' in the field list should disable the optimization.

This should be reflected in EXPLAIN.

Note that NATURAL JOIN should add a '*' to the field list when implemented.

This should NOT be the same approach taken for other data types.

joocer · 2022-08-19T21:33:11Z

This may conflict with the schema evolution feature, what happens if we select a column that doesn't exist. Maybe we need to wrap in a try and do more expensive work if it fails.

joocer · 2022-08-20T09:51:51Z

Can the field list be converted to a set earlier and once

Can we use the schema to update the field list set

FEATURE/#196 - Initial Projection Pushdown (Parquet only)

joocer added Performance 🏃‍♀️ Improve performance High Priority 1️⃣ labels Jun 14, 2022

joocer self-assigned this Jun 14, 2022

joocer removed the High Priority 1️⃣ label Jun 22, 2022

joocer removed their assignment Jun 25, 2022

joocer added a commit that referenced this issue Aug 13, 2022

FEATURE/#196

4732488

joocer added a commit that referenced this issue Aug 13, 2022

FEATURE/#196

bd4cf2d

joocer added the Next Release Planned for next release label Aug 15, 2022

joocer added a commit that referenced this issue Aug 16, 2022

FEATURE/#196

bfec587

joocer changed the title ~~[FEATURE] Projection Pushdown~~ [FEATURE] Projection Pushdown (Parquet) Aug 19, 2022

joocer self-assigned this Aug 19, 2022

joocer mentioned this issue Aug 19, 2022

✨ Projection Pushdown #360

Closed

joocer added a commit that referenced this issue Aug 20, 2022

FEATURE/#196

106b7ac

joocer added the Awaiting Closure Fixed - waiting for merging/releasing label Aug 20, 2022

joocer added a commit that referenced this issue Aug 20, 2022

FEATURE/#196

d843281

joocer closed this as completed Aug 20, 2022

joocer added a commit that referenced this issue Aug 20, 2022

Merge pull request #362 from mabel-dev/FEATURE/#196

c34a13a

FEATURE/#196 - Initial Projection Pushdown (Parquet only)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FEATURE] Projection Pushdown (Parquet) #196

[FEATURE] Projection Pushdown (Parquet) #196

joocer commented Jun 14, 2022

joocer commented Jun 14, 2022

joocer commented Jun 18, 2022

joocer commented Jun 20, 2022

joocer commented Aug 19, 2022 •

edited

Loading

joocer commented Aug 19, 2022 •

edited

Loading

joocer commented Aug 20, 2022

[FEATURE] Projection Pushdown (Parquet) #196

[FEATURE] Projection Pushdown (Parquet) #196

Comments

joocer commented Jun 14, 2022

joocer commented Jun 14, 2022

joocer commented Jun 18, 2022

joocer commented Jun 20, 2022

joocer commented Aug 19, 2022 • edited Loading

joocer commented Aug 19, 2022 • edited Loading

joocer commented Aug 20, 2022

joocer commented Aug 19, 2022 •

edited

Loading

joocer commented Aug 19, 2022 •

edited

Loading