Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feature: transform during load #10173

Open
5 of 16 tasks
youngsofun opened this issue Feb 23, 2023 · 1 comment
Open
5 of 16 tasks

Feature: transform during load #10173

youngsofun opened this issue Feb 23, 2023 · 1 comment
Labels
C-feature Category: feature

Comments

@youngsofun
Copy link
Member

youngsofun commented Feb 23, 2023

Summary

todo

the following is out-of-date, we mainly rely on copy from select

  • auto cast

    • for text format, more rules is trivial.
    • for parquet, need some rules or just all built-in cast
  • copy from select (much more general)

    • copy into is easier to impl with subquery since we already support select from stage.
      • need Atomicity
    • streaming_load is harder, since it is syntax is special
  • insert to view( the view do some transform before insert to table)

@youngsofun youngsofun added the C-feature Category: feature label Feb 23, 2023
@youngsofun
Copy link
Member Author

youngsofun commented Feb 23, 2023

for streaming load:

what about add a table function stream :
Insert into table 1 from ( select * from stream(format=>’parquet’) )


but streaming_load has another issue:

schema of file cannot be inferred during parse...

while as a table to select, we always need some schema in advance.

which means for streaming_load

  1. the subquery approach can not works.
  2. the cast need todo cannot be determined during parse, while copy into can, which may lead to 2 diff impl

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
C-feature Category: feature
Projects
None yet
Development

No branches or pull requests

1 participant