Generic support for INSERT/PUT ... VALUES with partial column spec #1394

sumwale · 2019-07-29T22:54:41Z

Changes allow the following to be supported in addition to currently supported ones on all tables:

// simple constructs
INSERT (INTO|OVERWRITE) <table> VALUES ...
PUT INTO <table> VALUES ...

// with partial columns that can be out of order with table definition
INSERT (INTO|OVERWRITE) <table>(<col1>...) VALUES ...
INSERT (INTO|OVERWRITE) <table>(<col1>...) SELECT ...
PUT INTO <table>(<col1>...) VALUES ...
PUT INTO <table>(<col1>...) SELECT ...

Changes proposed in this pull request

add SnappyParser "inlineTable" rule to insert/put and removed "subSelectQuery" rule
that was specifically added to avoid the "inlineTable" rule
enhanced the AnalyzeMutableOperations rule to deal with the case of "partial columns"
which is parsed as a TableValueFunction; locate the columns in target table and
add appropriate Project on child
capture the query string for possible later use in "partial column" construct for
row tables where explicit non-null DEFAULT value for a column may have been specified;
for this case switch to DMLExternalTable which may fail if query contains Spark-only functions
removed custom PutIntoValuesColumnTable and use consistent way for both INSERT and PUT
handle the QuestionMark case in VALUES() in a generic way as LocalRelation resolved by Spark
removed ResolveRelationsExtended and instead made table as proper child of DMLExternalTable
so that it is resolved in the normal way by ResolveRelations

New unit tests to be added shortly.

Patch testing

precheckin

ReleaseNotes.txt changes

Document support for VALUES and partial column specification in INSERT/PUT. Limitation of
not being able to use Spark-only functions with partial column specification for ROW tables.

Other PRs

NA

- use JDBCMutableRelation.executeUpdate consistently that correctly sets current schema instead of direct JDBC calls - for dropIndex when JDBCMutableRelation has not been resolved, SnappySession execution resolves the full table name before passing to dropRowStoreIndex - update SHOW INDEXES example output - minor formatting changes in ExternalStoreUtils and jdbcExtensions

Changes allow the following to be supported in addition to currently supported ones: // simple construct INSERT (INTO|OVERWRITE) <table> VALUES ... PUT INTO <table> VALUES ... // with partial columns that can be out of order with table definition INSERT (INTO|OVERWRITE) <table>(<col1>...) VALUES ... INSERT (INTO|OVERWRITE) <table>(<col1>...) SELECT ... PUT INTO <table>(<col1>...) VALUES ... PUT INTO <table>(<col1>...) SELECT ... - add SnappyParser "inlineTable" rule to insert/put and removed "subSelectQuery" rule that was specifically added to avoid the "inlineTable" rule - enhanced the AnalyzeMutableOperations rule to deal with the case of "partial columns" which is parsed as a TableValueFunction; locate the columns in target table and add appropriate Project on child - capture the query string for possible later use in "partial column" construct for row tables where explicit non-null DEFAULT value for a column may have been specified; for this case switch to DMLExternalTable which may fail if query contains Spark-only functions - removed custom PutIntoValuesColumnTable and use consistent way for both INSERT and PUT - also handle the QuestionMark case in VALUES() in a generic way as LocalRelation resolved by Spark - removed ResolveRelationsExtended and instead made table as proper child of DMLExternalTable so that it is resolved in the normal way by ResolveRelations

dshirish

For complex datatypes, will we still continue using "insert into<> select <>" syntax?

dshirish

Looks like unit tests mentioned are to be pushed.
However the code change itself looks good to me.

sumwale · 2019-08-01T19:53:09Z

For complex datatypes, will we still continue using "insert into<> select <>" syntax?

No, ARRAY/STRUCT/MAP will work now in VALUES like they do in SELECT.

also moved prepared put statement test from QueryRoutingDUnitTest to PreparedQueryRoutingSingleNodeSuite which is invoked by both scalatest and the dunit test

sumwale added 2 commits July 30, 2019 03:58

sumwale requested review from kneeraj, vatsalmevada, dshirish and smahajan05 July 29, 2019 22:54

sumwale changed the base branch from SNAP-2885 to master July 30, 2019 17:03

Merge remote-tracking branch 'origin/master' into dmlVALUES

6cc33af

dshirish reviewed Aug 1, 2019

View reviewed changes

dshirish approved these changes Aug 1, 2019

View reviewed changes

sumwale added 8 commits August 2, 2019 01:38

Merge remote-tracking branch 'origin/master' into dmlVALUES

218318e

Merge remote-tracking branch 'origin/master' into dmlVALUES

4c5f07e

Merge remote-tracking branch 'origin/master' into dmlVALUES

b7065a8

fixing test failures

30e4971

also moved prepared put statement test from QueryRoutingDUnitTest to PreparedQueryRoutingSingleNodeSuite which is invoked by both scalatest and the dunit test

Merge remote-tracking branch 'origin/master' into dmlVALUES

4006394

Merge remote-tracking branch 'origin/master' into dmlVALUES

e9526ee

fixing unit test failures

51c7a9c

Merge remote-tracking branch 'origin/master' into dmlVALUES

7dc3adf

ashetkar force-pushed the master branch from b73485e to f740fee Compare April 20, 2021 09:04

ashetkar force-pushed the dmlVALUES branch from 7be931f to 7dc3adf Compare April 20, 2021 09:07

sumwale force-pushed the master branch from 1e636db to e1d45b2 Compare June 26, 2021 19:41

sumwale force-pushed the master branch from 8cc4798 to 5f5c15d Compare July 14, 2021 18:12

sumwale force-pushed the master branch 5 times, most recently from 8b43301 to 2b254d9 Compare October 1, 2021 09:23

sumwale force-pushed the master branch 2 times, most recently from 232b75d to a2ab483 Compare October 17, 2021 01:41

sumwale force-pushed the master branch 3 times, most recently from 2c254f0 to 0f2888f Compare October 18, 2021 17:01

sumwale force-pushed the master branch 2 times, most recently from a466d26 to ea127bd Compare April 12, 2022 10:05

sumwale force-pushed the master branch 2 times, most recently from 99ec79c to c7b84fa Compare June 12, 2022 04:19

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Generic support for INSERT/PUT ... VALUES with partial column spec #1394

Generic support for INSERT/PUT ... VALUES with partial column spec #1394

sumwale commented Jul 29, 2019 •

edited

Loading

dshirish left a comment

dshirish left a comment

sumwale commented Aug 1, 2019

Generic support for INSERT/PUT ... VALUES with partial column spec #1394

Are you sure you want to change the base?

Generic support for INSERT/PUT ... VALUES with partial column spec #1394

Conversation

sumwale commented Jul 29, 2019 • edited Loading

Changes proposed in this pull request

Patch testing

ReleaseNotes.txt changes

Other PRs

dshirish left a comment

Choose a reason for hiding this comment

dshirish left a comment

Choose a reason for hiding this comment

sumwale commented Aug 1, 2019

sumwale commented Jul 29, 2019 •

edited

Loading