Add hive.metastore.glue.skip-archive config option #23817

Open
wants to merge 3 commits into
base: master
from

Conversation

nineinchnick
Member

Description

When set to true, updating a table in Glue does not create an archived
version of the table. Glue tables are updated during INSERT operations,
so this option can be used to avoid reaching the limit on table
versions when executing many such operations.
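
As a rough illustration, the option might be set in a Hive catalog properties file as sketched below. The catalog file name is hypothetical and the surrounding properties are assumed for the example; only `hive.metastore.glue.skip-archive` comes from this PR.

```properties
# etc/catalog/example.properties (hypothetical catalog file name)
connector.name=hive
hive.metastore=glue
# Assumed usage: do not create an archived table version on each Glue table update
hive.metastore.glue.skip-archive=true
```

With this set, repeated INSERT operations should update the Glue table in place rather than adding a new archived version each time.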

In the second commit, I also added it to the legacy v1 Glue Metastore. I don't think it's needed there, but it might help when migrating Iceberg from v1 Glue to the latest one, at least to avoid binding issues in Guice modules.

Additional context and related issues

This is modeled after #14336

Release notes

( ) This is not user-visible or is docs only, and no release notes are required.
(x) Release notes are required. Please propose a release note for me.
( ) Release notes are required, with the following suggested text:

## Section
* Fix some things. ({issue}`issuenumber`)

@cla-bot cla-bot bot added the cla-signed label Oct 17, 2024
@github-actions github-actions bot added docs iceberg Iceberg connector hive Hive connector labels Oct 17, 2024
@ebyhr
Member

ebyhr commented Oct 17, 2024

/test-with-secrets sha=5d538527d13412ac48610f78f48d692a49480102


The CI workflow run with tests that require additional secrets has been started: https://github.com/trinodb/trino/actions/runs/11384047749

Member

@anusudarsan anusudarsan left a comment

A few minor comments/questions; otherwise LGTM.

ebyhr

This comment was marked as outdated.

When set to true, updating a table in Glue does not create an archived
version of the table. Glue tables are updated during INSERT operations,
so this option can be used to avoid reaching the limit on table
versions when executing a large number of such operations.
@ebyhr
Member

ebyhr commented Oct 25, 2024

/test-with-secrets sha=3c3d1dd43e6e3cbd87550ce038b5a288b627ac6f


The CI workflow run with tests that require additional secrets has been started: https://github.com/trinodb/trino/actions/runs/11515725284

@nineinchnick
Member Author

All green!

return skipArchive;
}

@Config("iceberg.glue.skip-archive")
Contributor

@findinpath findinpath Oct 25, 2024

Also remove it from metastores.md if the decision is to remove the property.
Would removing the property be a breaking change?

Labels
cla-signed docs hive Hive connector iceberg Iceberg connector

5 participants