Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Added column is_partitioned to ucx.tables and dashboard #959

Closed
wants to merge 41 commits into from
Closed

Added column is_partitioned to ucx.tables and dashboard #959

wants to merge 41 commits into from

Conversation

dleiva04
Copy link
Contributor

@dleiva04 dleiva04 commented Feb 19, 2024

Changes

Linked issues

Resolves #871

Related PR:

Functionality

  • added relevant user documentation
  • added new CLI command
  • modified existing command: databricks labs ucx ...
  • added a new workflow
  • modified existing workflow: ...
  • added a new table
  • modified existing table: ucx.tables

Tests

  • manually tested
  • added unit tests
  • added integration tests
  • verified on staging environment (screenshot attached)

image
image

@dleiva04 dleiva04 requested review from a team and dipankarkush-db February 19, 2024 21:00
@CLAassistant
Copy link

CLAassistant commented Feb 19, 2024

CLA assistant check
All committers have signed the CLA.

Copy link
Contributor

@nfx nfx left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please add integration test and more comments.

How this partitioning information should affect the migration process?

Copy link

codecov bot commented Feb 20, 2024

Codecov Report

Attention: Patch coverage is 75.00000% with 5 lines in your changes are missing coverage. Please review.

Project coverage is 88.56%. Comparing base (41309a2) to head (732c119).

❗ Current head 732c119 differs from pull request most recent head fd529a1. Consider uploading reports for the commit fd529a1 to get more accurate results

Files Patch % Lines
src/databricks/labs/ucx/install.py 73.68% 1 Missing and 4 partials ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##             main     #959      +/-   ##
==========================================
+ Coverage   88.45%   88.56%   +0.10%     
==========================================
  Files          47       47              
  Lines        6211     6103     -108     
  Branches     1116     1094      -22     
==========================================
- Hits         5494     5405      -89     
+ Misses        477      463      -14     
+ Partials      240      235       -5     

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

@nfx nfx removed the request for review from dipankarkush-db February 20, 2024 08:14
@nfx nfx added pr/breaking-change this change does require data or configuration migration from previous versions step/assessment go/uc/upgrade - Assessment Step labels Mar 4, 2024
Copy link

gitguardian bot commented Mar 4, 2024

️✅ There are no secrets present in this pull request anymore.

If these secrets were true positive and are still valid, we highly recommend you to revoke them.
Once a secret has been leaked into a git repository, you should consider it compromised, even if it was deleted immediately.
Find here more information about risks.


🦉 GitGuardian detects secrets in your source code to help developers and security teams secure the modern development process. You are seeing this because you or someone else with access to this repository has authorized GitGuardian to scan your pull request.

Our GitHub checks need improvements? Share your feedbacks!

Copy link
Contributor

@nfx nfx left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is a breaking change, add the upgrade script

tests/integration/test_installation.py Outdated Show resolved Hide resolved
@dleiva04 dleiva04 requested a review from nfx March 6, 2024 00:55
src/databricks/labs/ucx/install.py Show resolved Hide resolved
src/databricks/labs/ucx/mixins/sql.py Show resolved Hide resolved
src/databricks/labs/ucx/mixins/sql.py Outdated Show resolved Hide resolved
src/databricks/labs/ucx/mixins/sql.py Show resolved Hide resolved
Copy link
Contributor

@nfx nfx left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Remove security holes

src/databricks/labs/ucx/mixins/sql.py Show resolved Hide resolved
@dleiva04 dleiva04 requested a review from nfx March 6, 2024 20:38
Copy link
Contributor

@nfx nfx left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Remove irrelevant changes

@@ -476,20 +476,28 @@ def run_workflow(self, step: str):
except OperationFailed as err:
# currently we don't have any good message from API, so we have to work around it.
job_run = self._ws.jobs.get_run(job_run_waiter.run_id)
raise self._infer_error_from_job_run(job_run) from err
raise self._infer_nested_error(job_run) from err
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Properly fetch main branch and rebase. This file is irrelevant to this PR.

@@ -264,10 +265,17 @@ def _row_converters(self, execute_response):
type_name = col.type_name
if not type_name:
type_name = ColumnInfoTypeName.NULL
conv = self.type_converters.get(type_name, None)
if type_name == ColumnInfoTypeName.BOOLEAN:
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is irrelevant

if conv is None:
msg = f"{col.name} has no {type_name.value} converter"
raise ValueError(msg)
col_conv.append(conv)
row_factory = type("Row", (Row,), {"__columns__": col_names})
return col_conv, row_factory

@staticmethod
def _convert_boolean_type(value):
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Irrelevant

@@ -0,0 +1,12 @@
from databricks.labs.blueprint.installation import Installation
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

change name to v0.16.0

@dleiva04 dleiva04 closed this by deleting the head repository Mar 8, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
pr/breaking-change this change does require data or configuration migration from previous versions step/assessment go/uc/upgrade - Assessment Step
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[FEATURE]: Add "is_partitioned" field to HMS tables crawler output
3 participants