Cleaned up Job assessment and Cluster assessment to improve testing and reduce redundancy. #825

FastLee · 2024-01-22T15:07:06Z

Changes

Linked issues

closes #818
Relates to #823

Resolves #..

Functionality

added relevant user documentation
added new CLI command
modified existing command: databricks labs ucx ...
added a new workflow
modified existing workflow: ...
added a new table
modified existing table: ...

Tests

manually tested
added unit tests
added integration tests
verified on staging environment (screenshot attached)

codecov · 2024-01-22T15:09:14Z

Codecov Report

Attention: 2 lines in your changes are missing coverage. Please review.

Comparison is base (e36db5f) 85.39% compared to head (6d12d1a) 85.61%.
Report is 2 commits behind head on main.

Files	Patch %	Lines
src/databricks/labs/ucx/assessment/crawlers.py	33.33%	1 Missing and 1 partial ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #825      +/-   ##
==========================================
+ Coverage   85.39%   85.61%   +0.21%     
==========================================
  Files          40       41       +1     
  Lines        5031     5212     +181     
  Branches      921      950      +29     
==========================================
+ Hits         4296     4462     +166     
- Misses        523      536      +13     
- Partials      212      214       +2

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

nfx

see #823

tests/unit/assessment/test_clusters.py

nfx · 2024-01-23T18:47:24Z

tests/unit/assessment/test_clusters.py

+    assert len(result_set) == 1
+    assert result_set[0].success == 0
+    match = re.findall(fail_regex, result_set[0].failures)
+    assert len(match) == 2


parse JSON and assert on concrete failures, don't rely tests on regexes. And mark this PR ready for review.

Added tests.

nfx

Beautiful

nfx · 2024-01-24T08:41:06Z

tests/unit/assessment/test_clusters.py

+def test_cluster_assessment():
+    ws = workspace_client_mock(clusters="assortment-conf.json")
+    crawler = ClustersCrawler(ws, MockBackend(), "ucx")
+    result_set = list(crawler.snapshot())

    assert len(result_set) == 4


Can you do assertions on failure messages received? That way we'll be confident in tests doing what they should

nfx · 2024-01-24T08:43:25Z

tests/unit/assessment/test_clusters.py

+    assert len(result_set) == 1
+    assert result_set[0].success == 0
+    failures = json.loads(result_set[0].failures)
+    assert 'unsupported config: spark.databricks.passthrough.enabled' in failures


I'd suggest to create some helper function for this, as we'll be doing it a lot

Fixed and issue introduced with PR #825

- fix conflicts in assessment/clusters.py and assessment/jobs.py from the PR #825 and PR #838 - move _check_cluster_failures logic into assessment/crawlers.py and let jobs and clusters call this function

* Added `databricks labs ucx alias` command to create a view of tables from one schema/catalog in another schema/catalog ([#837](#837)). * Added `databricks labs ucx save-aws-iam-profiles` command to scan instance profiles identify AWS S3 access and save a CSV with permissions ([#817](#817)). * Added total view counts in the assessment dashboard ([#834](#834)). * Cleaned up `assess_jobs` and `assess_clusters` tasks in the `assessment` workflow to improve testing and reduce redundancy.([#825](#825)). * Added documentation for the assessment report ([#806](#806)). * Fixed escaping for SQL object names ([#836](#836)). Dependency updates: * Updated databricks-sdk requirement from ~=0.17.0 to ~=0.18.0 ([#832](#832)).

…nd reduce redundancy. (#825)

Fixed and issue introduced with PR #825

* Added `databricks labs ucx alias` command to create a view of tables from one schema/catalog in another schema/catalog ([#837](#837)). * Added `databricks labs ucx save-aws-iam-profiles` command to scan instance profiles identify AWS S3 access and save a CSV with permissions ([#817](#817)). * Added total view counts in the assessment dashboard ([#834](#834)). * Cleaned up `assess_jobs` and `assess_clusters` tasks in the `assessment` workflow to improve testing and reduce redundancy.([#825](#825)). * Added documentation for the assessment report ([#806](#806)). * Fixed escaping for SQL object names ([#836](#836)). Dependency updates: * Updated databricks-sdk requirement from ~=0.17.0 to ~=0.18.0 ([#832](#832)).

FastLee had a problem deploying to account-admin January 22, 2024 15:07 — with GitHub Actions Failure

nfx requested changes Jan 22, 2024

View reviewed changes

tests/unit/assessment/test_clusters.py Outdated Show resolved Hide resolved

FastLee added 2 commits January 23, 2024 11:10

Added Multi Failure Test

13810df

Used workspace client mock

205fb59

FastLee force-pushed the fix/clusters_multiple_issues_818 branch from a61eb09 to 205fb59 Compare January 23, 2024 16:28

FastLee temporarily deployed to account-admin January 23, 2024 16:28 — with GitHub Actions Inactive

nfx requested changes Jan 23, 2024

View reviewed changes

Added unit tests. Implemented Mixings

c4ca6ff

FastLee had a problem deploying to account-admin January 23, 2024 23:56 — with GitHub Actions Failure

Added Jobs

78a4f48

FastLee had a problem deploying to account-admin January 24, 2024 03:04 — with GitHub Actions Failure

Added Job Support.

6d12d1a

Added tests.

FastLee had a problem deploying to account-admin January 24, 2024 03:48 — with GitHub Actions Failure

FastLee changed the title ~~Fix Multiple Failures in Cluster Assessment~~ Cleaned up Job assessment and Cluster assessment to improve testing and reduce redundancy. Jan 24, 2024

FastLee requested a review from nfx January 24, 2024 03:49

nfx approved these changes Jan 24, 2024

View reviewed changes

nfx marked this pull request as ready for review January 24, 2024 08:44

nfx requested review from a team and HariGS-DB January 24, 2024 08:44

nfx merged commit c03596e into main Jan 24, 2024
6 of 7 checks passed

nfx deleted the fix/clusters_multiple_issues_818 branch January 24, 2024 08:45

FastLee mentioned this pull request Jan 25, 2024

Addressed integration test issue #839

Merged

nfx pushed a commit that referenced this pull request Jan 25, 2024

Addressed integration test issue (#839)

b4128cc

Fixed and issue introduced with PR #825

nfx mentioned this pull request Jan 26, 2024

Release v0.11.0 #848

Merged

dmoore247 pushed a commit that referenced this pull request Mar 23, 2024

Cleaned up Job assessment and Cluster assessment to improve testing a…

e3ed21a

…nd reduce redundancy. (#825)

dmoore247 pushed a commit that referenced this pull request Mar 23, 2024

Addressed integration test issue (#839)

dfff20e

Fixed and issue introduced with PR #825

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cleaned up Job assessment and Cluster assessment to improve testing and reduce redundancy. #825

Cleaned up Job assessment and Cluster assessment to improve testing and reduce redundancy. #825

FastLee commented Jan 22, 2024 •

edited by nfx

Loading

codecov bot commented Jan 22, 2024 •

edited

Loading

nfx left a comment

nfx Jan 23, 2024

nfx left a comment

nfx Jan 24, 2024

nfx Jan 24, 2024

Cleaned up Job assessment and Cluster assessment to improve testing and reduce redundancy. #825

Cleaned up Job assessment and Cluster assessment to improve testing and reduce redundancy. #825

Conversation

FastLee commented Jan 22, 2024 • edited by nfx Loading

Changes

Linked issues

Functionality

Tests

codecov bot commented Jan 22, 2024 • edited Loading

Codecov Report

nfx left a comment

Choose a reason for hiding this comment

nfx Jan 23, 2024

Choose a reason for hiding this comment

nfx left a comment

Choose a reason for hiding this comment

nfx Jan 24, 2024

Choose a reason for hiding this comment

nfx Jan 24, 2024

Choose a reason for hiding this comment

FastLee commented Jan 22, 2024 •

edited by nfx

Loading

codecov bot commented Jan 22, 2024 •

edited

Loading