Make asset checks matter to integration tests #3687

jdangerx · 2024-06-21T21:34:52Z

Overview

Closes #3705.

What did you change?

We weren't running asset checks in our ETL during integration tests. Instead of making a new, different ETL job for integration tests, we now run the same job we use everywhere else, with whatever dataset configuration the tests define.
We also weren't asserting that the ETL execution actually succeeded. Asset check failures currently raise exceptions, so we don't technically need this, but it seems prudent to assert success anyway.
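As a rough illustration of why the extra assertion is still useful, here is a minimal stdlib-only sketch. `RunResult`, `run_etl`, and `run_and_assert` are hypothetical stand-ins, not PUDL or Dagster code; they only model the idea that a run can either raise on failure or quietly report `success=False` (Dagster's `execute_in_process` has a `raise_on_error` parameter and its result exposes `success`, which is what this imitates).

```python
from dataclasses import dataclass


@dataclass
class RunResult:
    """Hypothetical stand-in for the result object an ETL run returns."""

    success: bool


def run_etl(check_outcomes: list[bool], raise_on_error: bool = True) -> RunResult:
    """Toy ETL run that fails if any check outcome is False.

    With raise_on_error=True a failure surfaces as an exception; otherwise
    it is only recorded on the returned result.
    """
    failed = not all(check_outcomes)
    if failed and raise_on_error:
        raise RuntimeError("check failed")
    return RunResult(success=not failed)


def run_and_assert(check_outcomes: list[bool], raise_on_error: bool) -> RunResult:
    """Mimic the test setup: run the ETL, then refuse to continue on failure."""
    result = run_etl(check_outcomes, raise_on_error=raise_on_error)
    # Catches failures even when nothing was raised during the run.
    assert result.success, "ETL run failed; not continuing integration tests"
    return result
```

In this model the assertion is redundant whenever failures raise, but it is the only thing that stops the tests if a failed run is ever returned without an exception.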

Testing

Verified that the fgd_equipment_null_check asset check passes by running the raw_eia860 assets and then the _core_eia860__fgd_equipment asset in Dagster.
Ran integration tests with make pytest-coverage and saw the integration tests pass.
Broke the fgd_equipment_null_check by making it return AssetCheckResult(passed=False).
Re-ran the _core_eia860__fgd_equipment asset in Dagster and saw the check fail.
Re-ran integration tests with make pytest-coverage and saw the integration test fail with a bunch of errors about dagster._core.errors.DagsterAssetCheckFailedError: Blocking check 'fgd_equipment_null_check' for asset '_core_eia860__fgd_e...
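The break-it-on-purpose procedure above can be modeled with a small stdlib-only sketch. Everything here is a hypothetical stand-in: `CheckResult` imitates `AssetCheckResult`, `BlockingCheckFailed` imitates `DagsterAssetCheckFailedError`, and the `equipment_id` column is made up, since the real check's logic is not shown in this PR.

```python
from dataclasses import dataclass


@dataclass
class CheckResult:
    """Hypothetical stand-in for Dagster's AssetCheckResult."""

    passed: bool


class BlockingCheckFailed(Exception):
    """Hypothetical stand-in for DagsterAssetCheckFailedError."""


def null_check(rows: list[dict], column: str) -> CheckResult:
    """Pass only if no row has a null in the given column."""
    return CheckResult(passed=all(row.get(column) is not None for row in rows))


def broken_check(rows: list[dict], column: str) -> CheckResult:
    """Deliberately broken variant, used to prove that failures propagate."""
    return CheckResult(passed=False)


def materialize_with_check(rows, check, column="equipment_id"):
    """Run the check against the rows and raise if it does not pass."""
    result = check(rows, column)
    if not result.passed:
        raise BlockingCheckFailed(f"Blocking check {check.__name__!r} failed")
    return rows
```

Swapping `null_check` for `broken_check` on otherwise valid rows should raise, which mirrors the manual test of forcing `passed=False` and confirming that the run fails.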

To-do list

Ensure docs build, unit & integration tests, and test coverage pass locally with make pytest-coverage (otherwise the merge queue may reject your PR)
Review the PR yourself and call out any questions or issues you have

jdangerx · 2024-06-21T21:36:07Z

Frustratingly, running make pytest-coverage on this branch has caused a bevy of apparently unrelated errors in the integration tests, so I will need to poke around more to validate that this change does cause our tests to fail on failed asset checks.

Also, don't continue integration test if the ETL run fails.

jdangerx · 2024-07-02T16:15:39Z

src/pudl/etl/cli.py

@@ -23,42 +19,6 @@
 logger = pudl.logging_helpers.get_logger(__name__)


-def pudl_etl_job_factory(


This was only being used in conftest so I moved it over to avoid some import issues.

jdangerx · 2024-07-02T16:17:04Z

test/conftest.py

@@ -272,6 +279,29 @@ def ferc1_xbrl_taxonomy_metadata(ferc1_engine_xbrl: sa.Engine):
    return result.output_for_node("raw_ferc1_xbrl__metadata_json")


+def _pudl_etl_job_factory(


I removed one layer of function nesting here because it seemed unnecessary based on the reconstructable jobs docs.

Since we're only using execute_in_process right now anyways, we don't even need a reconstructable job, but it seems fine to leave this because the logic is pretty straightforward now.

jdangerx · 2024-07-02T16:19:32Z

test/conftest.py

+        The job definition to be executed.
+    """
+    pudl.logging_helpers.configure_root_logger(logfile=logfile, loglevel=loglevel)
+    if not process_epacems:


Instead of making whole new JobDefinitions I just return the ones we actually use in other builds - that should make test behavior hew more closely to production behavior.
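A minimal sketch of that idea, under stated assumptions: `Job` is a hypothetical stand-in for a Dagster `JobDefinition`, and the job names in `SHARED_JOBS` are invented for illustration. Only the `process_epacems` flag comes from the diff above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Job:
    """Hypothetical stand-in for a Dagster JobDefinition."""

    name: str


# Jobs defined once and shared by every build (names are made up).
SHARED_JOBS = {
    "etl_full": Job("etl_full"),
    "etl_full_no_cems": Job("etl_full_no_cems"),
}


def job_factory(process_epacems: bool = True) -> Job:
    """Return an already-defined job instead of constructing a new one."""
    if not process_epacems:
        return SHARED_JOBS["etl_full_no_cems"]
    return SHARED_JOBS["etl_full"]
```

Because the factory hands back the very same objects other builds use, anything attached to those jobs (asset checks included) comes along without the tests having to re-declare it.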

jdangerx force-pushed the fix-failing-asset-checks-not-failing-etl branch from 8dd44ac to bb86c06 Compare June 28, 2024 21:45

Run asset checks in integration test.

f84467d

Also, don't continue integration test if the ETL run fails.

jdangerx force-pushed the fix-failing-asset-checks-not-failing-etl branch from bb86c06 to f84467d Compare July 2, 2024 16:08

jdangerx changed the title ~~WIP - fail a bunch of asset checks~~ Make asset checks matter to integration tests Jul 2, 2024

jdangerx marked this pull request as ready for review July 2, 2024 16:15

jdangerx commented Jul 2, 2024


jdangerx requested review from a team and e-belfer and removed request for a team July 2, 2024 16:19

jdangerx enabled auto-merge July 2, 2024 16:19

Merge branch 'main' into fix-failing-asset-checks-not-failing-etl

82169dd

zaneselvans added testing Writing tests, creating test data, automating testing, etc. dagster Issues related to our use of the Dagster orchestrator labels Jul 2, 2024

zaneselvans assigned jdangerx Jul 2, 2024

zaneselvans approved these changes Jul 2, 2024


jdangerx added this pull request to the merge queue Jul 2, 2024

Merged via the queue into main with commit 7c79116 Jul 2, 2024
12 checks passed

jdangerx deleted the fix-failing-asset-checks-not-failing-etl branch July 2, 2024 23:52

zaneselvans mentioned this pull request Jul 3, 2024

Nightly Build Failure 2024-07-03 #3708

Closed

bendnorman mentioned this pull request Jul 3, 2024

Move pudl_etl_job_factory back to pudl.etl.cli.py #3711

Merged

zaneselvans mentioned this pull request Oct 23, 2024

Make asset checks run during integration tests #3928

Closed
