Refactor load_file tests #194

dimberman · 2022-03-10T19:46:56Z

Description
As a step towards "maturing" the astro DAG authoring project, we must rewrite our tests to ensure that every integration test runs against every database.

This step will simultaneously reduce the number of tests we need to maintain, make testing much simpler as we add new databases, and will make a future refactor much simpler as we can ensure proper coverage.

To do this, we will take advantage of two features in pytest, fixtures and parameterize.

For this ticket, we will update the append function.

Acceptance criteria
Have a single test file validating append across all databases.

Integration tests should be marked with pytest.marker.integration:

Each test should work across all databases
Validating both Table and TempTable as inputs
Use test_utils.run_dag
Use PyTest fixtures and parameterize to have a single main test that will validate transform across multiple databases

The tests should validate these scenarios:

loading against all formats (csv, parquet, avro, etc.)
loading to a temp_table or named table
loading to default schema and to named schema

The text was updated successfully, but these errors were encountered:

tatiana · 2022-03-17T06:35:34Z

@dimberman the description mentions append, but the title says load_file. Which one is this?

Also, I think it is important to mention that load_file was the first place where we started introducing the new style of integration tests.

These lines were the first place where we adopted the new integration tests approach:
https://github.com/astro-projects/astro/blob/main/tests/operators/test_agnostic_load_file.py#L532-L801

What is missing, IMPOV is to refactor these lines:
https://github.com/astro-projects/astro/blob/a83308a32f513a1e4049bc6dd98730c4e12de9f5/tests/operators/test_agnostic_load_file.py#L68-L529

Finally, I don't think we should be aiming to have a single test function per operator. I'd vote for us to have three independent tests than this type of fixture:
https://github.com/astro-projects/astro/blob/main/tests/operators/test_agnostic_merge.py#L47-L79

We are at risk of overusing fixtures, which can compromise the readability of the tests.

utkarsharma2 · 2022-03-17T11:34:58Z

@dimberman, I think I can partially work on this, refactoring testcases which are having conflicting table names.

Closes: #194 - [x] Each test should work across all databases - [x] Use test_utils.run_dag - [x] Use PyTest fixtures and parameterize to have a single main test that will validate transform across multiple databases - [x] Loading against all formats (csv, parquet, avro, etc.) -- **Partially done, since it's not required for all tests.** - [x] Loading to a temp_table or named table -- **Partially done, since it's not required for all tests.** - [x] Loading to default schema and to named schema -- **Partially done, since it's not required for all tests.**

dimberman added priority/high High priority testing Unit or integration testing improvements labels Mar 10, 2022

tatiana changed the title ~~Refactor load tests~~ Refactor load_file tests Mar 12, 2022

dimberman assigned tatiana and unassigned tatiana Mar 15, 2022

utkarsharma2 mentioned this issue Mar 17, 2022

Refactor load_file integration tests #189

Closed

utkarsharma2 self-assigned this Mar 17, 2022

utkarsharma2 mentioned this issue Mar 22, 2022

Refactored Load_file Op. testcases #229

Merged

6 tasks

dimberman closed this as completed in #229 Mar 23, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor load_file tests #194

Refactor load_file tests #194

dimberman commented Mar 10, 2022

tatiana commented Mar 17, 2022

utkarsharma2 commented Mar 17, 2022

Refactor load_file tests #194

Refactor load_file tests #194

Comments

dimberman commented Mar 10, 2022

tatiana commented Mar 17, 2022

utkarsharma2 commented Mar 17, 2022