[Feature] CT-201 source config WIP #5003

emmyoop · 2022-04-06T20:18:37Z

resolves #3662

Description

Draft PR for feature branch.

Some context over in #4960

Checklist

I have signed the CLA
I have run this code in development and it appears to resolve the stated issue
This PR includes tests, or tests are not required/relevant for this PR
I have added information about my change to be included in the CHANGELOG.

Co-authored-by: Jeremy Cohen <jeremy@dbtlabs.com>

github-actions · 2022-04-06T20:18:53Z

Thank you for your pull request! We could not find a changelog entry for this change. For details on how to document a change, see the contributing guide.

jtcohen6

Getting close! I left a few comments that may provide helpful (or least historical) context

jtcohen6 · 2022-04-08T10:11:14Z

core/dbt/parser/sources.py

            base=False,
+            patch_config_dict=target.source.config,


We need access to both:

target.source.config = config defined at the source level

target.table.config = config defined at the source table level

I think there are two options here:

Merge those configs together (preferring table-level over source-level), before passing into patch_config_dict

Modify calculate_node_config to accept multiple "levels" of patches, so that all calculation happens in that same method: source table > source > project-level config

jtcohen6 · 2022-04-08T10:12:13Z

core/dbt/parser/manifest.py

@@ -465,6 +465,8 @@ def parse_project(
                    else:
                        dct = block.file.dict_from_yaml
                    parser.parse_file(block, dct=dct)
+                    # Came out of here with UnpatchedSourceDefinition containing configs at the source level
+                    # and not configs at the table level (as expected)


See comment re: patch_config_dict passed into calculate_node_config

jtcohen6 · 2022-04-08T10:12:33Z

core/dbt/contracts/graph/model_config.py

+        default=None,
+        metadata=CompareBehavior.Exclude.meta(),
+    )
+    # TODO what type is this? docs say: "<column_name_or_expression>"


String is right.

Food for thought: These dataclass attributes are duplicative with the ones already defined:

in UnparsedSourceTableDefinition

in ParsedSourceDefinition

Given our aim here is backwards compatibility, that duplication may be unavoidable

jtcohen6 · 2022-04-08T10:13:30Z

core/dbt/contracts/graph/model_config.py

@@ -335,6 +335,39 @@ def replace(self, **kwargs):
 @dataclass
 class SourceConfig(BaseConfig):
    enabled: bool = True
+    quoting: Dict[str, Any] = field(


Should we move the addition of these new configs into a separate PR? If the goal is just to support setting enabled as a config on the source/table levels

jtcohen6 · 2022-04-08T10:40:14Z

core/dbt/parser/sources.py

        )

        unrendered_config = self._generate_source_config(
-            fqn=target.fqn,
+            target=target,


Background context on rendered: bool, UnrenderedConfigGenerator, and unrendered_config, since these are pretty confusing!

This exists to limit "false positives" during state comparison, i.e. to power state:modified selection in Slim CI runs ("just build what's changed"): https://docs.getdbt.com/reference/node-selection/state-comparison-caveats#false-positives

The comparison happens in the same_config method here:

dbt-core/core/dbt/contracts/graph/parsed.py

Lines 317 to 321 in 899b0ef

def same_config(self, old: T) -> bool:

return self.config.same_contents(

self.unrendered_config,

old.unrendered_config,

)

Idea being, it's not uncommon for users to have environment-based configuration like:

sources: - name: my_source config: database: "{{ 'prod_raw' if target.name == 'prod' else 'sampled_dev' }}" schema: "{{ env_var('DBT_SOURCE_SCHEMA') }}" # etc

The rendered versions of those configs will be different based on the values of env vars or target values. The thing we actually want to detect changes in is the raw un-rendered Jinja expression. If that's changed, it's because someone edited the thing.

This works for configs set in dbt_project.yml, which are stored as unrendered Jinja expressions, and passed twice into calculate_node_config (once with rendering, once without).

However, we've already rendered all fields in (non-project) yaml files, during initial parsing:

dbt-core/core/dbt/parser/schemas.py

Lines 633 to 635 in 899b0ef

# Render the data (except for tests and descriptions).

# See the SchemaYamlRenderer

entry = self.render_entry(entry)

So we're passing already-rendered Jinja expressions into calculate_node_config, and we never get access to their unrendered forms. We'd need a subtle reordering of parsing/rendering to make it work as intended.

Related: #2744, #3576

nathaniel-may · 2022-04-08T15:11:35Z

@jtcohen6 I definitely requested your review on the wrong PR. This one will be for the full feature. The enabled flag work is going on in #5008. Thank you for all the context here!

github-actions · 2022-10-06T02:14:10Z

This PR has been marked as Stale because it has been open for 180 days with no activity. If you would like the PR to remain open, please remove the stale label or comment on the PR, or it will be closed in 7 days.

emmyoop and others added 8 commits March 25, 2022 13:35

initial pass at source config test w/o overrides

1f81ea3

Update tests/functional/sources/test_source_configs.py

49797a6

Co-authored-by: Jeremy Cohen <jeremy@dbtlabs.com>

Update tests/functional/sources/test_source_configs.py

fe9e004

Co-authored-by: Jeremy Cohen <jeremy@dbtlabs.com>

tweaks from feedback

913e9ed

clean up some test logic - add override tests

e469e88

add new fields to source config class

8d00f6c

fix odd formatting

6337a4f

got a test working

8d984b1

cla-bot bot added the cla:yes label Apr 6, 2022

emmyoop changed the title ~~Feature/ct 201 source config~~ [Feature] CT-201 source config WIP Apr 6, 2022

emmyoop mentioned this pull request Apr 6, 2022

initial pass at source config tests w/o overrides #4960

Closed

4 tasks

jtcohen6 reviewed Apr 8, 2022

View reviewed changes

jtcohen6 mentioned this pull request Apr 8, 2022

Add enabled as a source config #5008

Merged

4 tasks

github-actions bot added the stale Issues that have gone stale label Oct 6, 2022

emmyoop closed this Oct 6, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature] CT-201 source config WIP #5003

[Feature] CT-201 source config WIP #5003

emmyoop commented Apr 6, 2022 •

edited

Loading

github-actions bot commented Apr 6, 2022

jtcohen6 left a comment

jtcohen6 Apr 8, 2022

jtcohen6 Apr 8, 2022

jtcohen6 Apr 8, 2022

jtcohen6 Apr 8, 2022

jtcohen6 Apr 8, 2022

nathaniel-may commented Apr 8, 2022

github-actions bot commented Oct 6, 2022

	def same_config(self, old: T) -> bool:
	return self.config.same_contents(
	self.unrendered_config,
	old.unrendered_config,
	)

	# Render the data (except for tests and descriptions).
	# See the SchemaYamlRenderer
	entry = self.render_entry(entry)

[Feature] CT-201 source config WIP #5003

[Feature] CT-201 source config WIP #5003

Conversation

emmyoop commented Apr 6, 2022 • edited Loading

Description

Checklist

github-actions bot commented Apr 6, 2022

jtcohen6 left a comment

Choose a reason for hiding this comment

jtcohen6 Apr 8, 2022

Choose a reason for hiding this comment

jtcohen6 Apr 8, 2022

Choose a reason for hiding this comment

jtcohen6 Apr 8, 2022

Choose a reason for hiding this comment

jtcohen6 Apr 8, 2022

Choose a reason for hiding this comment

jtcohen6 Apr 8, 2022

Choose a reason for hiding this comment

nathaniel-may commented Apr 8, 2022

github-actions bot commented Oct 6, 2022

emmyoop commented Apr 6, 2022 •

edited

Loading