Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature] state:modified consider external table column name changes as changes #317

Open
dat-a-ish opened this issue Sep 15, 2024 · 1 comment
Labels
enhancement New feature or request triage

Comments

@dat-a-ish
Copy link

Describe the feature

Hey, external tables column changes aren't considered as a change when we run:

 dbt ls --select "state:modified"

I believe that this happens because it's a source definition.
Is it possible to include column changes to state:modified for external tables?

Additional context

This is very similar to this issue. The difference is that columns' definition for regular sources can't really break downstream models, unlike columns' definition for external tables' columns.

Who will this benefit?

Anyone who wants to run CI tests, build downstream models each time external tables' columns were changed, and catch a wrong reference problem before production run.

@dat-a-ish dat-a-ish added enhancement New feature or request triage labels Sep 15, 2024
@afaehnrich
Copy link

afaehnrich commented Dec 18, 2024

Poor man's solution
I ran into the same problem. As long as this feature is waiting to be added to DBT, you can use a python script and the module dictdiffer to detect changes in sources. The output of this script can be used for DBT's select arg. This script will also mark a source as modified if only the description changed. While that's fine for me, you may want to design something more elaborate.
You have to parse your current model before running this script.

import json
import dictdiffer

with open('target/manifest.json') as f:
    target_sources = json.load(f).get('sources', {})
for s in target_sources:
    target_sources[s].pop('created_at', None)

with open('workspace/prod_state/manifest.json') as f:
    prod_sources = json.load(f).get('sources', {})
for s in prod_sources:
    prod_sources[s].pop('created_at', None)

modified_sources = [
    s for s in target_sources
    if bool(next(dictdiffer.diff(prod_sources.get(s, {}), target_sources[s]), False))
]
if modified_sources:
    print("+ ".join(modified_sources)+"+")

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request triage
Projects
None yet
Development

No branches or pull requests

2 participants