[EPIC] Migrate Python notebooks that belong to a single job #1204

Open
8 of 14 tasks
Tracked by #103 ...
nfx opened this issue Apr 1, 2024 · 0 comments
Labels
CUJ (critical user journey), migrate/code (Abstract Syntax Trees and other dark magic), migrate/jobs (Step 5 - Upgrading Jobs for External Tables)

Comments


nfx commented Apr 1, 2024

Most Databricks Workflows are just scheduled Python notebooks. We need to migrate all relevant parts of the code in those notebooks, including but not limited to:

Proposed Solution

Additional Context

No response

nfx added the CUJ (critical user journey) and migrate/code (Abstract Syntax Trees and other dark magic) labels Apr 1, 2024
nfx pushed a commit that referenced this issue Apr 10, 2024
## Changes
 - integrates the dependency graph into NotebookMigrator

### Linked issues
#1204 
#1286 
#1326

---------

Co-authored-by: Cor <jczuurmond@protonmail.com>
nfx changed the title from "[FEATURE]: Migrate Python notebooks that belong to a single job" to "[EPIC]: Migrate Python notebooks that belong to a single job" Apr 22, 2024
nfx added the migrate/jobs (Step 5 - Upgrading Jobs for External Tables) label Apr 22, 2024
This was referenced Apr 22, 2024
nfx changed the title from "[EPIC]: Migrate Python notebooks that belong to a single job" to "[EPIC] Migrate Python notebooks that belong to a single job" Apr 22, 2024
nfx added a commit that referenced this issue May 7, 2024
```mermaid
flowchart TD
    job -->|has many| job_task
    job_task -.-> notebook_task
    job_task -.-> wheel_task 

    job -.-> git_source

    job_task -.->|execute on| interactive_cluster
    interactive_cluster -.-> library

    job_task -.-> library
    library -.-> wheel_on_dbfs
    library -.-> wheel_on_wsfs
    library -.-> wheel_on_volumes
    library -.-> egg_on_dbfs
    library -.-> egg_on_wsfs
    library -.-> pypi
    wheel_task -.-> wheel_on_dbfs
    wheel_task -.-> wheel_on_wsfs

    wheel_on_dbfs -.-> python_file
    wheel_on_wsfs -.-> python_file
    egg_on_dbfs -.-> python_file
    egg_on_wsfs -.-> python_file
    pypi -.-> python_file
    wsfs_file -.-> python_file
    python_file -.->|import| python_file
    notebook_task -.-> notebook
    notebook -.->|import| python_file
    notebook -.->|can run| notebook

    job_task -.-> dependency_graph
    python_file --> dependency_graph
    notebook --> dependency_graph

    git_source -.-> python_file
    git_source -.-> notebook
    lint_local_code_cli --> dependency_graph

    workflow_linter --> dependency_graph
    workflow_linter -.-> job_problems
    dependency_graph -.-> job_problems
    job_problems -.->|viz| redash_dashboard
```

This PR adds a baseline for linting workflows.
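
To make the flowchart more concrete, here is a minimal Python sketch of how a workflow linter could walk a dependency graph from a task's root notebook and collect job problems. It is an illustration only, not the code from this PR: `Node`, `DependencyGraph`, `JobProblem`, `lint_task`, the example paths, and the two checks (missing dependency, literal `dbfs:/` reference) are all hypothetical names and rules chosen for the example. Only the relations (`import`, `can run`) and the node kinds are taken from the diagram.

```python
from dataclasses import dataclass, field
from typing import Iterator, Optional


@dataclass
class Node:
    """A notebook or python_file from the diagram (hypothetical model)."""
    path: str
    kind: str  # "notebook" or "python_file"
    source: str = ""
    # (relation, target path), e.g. ("import", "/lib/utils.py")
    edges: list[tuple[str, str]] = field(default_factory=list)


@dataclass(frozen=True)
class JobProblem:
    task_key: str
    path: str
    message: str


class DependencyGraph:
    def __init__(self) -> None:
        self._nodes: dict[str, Node] = {}

    def add(self, node: Node) -> None:
        self._nodes[node.path] = node

    def walk(self, root: str) -> Iterator[tuple[str, Optional[Node]]]:
        """Depth-first walk that visits each path once, so cycles terminate."""
        seen: set[str] = set()
        stack = [root]
        while stack:
            path = stack.pop()
            if path in seen:
                continue
            seen.add(path)
            node = self._nodes.get(path)
            yield path, node  # node is None when the dependency is unresolved
            if node is not None:
                # Reverse so the first declared edge is visited first.
                stack.extend(target for _, target in reversed(node.edges))


def lint_task(graph: DependencyGraph, task_key: str, root: str) -> list[JobProblem]:
    """Collect problems for everything reachable from a task's root notebook."""
    problems = []
    for path, node in graph.walk(root):
        if node is None:
            problems.append(JobProblem(task_key, path, "dependency not found"))
        elif "dbfs:/" in node.source:  # placeholder rule, assumed for the example
            problems.append(JobProblem(task_key, path, "direct DBFS reference"))
    return problems


graph = DependencyGraph()
graph.add(Node("/jobs/main", "notebook", edges=[
    ("can run", "/jobs/helper"), ("import", "/lib/utils.py")]))
graph.add(Node("/jobs/helper", "notebook", edges=[
    ("can run", "/jobs/main"), ("import", "/lib/missing.py")]))
graph.add(Node("/lib/utils.py", "python_file",
               source='spark.read.csv("dbfs:/mnt/raw")',
               edges=[("import", "/lib/utils.py")]))

problems = lint_task(graph, "ingest", "/jobs/main")
for problem in problems:
    print(problem)
```

The example graph deliberately contains a notebook cycle and a self-import, since the diagram allows both `notebook -.->|can run| notebook` and `python_file -.->|import| python_file`. The `seen` set is what keeps the walk from looping.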

Related to:
- #1542 
- #1541
- #1540
- #1539
- #1382
- #1204
- #1203
- #1085

closes #1559
closes #1468
closes #1286
Projects
Status: Month Backlog