Skip to content
This repository has been archived by the owner on Dec 4, 2024. It is now read-only.

Latest commit

 

History

History
162 lines (98 loc) · 10.2 KB

CONTRIBUTING.md

File metadata and controls

162 lines (98 loc) · 10.2 KB

Contributing to dbt-server

dbt-server is source available software. We intend for this repository to embody the open source spirit at dbt Labs -- we welcome you to contribute to the knowledge loop. Whether you are a seasoned open source contributor or a first-time committer, we welcome you to contribute code, documentation, ideas, or problem statements to this project.

Important: we are not expecting external PRs and may not respond in a timely fashion. The dbt Server is intended to a be thin wrapper around dbt and, after a period of initial development to v1.0, may not see frequent changes. If you're looking to make open source contributions in the dbt ecosystem, we recommend starting a discussion in dbt-core.

The Runtime team at dbt Labs maintains this repository.

About this document

There are many ways to contribute to the ongoing development of dbt-server, such as by participating in discussions and issues. We encourage you to first read our higher-level document: "Expectations for Open Source Contributors".

The rest of this document serves as a more granular guide for contributing code changes to dbt-server (this repository). It is not intended as a guide for using dbt-server. Specific code snippets in this guide may assume you are using macOS or Linux and are comfortable with the command line.

Notes

  • CLA: Please note that anyone contributing code to dbt-server must sign the Contributor License Agreement. If you are unable to sign the CLA, the dbt-server maintainers will unfortunately be unable to merge any of your Pull Requests. We welcome you to participate in discussions, open issues, and comment on existing ones.
  • Branches: All pull requests from community contributors should target the main branch (default).

External contributors

If you are not a member of the dbt-labs GitHub organization, you can contribute to dbt-server by forking the dbt-server repository. For a detailed overview on forking, check out the GitHub docs on forking. In short, you will need to:

  1. Fork the dbt-server repository
  2. Clone your fork locally
  3. Check out a new branch for your proposed changes
  4. Push changes to your fork
  5. Open a pull request against dbt-labs/dbt-server from your forked repository

dbt Labs contributors

If you are a member of the dbt-labs GitHub organization, you will have push access to the dbt-server repo. Rather than forking dbt-server to make your changes, just clone the repository, check out a new branch, and push directly to that branch.

Adding a CHANGELOG Entry

We use changie to generate CHANGELOG entries. Note: Do not edit the CHANGELOG.md directly. Your modifications will be lost.

Follow the steps to install changie for your system.

Once changie is installed and your PR is created, simply run changie new and changie will walk you through the process of creating a changelog entry. Commit the file that's created and your changelog entry is complete!

You don't need to worry about which dbt-server version your change will go into. Just create the changelog entry with changie, and open your PR against the main branch. All merged changes will be included in the next minor version of dbt-server. The Server maintainers may choose to "backport" specific changes in order to patch older minor versions. In that case, a maintainer will take care of that backport after merging your PR, before releasing the new version of dbt-server.

Submitting a Pull Request

Code can be merged into the current development branch main by opening a pull request. A dbt-server maintainer will review your PR. They may suggest code revision for style or clarity, or request that you add unit or integration test(s). These are good things! We believe that, with a little bit of help, anyone can contribute high-quality code.

Automated tests run via GitHub Actions. If you're a first-time contributor, all tests (including code checks and unit tests) will require a maintainer to approve. Changes in the dbt-server repository trigger integration tests.

Once all tests are passing and your PR has been approved, a dbt-server maintainer will merge your changes into the active development branch. And that's it! Happy developing 🎉

Sometimes, the content license agreement auto-check bot doesn't find a user's entry in its roster. If you need to force a rerun, add @cla-bot check in a comment on the pull request.

Developer setup

Install Python

It is recommended to use pyenv to manage versions of Python you have installed on your machine. Follow their installation instructions to get set up. If you are on MacOS, we recommend using the homebrew installation instructions.

Today, we do not standardize the Python versions for development environments. This will likely change in the future. For now, use pyenv to install the latest, long-term stable version of Python. For instance: pyenv install 3.10.5. Then, instruct pyenv to use that version of Python every time you work on this project: pyenv local 3.10.5. This will create a .python-version file in this directory. It is git-ignored, so it's okay if it is unique to you. Now, every time you cd into this directory, the specified version of Python will be activated.

Setup a virtual environment

If you are not familiar with Python virtual environments, you may find this primer from Real Python helpful. There are several approaches to managing virtual environments. If you are new to virtual environments, then you can't go wrong with managing your environment directly as you get familiar with the concepts.

First, create a virtual environment called "venv" in the root of this repository

python -m venv venv

Now, every time you work on this project, you should activate the virtual environment. That way, you are using the version of Python and the packages that are installed for this project only, instead of your global Python installation.

source venv/bin/activate

Install dependencies

Install requirements

For development, you need to install production and dev dependencies.

pip install -r requirements.txt -r dev-requirements.txt

Install dbt and adapter

Additionally, dbt-server expects that dbt is available in the Python environment. You can install dbt by first picking the adapter you want to work with and installing that. dbt-core will be installed alongside the adapter automatically.

If you don't have a particular database you want to work with, Postgres is a good choice. The adapter is well-used, has first-party support, and you can run Postgres locally.

pip install dbt-postgres # This installs dbt-core as well

More information on installing dbt can be found here.

You will also need Postgres installed on your machine in order to use dbt-postgres locally. There a number of ways to install / run Postgres on your machine (homebrew, asdf, docker). If you don't have a preference, you can pick your platform and download from the official download page.

Create a dbt project for dbt-server to reference

dbt-server expects for a client to "push" a project to it, so to manually interact with dbt-server, you need a dbt project on your machine. If you don't have a particular project that you want to work with, we recommend using dbt-labs/jaffle_shop_metrics as an approachable sample project.

Make sure that you set up / clone the dbt project outside of this repository. The dbt project should be on your machine, but in a different directory than dbt-server.

# From somewhere on your machine that you want this project to live. Perhaps ~/repos.
git clone https://github.com/dbt-labs/jaffle_shop_metrics.git

Run tests

This project uses pytest for testing, which is installed with your development dependencies. To run all tests, simply run pytest in the root directory of this project.

pytest

Run the server in development mode

From the root of this repository, make sure your virutal environment is activated.

uvicorn dbt_server.server:app --reload --host=127.0.0.1 --port 8580

This will start the Fast API server in development mode, with hot-code reloading.

Build and run for production

ALLOW_ORCHESTRATED_SHUTDOWN

ALLOW_ORCHESTRATED_SHUTDOWN overrides the signal handling for SIGINT and SIGTERM to allow the server to continue accepting requests after receiving either signal. A second signal will terminate the server normally. ALLOW_ORCHESTRATED_SHUTDOWN also enables the POST /shutdown endpoint, which can be called by another application that controls the lifetime of dbt server.

NOTE: The signal handling overrides do not work properly with the --reload flag, so be sure to remove it to test or iterate on this functionality (everything else works fine and the server will restart as expected).

To run with ALLOW_ORCHESTRATED_SHUTDOWN, set the env var ALLOW_ORCHESTRATED_SHUTDOWN=on:

ALLOW_ORCHESTRATED_SHUTDOWN=on uvicorn dbt_server.server:app --reload --host=127.0.0.1 --port 8580

Building docker container locally

docker build -f Dockerfile . -t dbt-server-<dbt-core-version>:latest --build-arg DBT_CORE_VERSION=<dbt-core-version> --build-arg DBT_DATABASE_ADAPTER_PACKAGE=dbt-<database-adapter>

Example for building container for dbt-core v0.21.0 with the Snowflake database adapter:

docker build -f Dockerfile . -t dbt-server-0.21.0:latest --build-arg DBT_CORE_VERSION=0.21.0 --build-arg DBT_DATABASE_ADAPTER_PACKAGE=dbt-snowflake