This sample shows you how to operationalize your Machine Learning development cycle with Azure Machine Learning Service and Azure Databricks (as a compute target), using Azure DevOps Pipelines as the orchestrator for the whole flow.
By running this project, you will have the opportunity to work with Azure workloads, such as:
| Technology | Objective/Reason |
|---|---|
| Azure DevOps | The platform to help you implement DevOps practices in your scenario |
| Azure Machine Learning Service | Manage Machine Learning models with the power of Azure |
| Azure Databricks | Use its compute power as a remote compute target for training models |
| Azure Container Instances | Deploy Machine Learning models as Docker containers |
This repository contains the base structure for you to start developing your Machine Learning project using:
- Azure Machine Learning Service
- Azure Databricks
- Azure Container Instances
To get all the resources in place, use the following guidance to get your infrastructure ready:
After your infrastructure is set up, it's time to connect Azure DevOps to it and start orchestrating your Machine Learning pipeline.
If you don't have an Azure DevOps account, refer to this doc to set one up.
You will find the resources and docs to have Azure DevOps orchestrate your pipeline by following this guidance:
This code sample reproduces an image classification scenario built on a convolutional neural network. For more details on how the project is organized, check the project structure page.
This project structure was also based on the cookiecutter data science project template.
This sample provides two options for setting up your environment for local development and debugging. The focus of these docs is on setting all the required environment variables so you can invoke and debug this code on your machine.
- See this page for details on how to debug using Visual Studio Code
- See this page for details on how to set the environment variables using Bash
A Machine Learning project is being developed in Python by a team of Data Engineers and Data Analysts.
The team develops the code to train the Machine Learning model, and they need to orchestrate how this code gets tested and how the resulting model gets trained, packaged, and deployed.
Testing the code that generates a model is crucial to the success and accuracy of the model being developed.
The code being developed will produce a Machine Learning model that helps people make decisions, even though the model itself is not ultimately responsible for those decisions.
That's why testing the units of the code to make sure it meets the requirements is a fundamental piece of the development cycle.
You will achieve it using the following capabilities:
- Python Unit Testing frameworks
- Azure DevOps
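As an illustration of the kind of check a build pipeline would run, here is a minimal pytest-style sketch; `normalize_pixels` is a hypothetical helper defined inline for this example only and is not part of this sample's codebase.

```python
# test_preprocessing.py - a minimal pytest sketch.
# `normalize_pixels` is a hypothetical helper shown inline for illustration;
# in practice you would import and test the real functions from your training code.
import numpy as np


def normalize_pixels(images: np.ndarray) -> np.ndarray:
    """Scale raw 0-255 pixel values into the 0-1 range expected by the model."""
    return images.astype("float32") / 255.0


def test_normalize_pixels_stays_in_unit_range():
    raw = np.array([[0, 127, 255]], dtype="uint8")
    normalized = normalize_pixels(raw)
    assert normalized.min() >= 0.0
    assert normalized.max() <= 1.0


def test_normalize_pixels_preserves_shape():
    raw = np.zeros((4, 28, 28), dtype="uint8")
    assert normalize_pixels(raw).shape == raw.shape
```

A build pipeline step that runs `pytest` and publishes the test results to Azure DevOps is typically enough to gate the rest of the flow on these checks.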
This project is all about generating a Machine Learning model, which needs to be trained. Training a model requires compute power and orchestration.
Compute power is commonly an expensive asset, and that's why this project leverages cloud workloads to optimize resource consumption and avoid upfront costs.
To enable this, the following capabilities will be used:
- Machine Learning Python SDKs
- Azure Databricks
- Azure DevOps
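As a rough sketch of how these pieces fit together (assuming the Azure ML Python SDK v1, with placeholder resource names and a `DATABRICKS_ACCESS_TOKEN` environment variable that you would replace with your own values), attaching an existing Azure Databricks workspace as a remote compute target looks like this:

```python
# attach_databricks.py - hedged sketch using the Azure ML SDK v1 (azureml-core).
# Resource group, workspace names, and the token variable are placeholders.
import os

from azureml.core import Workspace
from azureml.core.compute import ComputeTarget, DatabricksCompute

# Load the Azure ML workspace from a local config.json downloaded from the portal.
ws = Workspace.from_config()

# Describe the existing Databricks workspace to attach as a compute target.
attach_config = DatabricksCompute.attach_configuration(
    resource_group="my-resource-group",          # placeholder
    workspace_name="my-databricks-workspace",    # placeholder
    access_token=os.environ["DATABRICKS_ACCESS_TOKEN"],
)

# Attach it under a friendly name and wait until the operation completes.
databricks_compute = ComputeTarget.attach(
    workspace=ws,
    name="databricks-compute",
    attach_configuration=attach_config,
)
databricks_compute.wait_for_completion(show_output=True)
```

Once attached, the compute target can be referenced by name from training pipelines, so the expensive cluster is only used while a run is active.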
The model produced by the training step needs to be deployed somewhere so the edge can consume it. There are a few ways to achieve this; for this scenario, you will deploy the model as part of a Docker container.
A container encapsulates all the dependencies the application needs to run, and it is easily portable across different platforms.
To take advantage of deploying the model to a container, you will use:
- Azure DevOps
- Azure Container Instances
- Azure Machine Learning Service
See this page for details on setting up the release pipeline to deploy the model.
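For orientation only, the sketch below shows the general shape of deploying a registered model to Azure Container Instances with the Azure ML Python SDK v1; the model name, entry script, and environment file are placeholder assumptions, not values taken from this sample.

```python
# deploy_aci.py - hedged sketch using the Azure ML SDK v1 (azureml-core).
# Model, entry script, and environment names below are placeholders.
from azureml.core import Environment, Workspace
from azureml.core.model import InferenceConfig, Model
from azureml.core.webservice import AciWebservice

ws = Workspace.from_config()

# Reference a model previously registered in the Azure ML workspace.
model = Model(ws, name="image-classification-model")        # placeholder name

# Build the inference environment and point at the scoring script.
env = Environment.from_conda_specification(
    name="inference-env", file_path="environment.yml"       # placeholder file
)
inference_config = InferenceConfig(entry_script="score.py", environment=env)

# Size the Azure Container Instance that will host the model.
deployment_config = AciWebservice.deploy_configuration(cpu_cores=1, memory_gb=2)

# Deploy and wait for the container to come online.
service = Model.deploy(
    workspace=ws,
    name="image-classification-aci",
    models=[model],
    inference_config=inference_config,
    deployment_config=deployment_config,
)
service.wait_for_deployment(show_output=True)
print(service.scoring_uri)
```

In the release pipeline, a script along these lines would typically run as a stage after the model is registered, with the resulting scoring URI published for downstream consumers.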