AIOps modules is a collection of reusable Infrastructure as Code (IaC) modules for Machine Learning (ML), Foundation Models (FM), Large Language Models (LLM) and GenAI development and operations on AWS

nagyen/aiops-modules
AIOps Modules

AIOps modules is a collection of reusable Infrastructure as Code (IaC) modules that work with the SeedFarmer CLI. Please see the DOCS for all things seed-farmer.

The modules in this repository are decoupled from each other and can be aggregated using the GitOps (manifest file) principles provided by seedfarmer to achieve the desired use cases. This removes the undifferentiated heavy lifting for end users by providing hardened modules, letting them focus on building their business on top of them.
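
As a rough illustration of how decoupled modules might be wired together through a manifest, the sketch below models two module entries as Python dictionaries, with one module reading an output of the other through a `valueFrom` / `moduleMetadata` reference. The module names, paths, parameter names, group name, and metadata key are all hypothetical and are not taken from this repository. Real seed-farmer manifests are YAML files, so consult the seed-farmer DOCS for the authoritative schema.

```python
import json


def literal(name, value):
    """Parameter with a fixed value."""
    return {"name": name, "value": value}


def from_module(name, group, module, key):
    """Parameter resolved from another module's exported metadata."""
    return {
        "name": name,
        "valueFrom": {
            "moduleMetadata": {"group": group, "name": module, "key": key}
        },
    }


def module_entry(name, path, parameters):
    """One module entry in a group manifest."""
    return {"name": name, "path": path, "parameters": parameters}


def dependencies(module):
    """Return (group, module) pairs this module reads metadata from."""
    deps = []
    for param in module["parameters"]:
        ref = param.get("valueFrom", {}).get("moduleMetadata")
        if ref:
            deps.append((ref["group"], ref["name"]))
    return deps


# Hypothetical modules: names, paths, and keys are illustrative only.
event_bus = module_entry(
    "event-bus",
    "modules/examples/event-bus",
    [literal("event-bus-name", "mlops-bus")],
)
model_group = module_entry(
    "model-package-group",
    "modules/sagemaker/sagemaker-model-package-group",
    [
        literal("model-package-group-name", "demo-models"),
        from_module("target-event-bus-arn", "core", "event-bus", "EventBusArn"),
    ],
)

if __name__ == "__main__":
    # Printed as JSON only for inspection; real manifests are YAML.
    print(json.dumps([event_bus, model_group], indent=2))
    print(dependencies(model_group))
```

The modules stay independent because the only coupling is the metadata reference in the manifest, not any code shared between them.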

General Information

The modules in this repository must be generic and reusable, without affiliation to any one particular project in the Machine Learning and Foundation Model Operations domain.

All modules in this repository adhere to the module structure defined in the SeedFarmer Guide.

Deployment

See deployment steps in the Deployment Guide.

Modules

SageMaker Modules

| Type | Description |
| --- | --- |
| SageMaker Studio Module | Provisions a secure SageMaker Studio Domain environment, creates example User Profiles for Data Scientist and Lead Data Scientist linked to IAM Roles, and adds a lifecycle config |
| SageMaker Endpoint Module | Creates a SageMaker real-time inference endpoint for the specified model package or the latest approved model from the model package group |
| SageMaker Project Templates via Service Catalog Module | Provisions SageMaker Project Templates for an organization. The templates are available using SageMaker Studio Classic or Service Catalog. Available templates: train a model on the Abalone dataset using XGBoost; perform batch inference; multi-account model deployment; HuggingFace model import template |
| SageMaker Notebook Instance Module | Creates a secure SageMaker Notebook Instance for the Data Scientist and clones the source code to the workspace |
| SageMaker Custom Kernel Module | Builds a custom kernel for SageMaker Studio from a Dockerfile |
| SageMaker Model Package Group Module | Creates a SageMaker Model Package Group to register and version SageMaker Machine Learning (ML) models, and sets up an Amazon EventBridge Rule to send model package group state change events to an Amazon EventBridge Bus |
| SageMaker Model Package Promote Pipeline Module | Deploys a pipeline to promote SageMaker Model Packages in a multi-account setup. The pipeline can be triggered through an EventBridge rule in reaction to a SageMaker Model Package Group state change event (Approved/Rejected). Once the pipeline is triggered, it promotes the latest approved model package, if one is found |
| SageMaker Model Monitoring Module | Deploys data quality, model quality, model bias, and model explainability monitoring jobs that run against a SageMaker Endpoint |
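
Both the endpoint module and the promote pipeline rely on finding the latest approved model package in a model package group. The following is a minimal sketch of how that lookup could be done with the SageMaker `list_model_packages` API. It is not the modules' actual implementation, and the function name is an assumption made for illustration. The client is passed in so the function can be exercised without AWS credentials.

```python
from typing import Any, Optional


def latest_approved_model_package_arn(sm_client: Any, group_name: str) -> Optional[str]:
    """Return the ARN of the newest approved package in the group, or None.

    `sm_client` is expected to behave like boto3.client("sagemaker").
    """
    response = sm_client.list_model_packages(
        ModelPackageGroupName=group_name,
        ModelApprovalStatus="Approved",  # skip pending and rejected versions
        SortBy="CreationTime",
        SortOrder="Descending",          # newest first
        MaxResults=1,
    )
    summaries = response.get("ModelPackageSummaryList", [])
    if not summaries:
        # No approved package exists yet; the caller decides what to do.
        return None
    return summaries[0]["ModelPackageArn"]
```

With boto3 installed and credentials configured, this could be called as `latest_approved_model_package_arn(boto3.client("sagemaker"), "my-group")`, where `my-group` is a placeholder group name. Returning `None` rather than raising mirrors the "if one is found" behavior described for the promote pipeline.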

Mlflow Modules

| Type | Description |
| --- | --- |
| Mlflow Image Module | Creates an Mlflow Docker container image and pushes the image to Elastic Container Registry |
| Mlflow on AWS Fargate Module | Runs the Mlflow container on AWS Fargate in a load-balanced Elastic Container Service. Supports Elastic File System and Relational Database Store for metadata persistence, and S3 for artifact store |

FMOps/LLMOps Modules

| Type | Description |
| --- | --- |
| SageMaker JumpStart Foundation Model Endpoint Module | Creates an endpoint for a SageMaker JumpStart Foundation Model |
| SageMaker Hugging Face Foundation Model Endpoint Module | Creates an endpoint for a SageMaker Hugging Face Foundation Model |
| Amazon Bedrock Finetuning Module | Creates a pipeline that automatically triggers Amazon Bedrock Finetuning |
| AppSync Knowledge Base Ingestion and Question and Answering RAG Module | Creates a GraphQL endpoint for ingesting data and using the ingested data as a knowledge base for a Question and Answering model using RAG |

MWAA Modules

| Type | Description |
| --- | --- |
| Example DAG for MLOps Module | Deploys a sample DAG in MWAA that demonstrates MLOps, using the MWAA module from IDF |

EKS Modules

| Type | Description |
| --- | --- |
| Ray on EKS Module | Provisions Ray on an EKS cluster using the IDF EKS module, Ray Operator, and RayJob or RayCluster via Custom Resources |

Example Modules

| Type | Description |
| --- | --- |
| Event Bus Module | Creates an Amazon EventBridge Bus for cross-account events |
| Personas Module | An example module that creates various roles required for an AI/ML project |
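
To show how a producer in another account might address such a bus, the sketch below builds a single entry in the shape accepted by the EventBridge `put_events` API, using the bus ARN as `EventBusName`. The function name, the ARN, the source, and the detail payload are hypothetical. Cross-account delivery also assumes the bus has a resource policy that allows the sending account, which this sketch does not set up.

```python
import json
from typing import Any, Dict


def build_event_entry(
    bus_arn: str, source: str, detail_type: str, detail: Dict[str, Any]
) -> Dict[str, str]:
    """Build one put_events entry targeting a (possibly cross-account) bus."""
    return {
        "EventBusName": bus_arn,       # a bus name or a full ARN is accepted
        "Source": source,
        "DetailType": detail_type,
        "Detail": json.dumps(detail),  # Detail must be a JSON string
    }


# Hypothetical values, for illustration only.
entry = build_event_entry(
    bus_arn="arn:aws:events:us-east-1:111122223333:event-bus/mlops-bus",
    source="example.mlops",
    detail_type="Example Model Event",
    detail={"ModelPackageGroupName": "demo-models", "Status": "Approved"},
)
```

With boto3 available, the entry could then be sent with `boto3.client("events").put_events(Entries=[entry])`.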

Industry Data Framework (IDF) Modules

The modules in this repository are compatible with Industry Data Framework (IDF) Modules and can be used together within the same deployment. Refer to examples/manifests for examples.

Autonomous Driving Data Framework (ADDF) Modules

The modules in this repository are compatible with Autonomous Driving Data Framework (ADDF) Modules and can be used together within the same deployment.
