Intel® Gaudi® AI Accelerator Examples for Training and Inference

Model List and Performance Data

Please visit this page for performance information.

This repository is a collection of models that have been ported to run on the Intel Gaudi AI accelerator. They are intended as examples and are reasonably optimized for performance while remaining easy to read.

Computer Vision

Models	Framework	Validated on Gaudi	Validated on Gaudi 2
ResNet50, ResNeXt101	PyTorch	Training	Training, Inference
ResNet152	PyTorch	Training	-
MobileNetV2	PyTorch	Training	-
UNet 2D, UNet 3D	PyTorch Lightning	Training, Inference	Training, Inference
SSD	PyTorch	Training	Training
GoogLeNet	PyTorch	Training	-
Vision Transformer	PyTorch	Training	-
DINO	PyTorch	Training	-
YOLOX	PyTorch	Training	-

Natural Language Processing

Models	Framework	Validated on Gaudi	Validated on Gaudi 2
BERT Pretraining and Finetuning	PyTorch	Training, Inference	Training, Inference
DeepSpeed BERT-1.5B, BERT-5B	PyTorch	Training	-
BART	PyTorch	Training	-

Audio

Models	Framework	Validated on Gaudi	Validated on Gaudi 2
Wav2Vec2ForCTC	PyTorch	Inference	Inference

Generative Models

Models	Framework	Validated on Gaudi	Validated on Gaudi 2
Stable Diffusion	PyTorch Lightning	Training	Training
Stable Diffusion FineTuning	PyTorch	Training	Training

MLPerf™ Training 4.0

Models	Framework	Validated on Gaudi	Validated on Gaudi 2
GPT3	PyTorch	-	Training
Llama 70B LoRA	PyTorch	-	Training

MLPerf™ Inference 4.0

Models	Framework	Validated on Gaudi	Validated on Gaudi 2
Llama 70B	PyTorch	-	Inference
Stable Diffusion XL	PyTorch	-	Inference

Reporting Bugs/Feature Requests

We welcome you to use the GitHub issue tracker to report bugs or suggest features.

When filing an issue, please check existing open or recently closed issues to make sure it has not already been reported. Include as much information as you can. Details like these are especially useful:

A reproducible test case or series of steps
The version of our code being used
Any modifications you've made relevant to the bug
Anything unusual about your environment or deployment
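
As an illustration of the details listed above, here is a minimal Python sketch that collects the code version and basic environment facts to paste into an issue. It is not part of this repository: the function name `collect_issue_details` and the output keys are hypothetical, and it assumes the code was obtained as a git checkout.

```python
import platform
import subprocess
import sys


def collect_issue_details(repo_dir="."):
    """Gather basic facts worth pasting into a bug report (hypothetical helper)."""
    try:
        # Commit hash of the checkout, standing in for "the version of our code".
        commit = subprocess.run(
            ["git", "-C", repo_dir, "rev-parse", "HEAD"],
            capture_output=True,
            text=True,
            check=True,
        ).stdout.strip()
    except (OSError, subprocess.CalledProcessError):
        # git missing, or repo_dir is not a git checkout.
        commit = "unknown (not a git checkout, or git is not installed)"
    return {
        "code_version": commit,
        "python": sys.version.split()[0],
        "platform": platform.platform(),
    }


if __name__ == "__main__":
    for key, value in collect_issue_details().items():
        print(f"{key}: {value}")
```

Local modifications and unusual deployment details still need to be described by hand; the script covers only what can be read from the machine.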

Community

Hugging Face

All supported models are available in the Optimum Habana project at https://github.com/huggingface/optimum-habana/ and as model cards at https://huggingface.co/Habana.

Megatron-DeepSpeed

Megatron-DeepSpeed has moved to a new GitHub repository, HabanaAI/Megatron-DeepSpeed.

DeepSpeed-Chat

This model has moved to a new GitHub repository, HabanaAI/DeepSpeedExample.

Fairseq

Transformer
