PyTorch implementation of the Switch Transformer paper. Also see my blog post covering the paper.
- Now supports the latest aux-loss-free load balancing technique from this paper. Simply pass `use_biased_gating=True` when instantiating the `SwitchTransformer` class; everything else is taken care of!
```python
switch_transformer = SwitchTransformer(
    inp_dim,
    num_experts,
    num_heads,
    vocab_size,
    use_biased_gating=True,
).cuda()
```
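For context, aux-loss-free load balancing keeps a small per-expert bias that is added to the router logits only when selecting an expert, and nudges that bias toward under-loaded experts after each batch instead of adding an auxiliary loss term. Below is a minimal conceptual sketch of the idea; the function and argument names are illustrative, not this repo's internals:

```python
import torch

# Conceptual sketch of aux-loss-free (biased) top-1 gating; names are illustrative.
def biased_top1_routing(router_logits, expert_bias):
    # the bias only influences which expert is chosen, not the gating weight
    chosen = (router_logits + expert_bias).argmax(dim=-1)            # (tokens,)
    gates = torch.softmax(router_logits, dim=-1)                     # (tokens, experts)
    gate_weight = gates.gather(-1, chosen.unsqueeze(-1)).squeeze(-1)
    return chosen, gate_weight

def update_expert_bias(expert_bias, chosen, num_experts, update_rate=1e-3):
    # raise the bias of under-loaded experts, lower it for over-loaded ones
    load = torch.bincount(chosen, minlength=num_experts).float()
    return expert_bias + update_rate * torch.sign(load.mean() - load)
```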
- Clone the repo

```bash
git clone https://github.com/srishti-git1110/torch-switch-transformers.git
```

- Navigate to the correct directory

```bash
cd torch-switch-transformers
```

- Install the required dependencies

```bash
pip install -r requirements.txt
```
- Usage

```python
import torch
from switch_transformers import SwitchTransformer

inp_dim = 512
num_experts = 8
num_heads = 8
vocab_size = 50000

switch_transformer = SwitchTransformer(
    inp_dim,
    num_experts,
    num_heads,
    vocab_size,
    use_aux_loss=True,  # optional, since this is the default when use_biased_gating is not True
).cuda()

x = torch.randn(2, 1024, inp_dim).cuda()
output, total_aux_loss = switch_transformer(x)
```
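In a training loop, `total_aux_loss` is typically added to the task loss with a small coefficient. A hedged sketch, assuming `output` has shape `(batch, seq_len, vocab_size)`; the `0.01` weight is an illustrative hyperparameter, not a repo default:

```python
import torch.nn.functional as F

targets = torch.randint(0, vocab_size, (2, 1024)).cuda()
task_loss = F.cross_entropy(output.reshape(-1, output.size(-1)), targets.reshape(-1))
loss = task_loss + 0.01 * total_aux_loss  # aux-loss weight is an assumed value
loss.backward()
```

To use the aux-loss-free biased gating instead, instantiate the model with `use_biased_gating=True`: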
```python
switch_transformer = SwitchTransformer(
    inp_dim,
    num_experts,
    num_heads,
    vocab_size,
    use_biased_gating=True,
).cuda()
```
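A forward pass then looks the same as above; whether this mode still returns an auxiliary loss alongside the output is an assumption to verify against the repo's `forward()` signature:

```python
x = torch.randn(2, 1024, inp_dim).cuda()
output = switch_transformer(x)  # return value in biased-gating mode: check forward()
```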