Skip to content

Commit

Permalink
update ut/doc for glm/codegen (microsoft#4057)
Browse files Browse the repository at this point in the history
* update ut/doc for glm/codegen

* formatting/spacing on docs

* re-order/alphabetize the models

---------

Co-authored-by: Logan Adams <114770087+loadams@users.noreply.github.com>
Co-authored-by: Logan Adams <loadams@microsoft.com>
  • Loading branch information
3 people authored and polisettyvarma committed Aug 7, 2023
1 parent 0bde488 commit 5425af7
Show file tree
Hide file tree
Showing 2 changed files with 6 additions and 9 deletions.
7 changes: 4 additions & 3 deletions docs/_tutorials/automatic-tensor-parallelism.md
Original file line number Diff line number Diff line change
Expand Up @@ -123,11 +123,14 @@ The following model families have been successfully tested with automatic tensor
- albert
- bert
- bigbird_pegasus
- bloom
- camembert
- codegen
- deberta_v2
- electra
- ernie
- esm
- glm
- gpt-j
- gpt-neo
- gpt-neox
Expand All @@ -136,6 +139,7 @@ The following model families have been successfully tested with automatic tensor
- llama
- m2m_100
- marian
- mpt
- mvp
- nezha
- openai
Expand All @@ -151,14 +155,11 @@ The following model families have been successfully tested with automatic tensor
- xglm
- xlm_roberta
- yoso
- bloom
- mpt

# Unsupported Models

The following models are not currently supported with automatic tensor parallelism. They may still be compatible with other DeepSpeed features (e.g., kernel injection for Bloom):

- codegen
- deberta
- flaubert
- fsmt
Expand Down
8 changes: 2 additions & 6 deletions tests/unit/inference/test_inference.py
Original file line number Diff line number Diff line change
Expand Up @@ -478,12 +478,8 @@ def test(
@pytest.mark.seq_inference
@pytest.mark.parametrize(
"model_w_task",
[
("Helsinki-NLP/opus-mt-en-de", "translation"),
],
ids=[
"marian",
],
[("Helsinki-NLP/opus-mt-en-de", "translation"), ("Salesforce/codegen-350M-mono", "text-generation")],
ids=["marian", "codegen"], #codegen has fusedqkv weight.
)
@pytest.mark.parametrize("dtype", [torch.float16], ids=["fp16"])
class TestAutoTensorParallelism(DistributedTest):
Expand Down

0 comments on commit 5425af7

Please sign in to comment.