
[Features] Support FP16 training #198

Merged 2 commits into main on Feb 3, 2023

Conversation

@rentainhe (Collaborator) commented on Feb 2, 2023

TODO

  • Support fp16 training, which reduces GPU memory usage by roughly 20-30%.
  • fp16 training baseline on dino-r50-4scale-12ep: 49.1 AP (with amp) vs. 49.2 AP (without amp).

Note

For MultiScaleDeformableAttention, we simply cast the input value to torch.float32 before the operator and cast its output back to torch.float16 afterwards. In other words, we skip fp16 and run the MultiScaleDeformableAttention operator entirely in fp32, as sketched below.
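
A minimal sketch of this fp32 fallback, where sampling_fn is a placeholder standing in for the custom MultiScaleDeformableAttention kernel call (this is not the actual detrex code):

import torch

def forward_fp32(value, sampling_fn, *args):
    # hypothetical wrapper; sampling_fn stands in for the
    # MultiScaleDeformableAttention operator
    orig_dtype = value.dtype
    if orig_dtype == torch.float16:
        value = value.float()           # fp16 -> fp32 before the operator
    output = sampling_fn(value, *args)  # computation runs in fp32
    return output.to(orig_dtype)        # cast back to fp16 for the rest of the model

The rest of the model keeps running in fp16; only this operator computes in fp32.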

Usage

Start fp16 training by setting train.amp.enabled=True:

python tools/train_net.py \
    --config-file projects/dab_detr/configs/dab_detr_r50_50ep.py \
    --num-gpus 8 \
    train.amp.enabled=True
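
Setting train.amp.enabled=True enables automatic mixed precision (amp). For reference, the standard torch.cuda.amp pattern this kind of training relies on looks roughly like the generic sketch below (this is not detrex's actual trainer loop; model, optimizer, and data_loader are placeholders):

import torch
from torch.cuda.amp import autocast, GradScaler

scaler = GradScaler()
for images, targets in data_loader:        # placeholder data loader
    optimizer.zero_grad()
    with autocast():                       # forward pass in mixed precision
        loss_dict = model(images, targets) # placeholder detection model
        losses = sum(loss_dict.values())
    scaler.scale(losses).backward()        # scale loss to avoid fp16 gradient underflow
    scaler.step(optimizer)                 # unscales gradients, then steps the optimizer
    scaler.update()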

@rentainhe rentainhe changed the title Support FP16 training [Features] Support FP16 training Feb 2, 2023
@zengzhaoyang zengzhaoyang merged commit 79f8bb7 into main Feb 3, 2023
@rentainhe rentainhe deleted the support_fp16_training branch February 3, 2023 02:23
@FabianSchuetze

Thanks for this commit!

Did you observe instabilities when using the deformable attention layer with fp16? Is there another reason why the deformable attention layer cannot be used with fp16?

Lontoone pushed a commit to Lontoone/detrex that referenced this pull request on Jan 8, 2024: [Features] Support FP16 training