Replies: 4 comments
-
Bumping this. This would still be hugely useful for me.
-
This is important. Can anyone help with this? This is a good repo, thanks a lot!
-
Upvoting for this option!
-
There are a few ways to do this, though I don't know whether they will help you.
One option is to enable fp16 for softmaxes and layernorms.
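As a rough illustration of pinning softmaxes and layernorms to one dtype, here is a minimal sketch in plain PyTorch. It is not a DeepSpeed API: `FixedDtypeLayerNorm`, `fixed_dtype_softmax`, and the `run_dtype` parameter are hypothetical names made up for this example, and it assumes that disabling autocast locally and casting the inputs by hand is enough for your setup.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class FixedDtypeLayerNorm(nn.LayerNorm):
    """LayerNorm that always computes in `run_dtype` (hypothetical helper)."""

    def __init__(self, *args, run_dtype=torch.bfloat16, **kwargs):
        super().__init__(*args, **kwargs)
        self.run_dtype = run_dtype

    def forward(self, x):
        # Turn autocast off locally so the op is not silently up/downcast.
        with torch.autocast(device_type=x.device.type, enabled=False):
            # Cast parameters per call, in case the module was cast elsewhere.
            weight = None if self.weight is None else self.weight.to(self.run_dtype)
            bias = None if self.bias is None else self.bias.to(self.run_dtype)
            return F.layer_norm(
                x.to(self.run_dtype), self.normalized_shape, weight, bias, self.eps
            )


def fixed_dtype_softmax(x, dim=-1, run_dtype=torch.bfloat16):
    """Softmax that always computes in `run_dtype` (hypothetical helper)."""
    with torch.autocast(device_type=x.device.type, enabled=False):
        return F.softmax(x.to(run_dtype), dim=dim)
```

Pass `run_dtype=torch.float16` for the fp16 variant. Whether DeepSpeed's own casting leaves these ops alone is not confirmed in this thread; the per-call casts are there so the op still runs in `run_dtype` even if the module's parameters end up in a different dtype.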
Another option is a custom cast function, which can return True to prevent the cast or False to allow it.
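To make the True/False convention concrete, here is a small sketch of such a predicate. The hook it would plug into is not named in this thread, so `prevent_cast`, `KEEP_DTYPE_CLASS_NAMES`, and the name-matching rule are all assumptions made for illustration only.

```python
from typing import Iterable

# Hypothetical list: class-name fragments of modules whose dtype should be kept.
KEEP_DTYPE_CLASS_NAMES = ("LayerNorm", "Softmax")


def prevent_cast(module: object, keep: Iterable[str] = KEEP_DTYPE_CLASS_NAMES) -> bool:
    """Return True to prevent the cast for `module`, False to allow it."""
    class_name = type(module).__name__
    return any(fragment in class_name for fragment in keep)
```

A caller would then cast only the modules for which `prevent_cast(module)` is False, for example by filtering the modules of a model before converting them.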
-
What's the cleanest way to manually prevent up/downcasting for certain operations with DeepSpeed fp16/bfloat16 mixed precision enabled? I'm basically looking for something equivalent to:
Ultimately, I want softmaxes and layernorms to be forced to run in bfloat16. If there's an even cleaner global way to do that, that would also be nice.