Hello, am I right that TE does not currently support ViT? Specifically, I am looking for a ViT that uses 2D RoPE, e.g. Pixtral's ViT.
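To be concrete about what I mean by 2D RoPE, here is a minimal pure-Python sketch. It assumes the common "split the head dimension in half" layout, where one half is rotated by the patch row and the other by the patch column. The function names (`rotate_1d`, `rope_2d`) and the `base` default are my own illustration. This is not Pixtral's or TE's actual code, and Pixtral's exact frequency layout may differ.

```python
import math


def rotate_1d(x, pos, base=10000.0):
    """Standard 1D RoPE on a list of even length: rotate consecutive pairs."""
    d = len(x)
    out = []
    for i in range(0, d, 2):
        # Pair i//2 uses frequency base^(-i/d), as in the usual RoPE formulation.
        theta = pos * base ** (-i / d)
        c, s = math.cos(theta), math.sin(theta)
        out += [x[i] * c - x[i + 1] * s, x[i] * s + x[i + 1] * c]
    return out


def rope_2d(x, row, col, base=10000.0):
    """Assumed layout: first half encodes the patch row, second half the column."""
    half = len(x) // 2
    assert len(x) % 4 == 0, "head dim must be divisible by 4 for this layout"
    return rotate_1d(x[:half], row, base) + rotate_1d(x[half:], col, base)


def dot(a, b):
    return sum(u * v for u, v in zip(a, b))
```

With this layout, the attention score between two patches depends only on their row and column offsets, e.g. `dot(rope_2d(q, 2, 3), rope_2d(k, 5, 1))` matches `dot(rope_2d(q, 0, 2), rope_2d(k, 3, 0))` up to floating-point error.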
I also checked the Nemo codebase, but it seems that they are building their own ViT implementation instead of relying on TE.
Nemo defines `ParallelVisionTransformer`. This class initializes a series of `ParallelVisionTransformerLayer` layers, each a subclass of `ParallelVisionTransformerLayer_`, which is in turn a subclass of `ParallelTransformerLayer_`. That base class defines all the operations in Nemo's implementation.
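The hierarchy as I understand it can be sketched with empty stubs. Only the class names and their inheritance come from my reading of the Nemo code. The bodies, the `num_layers` parameter, and the `forward` signature are placeholders of mine, not Nemo's actual implementation.

```python
class ParallelTransformerLayer_:
    """Base layer; as described above, the actual operations live here."""

    def forward(self, hidden_states):
        # Placeholder: the real class runs attention, MLP, etc.
        return hidden_states


class ParallelVisionTransformerLayer_(ParallelTransformerLayer_):
    """Vision-specific layer built on the base transformer layer."""


class ParallelVisionTransformerLayer(ParallelVisionTransformerLayer_):
    """The layer type that ParallelVisionTransformer instantiates."""


class ParallelVisionTransformer:
    """Holds a stack of ParallelVisionTransformerLayer instances."""

    def __init__(self, num_layers):  # num_layers is a hypothetical parameter
        self.layers = [ParallelVisionTransformerLayer() for _ in range(num_layers)]

    def forward(self, hidden_states):
        # Apply the layers in order.
        for layer in self.layers:
            hidden_states = layer.forward(hidden_states)
        return hidden_states
```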
Thanks.