Your paper describes three stages of training, the first of which is contrastive learning.
Which token do you use to get the embedding of the text in an image-text pair?
And how do you get the embedding of the image in the image-text pair?
Thanks.