You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I saw the compositional approach adding multimodal capabilities to Llama3 in the report, and am curious about the details about the image encoder and adaptor. Can you please provide any of the model config files for vision experiments?
The text was updated successfully, but these errors were encountered:
I saw the compositional approach adding multimodal capabilities to Llama3 in the report, and am curious about the details about the image encoder and adaptor. Can you please provide any of the model config files for vision experiments?
The text was updated successfully, but these errors were encountered: