Does the current code supports generation of text + image + image ? #25
Answered
by
lucidrains
Wonder1905
asked this question in
Q&A
-
Hi, does the current code supports generation of text + image + image ? as see in the paper? where they are predicting text image tokens and then again text tokens |
Beta Was this translation helpful? Give feedback.
Answered by
lucidrains
Nov 24, 2024
Replies: 1 comment 3 replies
-
@Wonder1905 it can do anything, any number of modalities, any order the sky is the limit |
Beta Was this translation helpful? Give feedback.
3 replies
Answer selected by
lucidrains
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
@Wonder1905 it can do anything, any number of modalities, any order
the sky is the limit