Multi-modal learning toolkit based on PaddlePaddle and PyTorch, supporting applications such as multi-modal classification, cross-modal retrieval, and image captioning.
Updated May 7, 2023 - Python
Source code for AMFMN and the RSITMD dataset
[IJCAI2022] Unsupervised Voice-Face Representation Learning by Cross-Modal Prototype Contrast
Secure and verifiable cross-modal retrieval