Other Keywords:
- voice-face cross-modal biometric matching
- voice-face representation learning
- voice-face cross-modal mapping
Please raise an issue if anything is incorrect or missing. :-)

Abbr. | Title | Year | Conf | Code |
---|---|---|---|---|
SVHF | Seeing voices and hearing faces: Cross-modal biometric matching | 2018 | CVPR | code, model |
FVCME | Face-voice matching using cross-modal embeddings | 2018 | MM | ❎ |
Pins | Learnable pins: Crossmodal embeddings for person identity | 2018 | ECCV | official, pytorch |
LAFV | On learning associations of faces and voices | 2018 | ACCV | ❎ |
SSNet | Deep Latent Space Learning for Cross-modal Mapping of Audio and Visual Signals | 2019 | DICTA | ❎ |
DIMNet | Disjoint mapping network for cross-modal matching of voices and faces | 2019 | ICLR | ❎ |
EmNet | A Novel Distance Learning for Elastic Cross-Modal Audio-Visual Matching | 2019 | ICME-Workshop | ❎ |
VFMR | Voice-Face Cross-modal Matching and Retrieval: A Benchmark | 2019 | - | ❎ |
| | Learning Discriminative Joint Embeddings for Efficient Face and Voice Association | 2020 | SIGIR | ❎ |
| | Hearing like Seeing: Improving Voice-Face Interactions and Associations via Adversarial Deep Semantic Matching Network | 2020 | MM | ❎ |
VFNet | Audio-visual Speaker Recognition with a Cross-modal Discriminative Network | 2020 | Interspeech | ❎ |
AML | Adversarial-Metric Learning for Audio-Visual Cross-Modal Matching | 2021 | TMM | official, copy |
| | Seeking the Shape of Sound: An Adaptive Framework for Learning Voice-Face Association | 2021 | CVPR | code |
| | Cross-modal Speaker Verification and Recognition: A Multilingual Perspective | 2021 | CVPR-Workshop | ❎ |
| | Disentangled Representation Learning for Cross-Modal Biometric Matching | 2021 | TMM | ❎ |
FOP | Fusion and Orthogonal Projection for Improved Face-Voice Association | 2022 | ICASSP | code |
Self-Lifting | Self-Lifting: A Novel Framework for Unsupervised Voice-Face Association Learning | 2022 | ICMR | code |
CMPC | Unsupervised Voice-Face Representation Learning by Cross-Modal Prototype Contrast | 2022 | IJCAI | code |
| | Detach and Enhance: Learning Disentangled Cross-modal Latent Representation for Efficient Face-Voice Association and Matching | 2022 | ICDM | ❎ |
| | Looking and Hearing into Details: Dual-enhanced Siamese Adversarial Network for Audio-Visual Matching | 2022 | TMM | ❎ |
SBNet | Single-branch Network for Multimodal Training | 2023 | ICASSP | code |

https://github.com/my-yy/vfal-eva
- Reproduces a number of existing works under unified standards 😃
- High-speed training and testing ⚡
- Easy to extend 💭