multimodal_vtt

Joint Embedding with Multimodal Cues for Cross-Modal Video-Text Retrieval

Stars

66

Forks

16

Language

Python

Last Updated

Jul 30, 2022

Similar Repos