mdmmt

MDMMT: Multidomain Multimodal Transformer for Video Retrieval

Stars

27

Forks

3

Language

Python

Last Updated

Dec 29, 2023

Similar Repos

Repo	Language	Stars	Description	Updated At
multimodal_vtt	Python	66	Joint Embedding with Multimodal Cues for Cross-Modal Video-Text Retrieval	Jul 30, 2022
GRTr	Python	22	Generative Retrieval Transformer	May 22, 2022
video-retrieval-sampler	Python	7	The official implementation for the paper 'mmSampler: Efficient Frame Sampler for Multimodal Video Retrieval'.	Apr 27, 2023
Multimodal-Transformer	Python	532	[ACL'19] [PyTorch] Multimodal Transformer	Aug 16, 2022
MMVD	None	3	Multimodal Video Description	Jan 03, 2020
MulT	Python	13	[Reproduce] Code for the ACL2019 paper "Multimodal Transformer for Unaligned Multimodal Language Sequences".	Jul 18, 2022
MAN	Python	26	Multimodal Adversarial Network for Cross-modal Retrieval (PyTorch Code)	Nov 19, 2022
HIMT	Python	3	The code for our paper Hierarchical Interactive Multimodal Transformer for Aspect-Based Multimodal Sentiment Analysis	Apr 17, 2023
EMT-DLFR	Python	2	Efficient Multimodal Transformer with Dual-Level Feature Restoration for Robust Multimodal Sentiment Analysis (TAC 2023)	May 16, 2023
zorro-pytorch	Python	54	Implementation of Zorro, Masked Multimodal Transformer, in Pytorch	Mar 19, 2023
METER	Python	208	METER: A Multimodal End-to-end TransformER Framework	Aug 28, 2022
TFood	HTML	21	[CVPRW22] Official Implementation of T-Food: "Transformer Decoders with MultiModal Regularization for Cross-Modal Food Retrieval". Accepted …	May 18, 2023
WMRN	Python	18	Weakly-Supervised Moment Retrieval Network for Video Corpus Moment Retrieval	May 19, 2023
drydrop-sites	JavaScript	2	sites for my multidomain drydrop	Aug 13, 2019
mmFormer	Python	49	[MICCAI 2022] The official code for "mmFormer: Multimodal Medical Transformer for Incomplete Multimodal Learning of …	May 21, 2023
Mr.Right	Python	14	Mr. Right: Multimodal Retrieval on Representation of ImaGe witH Text	Mar 13, 2023
SDML	Python	32	Scalable deep multimodal learning for cross-modal retrieval (SIGIR 2019, PyTorch Code)	Apr 02, 2023
SDML	None	2	Scalable deep multimodal learning for cross-modal retrieval (SIGIR 2019, PyTorch Code)	Feb 06, 2023
MFT	Jupyter Notebook	20	Pytorch implementation of Multimodal Fusion Transformer for Remote Sensing Image Classification.	Aug 11, 2022
video-swin-transformer-pytorch	Python	88	Video Swin Transformer - PyTorch	Aug 22, 2022
VGT	Python	10	Video Graph Transformer for Video Question Answering (ECCV'22)	Sep 06, 2022
demix-data	Python	16	Benchmark API for Multidomain Language Modeling	Feb 21, 2023
dmformer	Python	2	Decoupled Multimodal Transformers (DMFormer) for Referring Video Object Segmentation	Jun 26, 2023
R2Former	None	27	Official repository for R2Former: Unified Retrieval and Reranking Transformer for Place Recognition	Jul 17, 2023
VIVA	Python	2	VIsual Information Retrieval in Video Archives	Dec 16, 2021
DRL	Python	42	[arXiv22] Disentangled Representation Learning for Text-Video Retrieval	Sep 07, 2022
collaborative-experts	Python	285	Video embeddings for retrieval with natural language queries	Apr 10, 2023
Cross-Modal-Adapter	None	34	[arXiv] Cross-Modal Adapter for Text-Video Retrieval	Apr 29, 2023
CLIP_Video_Representation	Jupyter Notebook	56	Use CLIP to represent video for Retrieval Task	Mar 19, 2023
video-retrieval	None	10	Deep Learning for Video Retrieval by Natural Language	Nov 20, 2021
MTVM	C++	17	[ECCV 2022] Multimodal Transformer with Variable-length Memory for Vision-and-Language Navigation	Feb 17, 2023
CenterCLIP	Python	100	[SIGIR 2022] CenterCLIP: Token Clustering for Efficient Text-Video Retrieval. Also, a text-video retrieval toolbox based …	Apr 27, 2023
redaxo_yrewrite	PHP	56	A multidomain URL rewrite engine for REDAXO	Mar 22, 2023
multiglossar	PHP	26	Glossar, auch für Multidomain Sites	May 18, 2022
MultiWE	Python	4	Learning multimodal word embeddings from youtube video data	Sep 15, 2019
Dialogue-to-Video-Retrieval	Python	2	Code for ECIR 2023 paper "Dialogue-to-Video Retrieval"	Jul 07, 2023
FGT	Jupyter Notebook	186	[ECCV 2022] Flow-Guided Transformer for Video Inpainting	Apr 29, 2023
MCAT	Jupyter Notebook	49	Multimodal Co-Attention Transformer for Survival Prediction in Gigapixel Whole Slide Images - ICCV 2021	Aug 09, 2022
VFIformer	Python	42	Video Frame Interpolation with Transformer (CVPR2022)	Aug 12, 2022
transformer-uvos	Python	8	Unsupervised Video Object Segmentation using Transformer	May 04, 2022
bio-reader	None	3	BioReader: a Retrieval-Enhanced Text-to-Text Transformer for Biomedical Literature [EMNLP 2022]	Jan 02, 2024
sysadmin	C#	3	Application for administer multidomain Active Directory-based networks	May 26, 2022
livewire-multidomain	PHP	2	Use Livewire in a multidomain environment	Sep 22, 2022
VLTinT	Jupyter Notebook	36	[AAAI 2023 Oral] VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning	Apr 25, 2023
VideoMoji	Python	2	Multimodal (Video, Audio, Voice) human emotion & engagement detection app	Jul 24, 2021
LAFF	Python	27	Source code of ECCV2022 LAFF for Text-to-Video Retrieval	Apr 03, 2023
sea	Python	6	SEA: Sentence Encoder Assembly for Video Retrieval by Textual Queries	Sep 13, 2022
DKPH	Python	6	Dual-Stream Knowledge-Preserving Hashing for Unsupervised Video Retrieval ECCV22	Oct 15, 2023
mt-captioning	Python	23	A PyTorch implementation of the paper Multimodal Transformer with Multiview Visual Representation for Image Captioning	Apr 20, 2023
hive.ndvr	Java	2	near duplicate video retrieval and detection with Hive	Jan 03, 2020