VLPCook

Official implementation of VLPCook: Vision and Structured-Language Pretraining for Cross-Modal Food Retrieval

Stars

8

Forks

1

Language

Jupyter Notebook

Last Updated

Jan 15, 2024

Similar Repos

Repo	Language	Stars	Description	Updated At
TFood	HTML	21	[CVPRW22] Official Implementation of T-Food: "Transformer Decoders with MultiModal Regularization for Cross-Modal Food Retrieval". Accepted …	May 18, 2023
Hashing-Retrieval	MATLAB	4	Cross-Modal-Hashing-Retrieval/Multi-Modal-Hashing-Retrieval	May 23, 2022
CLIP4CMR	Python	20	A Comprehensive Empirical Study of Vision-Language Pre-trained Model for Supervised Cross-Modal Retrieval	Apr 11, 2022
ViCHA	Jupyter Notebook	32	[BMVC22] Official Implementation of ViCHA: "Efficient Vision-Language Pretraining with Visual Concepts and Hierarchical Alignment"	Apr 20, 2023
awesome-vision-language-pretraining	None	14	Awesome Vision-Language Pretraining Papers	Apr 23, 2023
Spurious_CM_Retrieval	None	2	Official PyTorch implementation of CVPR 2023 MULA Workshop paper "Exposing and Mitigating Spurious Correlations for …	Apr 06, 2023
XLM	Python	2652	PyTorch original implementation of Cross-lingual Language Model Pretraining.	Aug 09, 2022
XLM	Python	2	PyTorch original implementation of Cross-lingual Language Model Pretraining.	Feb 19, 2020
qb-norm	Python	43	Cross Modal Retrieval with Querybank Normalisation	Apr 10, 2023
Awesome-VLP-and-Efficient-Transformer	None	11	Vision-Language Pretraining & Efficient Transformer Papers.	Jun 12, 2022
Text-to-Clip_Retrieval	Jupyter Notebook	44	Implementation for "Multilevel Language and Vision Integration for Text-to-Clip Retrieval"	Apr 22, 2023
LEMON-MM2020	MATLAB	5	Label Embedding Online Hashing for Cross-Modal Retrieval	Jul 25, 2022
Cross-Modal-Adapter	None	34	[arXiv] Cross-Modal Adapter for Text-Video Retrieval	Apr 29, 2023
Cross-Modal-Hashing-Retrieval	MATLAB	3	The baselines of cross-modal hashing retrieval.	Jun 09, 2022
SLTA	Jupyter Notebook	32	ACM ICMR 2019《Cross-Modal Video Moment Retrieval with Spatial and Language-Temporal Attention》	Oct 06, 2022
X-MRS	Python	8	Food image / recipe (text) cross-modal representation learning, retrieval and (image) synthesis. Code from ACM-Multimedia …	Jul 11, 2022
cross-modal-retrieval	Python	14	Code for cross-modal image retrieval for SYSU-MM01	May 05, 2023
rosita	Python	51	ROSITA: Enhancing Vision-and-Language Semantic Alignments via Cross- and Intra-modal Knowledge Integration	Apr 13, 2023
tirg	Python	248	deep learning, image retrieval, vision and language	Jul 20, 2022
MAN	Python	26	Multimodal Adversarial Network for Cross-modal Retrieval (PyTorch Code)	Nov 19, 2022
M3AE	Python	63	[MICCAI-2022] This is the official implementation of Multi-Modal Masked Autoencoders for Medical Vision-and-Language Pre-Training.	Apr 19, 2023
crossModalRetrieval	JavaScript	25	A demo of a cross-modal retrieval system	May 10, 2023
VLMixer	None	14	VLMixer: Unpaired Vision-Language Pre-training via Cross-Modal CutMix (ICML 2022)	Feb 21, 2023
mPLUG	Python	5	mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections. (EMNLP 2022)	May 08, 2023
VL-CheckList	Python	80	Evaluating Vision & Language Pretraining Models with Objects, Attributes and Relations.	Apr 16, 2023
FLM	None	9	Accelerating Vision-Language Pretraining with Free Language Modeling (CVPR 2023)	Apr 06, 2023
robo-vln	Python	39	Pytorch code for ICRA'21 paper: "Hierarchical Cross-Modal Agent for Robotics Vision-and-Language Navigation"	Sep 05, 2022
HMNet	Python	56	Official Implementation of "A Hierarchical Network for Abstractive Meeting Summarization with Cross-Domain Pretraining""	Aug 04, 2022
CMDL	Jupyter Notebook	3	Cross-Modal Data Discovery over Structured and Unstructured Data Lakes	Jun 06, 2023
pvse	Python	122	Polysemous Visual-Semantic Embedding for Cross-Modal Retrieval (CVPR 2019)	Nov 11, 2022
HSCH-TCSVT	MATLAB	2	A High-Dimensional Sparse Hashing Framework for Cross-Modal Retrieval	Jul 17, 2023
audio_sheet_retrieval	Jupyter Notebook	23	Learning Audio–Sheet Music Correspondences for Cross-Modal Retrieval and Piece Identification	Feb 02, 2022
ResViT	Python	70	Official Implementation of ResViT: Residual Vision Transformers for Multi-modal Medical Image Synthesis	May 06, 2023
protoclip	Python	19	Official pytorch implementation of ProtoCLIP in paper Prototypical Contrastive Language Image Pretraining	Jul 27, 2022
ACMR	Python	18	reproduce the results of Adversarial Cross-Modal retrieval (ACMR)	Dec 09, 2021
DSCMR	Python	122	Deep Supervised Cross-modal Retrieval (CVPR 2019, PyTorch Code)	Mar 28, 2023
rootle	TypeScript	2	Cross-language structured log library	Jan 09, 2023
VisGel	Python	56	[CVPR 2019] Connecting Touch and Vision via Cross-Modal Prediction	Jul 22, 2022
Comprehensive-Distance-Preserving-Autoencoders-for-Cross-Modal-Retrieval	Python	5	The code of Comprehensive Distance-Preserving Autoencoders for Cross-Modal Retrieval	Apr 07, 2021
multimodal_vtt	Python	66	Joint Embedding with Multimodal Cues for Cross-Modal Video-Text Retrieval	Jul 30, 2022
V2L	None	26	[CVPR 2022 Challenge Rank 1st] The official code for V2L: Leveraging Vision and Vision-language Models …	Apr 18, 2023
VALOR	Python	85	Codes and Models for VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset	May 30, 2023
LCMCG-PyTorch	Jupyter Notebook	48	AAAI2020-The official implementation of "Learning Cross-modal Context Graph for Visual Grounding"	Jul 05, 2022
Cross-Modal-Retrieval	Python	82	Cross-Modal Retrieval, triplet loss, Pytorch, Resnet18, Bert, Deep Hashing	Sep 06, 2022
TextVR	Python	2	A large Cross-Modal Video Retrieval Dataset with Reading Comprehension	May 08, 2023
CMT	Python	144	Official implementation of paper "Cross Modal Transformer: Towards Fast and Robust 3D Object Detection"	May 02, 2023
ISVN	Python	3	Deep Semisupervised Cross-modal Retrieval/Cross-view Recognition (IEEE TCYB 2022, PyTorch Code)	Nov 19, 2022
ACME	Python	24	Learning Cross-Modal Embeddings with Adversarial Networks for Cooking Recipes and Food Images	Jan 09, 2023
MvLDAN	Python	13	Multi-view Linear Discriminant Analysis Network for Cross-modal Retrieval and Cross-view Recognition (Keras&Theano Code)	Mar 07, 2023
OAN	Python	3	ICMR2019- Improving What Cross-Modal Retrieval Models Learn through Object-Oriented Inter- and Intra-Modal Attentiom Networks	Oct 20, 2021