Stars
8
Forks
1
Language
Jupyter Notebook
Last Updated
Jan 15, 2024
Similar Repos
Repo | Language | Stars | Description | Updated At |
---|---|---|---|---|
HTML | 21 | [CVPRW22] Official Implementation of T-Food: "Transformer Decoders with MultiModal Regularization for Cross-Modal Food Retrieval". Accepted … | May 18, 2023 | |
MATLAB | 4 | Cross-Modal-Hashing-Retrieval/Multi-Modal-Hashing-Retrieval | May 23, 2022 | |
Python | 20 | A Comprehensive Empirical Study of Vision-Language Pre-trained Model for Supervised Cross-Modal Retrieval | Apr 11, 2022 | |
Jupyter Notebook | 32 | [BMVC22] Official Implementation of ViCHA: "Efficient Vision-Language Pretraining with Visual Concepts and Hierarchical Alignment" | Apr 20, 2023 | |
None | 14 | Awesome Vision-Language Pretraining Papers | Apr 23, 2023 | |
None | 2 | Official PyTorch implementation of CVPR 2023 MULA Workshop paper "Exposing and Mitigating Spurious Correlations for … | Apr 06, 2023 | |
Python | 2652 | PyTorch original implementation of Cross-lingual Language Model Pretraining. | Aug 09, 2022 | |
Python | 2 | PyTorch original implementation of Cross-lingual Language Model Pretraining. | Feb 19, 2020 | |
Python | 43 | Cross Modal Retrieval with Querybank Normalisation | Apr 10, 2023 | |
None | 11 | Vision-Language Pretraining & Efficient Transformer Papers. | Jun 12, 2022 | |
Jupyter Notebook | 44 | Implementation for "Multilevel Language and Vision Integration for Text-to-Clip Retrieval" | Apr 22, 2023 | |
MATLAB | 5 | Label Embedding Online Hashing for Cross-Modal Retrieval | Jul 25, 2022 | |
None | 34 | [arXiv] Cross-Modal Adapter for Text-Video Retrieval | Apr 29, 2023 | |
MATLAB | 3 | The baselines of cross-modal hashing retrieval. | Jun 09, 2022 | |
Jupyter Notebook | 32 | ACM ICMR 2019《Cross-Modal Video Moment Retrieval with Spatial and Language-Temporal Attention》 | Oct 06, 2022 | |
Python | 8 | Food image / recipe (text) cross-modal representation learning, retrieval and (image) synthesis. Code from ACM-Multimedia … | Jul 11, 2022 | |
Python | 14 | Code for cross-modal image retrieval for SYSU-MM01 | May 05, 2023 | |
Python | 51 | ROSITA: Enhancing Vision-and-Language Semantic Alignments via Cross- and Intra-modal Knowledge Integration | Apr 13, 2023 | |
Python | 248 | deep learning, image retrieval, vision and language | Jul 20, 2022 | |
Python | 26 | Multimodal Adversarial Network for Cross-modal Retrieval (PyTorch Code) | Nov 19, 2022 | |
Python | 63 | [MICCAI-2022] This is the official implementation of Multi-Modal Masked Autoencoders for Medical Vision-and-Language Pre-Training. | Apr 19, 2023 | |
JavaScript | 25 | A demo of a cross-modal retrieval system | May 10, 2023 | |
None | 14 | VLMixer: Unpaired Vision-Language Pre-training via Cross-Modal CutMix (ICML 2022) | Feb 21, 2023 | |
Python | 5 | mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections. (EMNLP 2022) | May 08, 2023 | |
Python | 80 | Evaluating Vision & Language Pretraining Models with Objects, Attributes and Relations. | Apr 16, 2023 | |
None | 9 | Accelerating Vision-Language Pretraining with Free Language Modeling (CVPR 2023) | Apr 06, 2023 | |
Python | 39 | Pytorch code for ICRA'21 paper: "Hierarchical Cross-Modal Agent for Robotics Vision-and-Language Navigation" | Sep 05, 2022 | |
Python | 56 | Official Implementation of "A Hierarchical Network for Abstractive Meeting Summarization with Cross-Domain Pretraining"" | Aug 04, 2022 | |
Jupyter Notebook | 3 | Cross-Modal Data Discovery over Structured and Unstructured Data Lakes | Jun 06, 2023 | |
Python | 122 | Polysemous Visual-Semantic Embedding for Cross-Modal Retrieval (CVPR 2019) | Nov 11, 2022 | |
MATLAB | 2 | A High-Dimensional Sparse Hashing Framework for Cross-Modal Retrieval | Jul 17, 2023 | |
Jupyter Notebook | 23 | Learning Audio–Sheet Music Correspondences for Cross-Modal Retrieval and Piece Identification | Feb 02, 2022 | |
Python | 70 | Official Implementation of ResViT: Residual Vision Transformers for Multi-modal Medical Image Synthesis | May 06, 2023 | |
Python | 19 | Official pytorch implementation of ProtoCLIP in paper Prototypical Contrastive Language Image Pretraining | Jul 27, 2022 | |
Python | 18 | reproduce the results of Adversarial Cross-Modal retrieval (ACMR) | Dec 09, 2021 | |
Python | 122 | Deep Supervised Cross-modal Retrieval (CVPR 2019, PyTorch Code) | Mar 28, 2023 | |
TypeScript | 2 | Cross-language structured log library | Jan 09, 2023 | |
Python | 56 | [CVPR 2019] Connecting Touch and Vision via Cross-Modal Prediction | Jul 22, 2022 | |
Python | 5 | The code of Comprehensive Distance-Preserving Autoencoders for Cross-Modal Retrieval | Apr 07, 2021 | |
Python | 66 | Joint Embedding with Multimodal Cues for Cross-Modal Video-Text Retrieval | Jul 30, 2022 | |
None | 26 | [CVPR 2022 Challenge Rank 1st] The official code for V2L: Leveraging Vision and Vision-language Models … | Apr 18, 2023 | |
Python | 85 | Codes and Models for VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset | May 30, 2023 | |
Jupyter Notebook | 48 | AAAI2020-The official implementation of "Learning Cross-modal Context Graph for Visual Grounding" | Jul 05, 2022 | |
Python | 82 | Cross-Modal Retrieval, triplet loss, Pytorch, Resnet18, Bert, Deep Hashing | Sep 06, 2022 | |
Python | 2 | A large Cross-Modal Video Retrieval Dataset with Reading Comprehension | May 08, 2023 | |
Python | 144 | Official implementation of paper "Cross Modal Transformer: Towards Fast and Robust 3D Object Detection" | May 02, 2023 | |
Python | 3 | Deep Semisupervised Cross-modal Retrieval/Cross-view Recognition (IEEE TCYB 2022, PyTorch Code) | Nov 19, 2022 | |
Python | 24 | Learning Cross-Modal Embeddings with Adversarial Networks for Cooking Recipes and Food Images | Jan 09, 2023 | |
Python | 13 | Multi-view Linear Discriminant Analysis Network for Cross-modal Retrieval and Cross-view Recognition (Keras&Theano Code) | Mar 07, 2023 | |
Python | 3 | ICMR2019- Improving What Cross-Modal Retrieval Models Learn through Object-Oriented Inter- and Intra-Modal Attentiom Networks | Oct 20, 2021 |