Stars: 27
Forks: 3
Language: Python
Last Updated: Dec 29, 2023
Similar Repos
Language | Stars | Description | Updated At | |
---|---|---|---|---|
Python | 66 | Joint Embedding with Multimodal Cues for Cross-Modal Video-Text Retrieval | Jul 30, 2022 | |
Python | 22 | Generative Retrieval Transformer | May 22, 2022 | |
Python | 7 | The official implementation for the paper 'mmSampler: Efficient Frame Sampler for Multimodal Video Retrieval'. | Apr 27, 2023 | |
Python | 532 | [ACL'19] [PyTorch] Multimodal Transformer | Aug 16, 2022 | |
None | 3 | Multimodal Video Description | Jan 03, 2020 | |
Python | 13 | [Reproduce] Code for the ACL2019 paper "Multimodal Transformer for Unaligned Multimodal Language Sequences". | Jul 18, 2022 | |
Python | 26 | Multimodal Adversarial Network for Cross-modal Retrieval (PyTorch Code) | Nov 19, 2022 | |
Python | 3 | The code for our paper Hierarchical Interactive Multimodal Transformer for Aspect-Based Multimodal Sentiment Analysis | Apr 17, 2023 | |
Python | 2 | Efficient Multimodal Transformer with Dual-Level Feature Restoration for Robust Multimodal Sentiment Analysis (TAC 2023) | May 16, 2023 | |
Python | 54 | Implementation of Zorro, Masked Multimodal Transformer, in PyTorch | Mar 19, 2023 | |
Python | 208 | METER: A Multimodal End-to-end TransformER Framework | Aug 28, 2022 | |
HTML | 21 | [CVPRW22] Official Implementation of T-Food: "Transformer Decoders with MultiModal Regularization for Cross-Modal Food Retrieval". Accepted … | May 18, 2023 | |
Python | 18 | Weakly-Supervised Moment Retrieval Network for Video Corpus Moment Retrieval | May 19, 2023 | |
JavaScript | 2 | sites for my multidomain drydrop | Aug 13, 2019 | |
Python | 49 | [MICCAI 2022] The official code for "mmFormer: Multimodal Medical Transformer for Incomplete Multimodal Learning of … | May 21, 2023 | |
Python | 14 | Mr. Right: Multimodal Retrieval on Representation of ImaGe witH Text | Mar 13, 2023 | |
Python | 32 | Scalable deep multimodal learning for cross-modal retrieval (SIGIR 2019, PyTorch Code) | Apr 02, 2023 | |
None | 2 | Scalable deep multimodal learning for cross-modal retrieval (SIGIR 2019, PyTorch Code) | Feb 06, 2023 | |
Jupyter Notebook | 20 | PyTorch implementation of Multimodal Fusion Transformer for Remote Sensing Image Classification. | Aug 11, 2022 | |
Python | 88 | Video Swin Transformer - PyTorch | Aug 22, 2022 | |
Python | 10 | Video Graph Transformer for Video Question Answering (ECCV'22) | Sep 06, 2022 | |
Python | 16 | Benchmark API for Multidomain Language Modeling | Feb 21, 2023 | |
Python | 2 | Decoupled Multimodal Transformers (DMFormer) for Referring Video Object Segmentation | Jun 26, 2023 | |
None | 27 | Official repository for R2Former: Unified Retrieval and Reranking Transformer for Place Recognition | Jul 17, 2023 | |
Python | 2 | VIsual Information Retrieval in Video Archives | Dec 16, 2021 | |
Python | 42 | [arXiv22] Disentangled Representation Learning for Text-Video Retrieval | Sep 07, 2022 | |
Python | 285 | Video embeddings for retrieval with natural language queries | Apr 10, 2023 | |
None | 34 | [arXiv] Cross-Modal Adapter for Text-Video Retrieval | Apr 29, 2023 | |
Jupyter Notebook | 56 | Use CLIP to represent video for Retrieval Task | Mar 19, 2023 | |
None | 10 | Deep Learning for Video Retrieval by Natural Language | Nov 20, 2021 | |
C++ | 17 | [ECCV 2022] Multimodal Transformer with Variable-length Memory for Vision-and-Language Navigation | Feb 17, 2023 | |
Python | 100 | [SIGIR 2022] CenterCLIP: Token Clustering for Efficient Text-Video Retrieval. Also, a text-video retrieval toolbox based … | Apr 27, 2023 | |
PHP | 56 | A multidomain URL rewrite engine for REDAXO | Mar 22, 2023 | |
PHP | 26 | Glossary, also for multidomain sites | May 18, 2022 | |
Python | 4 | Learning multimodal word embeddings from youtube video data | Sep 15, 2019 | |
Python | 2 | Code for ECIR 2023 paper "Dialogue-to-Video Retrieval" | Jul 07, 2023 | |
Jupyter Notebook | 186 | [ECCV 2022] Flow-Guided Transformer for Video Inpainting | Apr 29, 2023 | |
Jupyter Notebook | 49 | Multimodal Co-Attention Transformer for Survival Prediction in Gigapixel Whole Slide Images - ICCV 2021 | Aug 09, 2022 | |
Python | 42 | Video Frame Interpolation with Transformer (CVPR2022) | Aug 12, 2022 | |
Python | 8 | Unsupervised Video Object Segmentation using Transformer | May 04, 2022 | |
None | 3 | BioReader: a Retrieval-Enhanced Text-to-Text Transformer for Biomedical Literature [EMNLP 2022] | Jan 02, 2024 | |
C# | 3 | Application for administer multidomain Active Directory-based networks | May 26, 2022 | |
PHP | 2 | Use Livewire in a multidomain environment | Sep 22, 2022 | |
Jupyter Notebook | 36 | [AAAI 2023 Oral] VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning | Apr 25, 2023 | |
Python | 2 | Multimodal (Video, Audio, Voice) human emotion & engagement detection app | Jul 24, 2021 | |
Python | 27 | Source code of ECCV2022 LAFF for Text-to-Video Retrieval | Apr 03, 2023 | |
Python | 6 | SEA: Sentence Encoder Assembly for Video Retrieval by Textual Queries | Sep 13, 2022 | |
Python | 6 | Dual-Stream Knowledge-Preserving Hashing for Unsupervised Video Retrieval ECCV22 | Oct 15, 2023 | |
Python | 23 | A PyTorch implementation of the paper Multimodal Transformer with Multiview Visual Representation for Image Captioning | Apr 20, 2023 | |
Java | 2 | near duplicate video retrieval and detection with Hive | Jan 03, 2020 |