Stars
214
Forks
15
Language
Python
Last Updated
Feb 29, 2024
Similar Repos
Repo | Language | Stars | Description | Updated At |
---|---|---|---|---|
None | 2 | Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset | May 30, 2023 | |
Python | 80 | Evaluating Vision & Language Pretraining Models with Objects, Attributes and Relations. | Apr 16, 2023 | |
Python | 12 | Contrastive Language-Audio Pretraining | May 06, 2022 | |
Python | 25 | Contrastive Language-Audio Pretraining | Jun 14, 2022 | |
Python | 2 | Contrastive Language-Audio Pretraining | Apr 10, 2023 | |
None | 14 | Awesome Vision-Language Pretraining Papers | Apr 23, 2023 | |
None | 11 | Vision-Language Pretraining & Efficient Transformer Papers. | Jun 12, 2022 | |
None | 9 | Accelerating Vision-Language Pretraining with Free Language Modeling (CVPR 2023) | Apr 06, 2023 | |
Python | 180 | Bridging Vision and Language Model | Jul 06, 2022 | |
Python | 9 | Audio Dataset for training CLAP and other models | Jun 19, 2022 | |
Jupyter Notebook | 22 | Attacking Vision based Perception in End-to-end Autonomous Driving Models | May 15, 2022 | |
Python | 2652 | PyTorch original implementation of Cross-lingual Language Model Pretraining. | Aug 09, 2022 | |
Python | 2 | PyTorch original implementation of Cross-lingual Language Model Pretraining. | Feb 19, 2020 | |
None | 2 | The model zoo of TinyViT: Fast Pretraining Distillation for Small Vision Transformers | Jul 22, 2022 | |
Jupyter Notebook | 2 | Official implementation of VLPCook: Vision and Structured-Language Pretraining for Cross-Modal Food Retrieval | Apr 24, 2023 | |
Python | 13 | Slovo: Russian Sign Language Dataset and Models | May 27, 2023 | |
Python | 2 | Slovo: Russian Sign Language Dataset and Models | Jul 23, 2023 | |
Python | 105 | Code accompanying the paper Pretraining Language Models with Human Preferences | May 17, 2023 | |
Python | 15 | Continue Pretraining T5 on custom dataset based on available pretrained model checkpoints | Sep 16, 2022 | |
Python | 108 | Implementation of the deepmind Flamingo vision-language model, based on Hugging Face language models and ready … | May 04, 2023 | |
Jupyter Notebook | 32 | [BMVC22] Official Implementation of ViCHA: "Efficient Vision-Language Pretraining with Visual Concepts and Hierarchical Alignment" | Apr 20, 2023 | |
Python | 9 | All-in-one repository for Fine-tuning & Pretraining (Large) Language Models | Apr 11, 2023 | |
Python | 87 | [NeurIPS 2021] COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining | Aug 02, 2022 | |
Python | 24 | BioBART: Pretraining and Evaluation of A Biomedical Generative Language Model [ACL-BioNLP 2022] | Oct 09, 2022 | |
Python | 137 | Pipeline for pulling and processing online language model pretraining data from the web | Apr 23, 2023 | |
Python | 3447 | A large-scale 7B pretraining language model developed by BaiChuan-Inc. | Jun 19, 2023 | |
Python | 11 | Codes and Dataset (DFDM) for Face-swap Deepfakes Model Attribution | May 24, 2023 | |
Python | 9 | LSTM language model on LAMBADA dataset | Jul 26, 2022 | |
None | 904 | Recent Advances in Vision and Language PreTrained Models (VL-PTMs) | Sep 09, 2022 | |
None | 2 | Recent Advances in Vision and Language PreTrained Models (VL-PTMs) | Jul 25, 2022 | |
TeX | 2 | Situated Language and Perception Research Group | Apr 27, 2024 | |
Python | 31 | NLX-GPT: A Model for Natural Language Explanations in Vision and Vision-Language Tasks, CVPR 2022 (Oral) | Apr 26, 2023 | |
Python | 11 | PyTorch code for the Findings of EMNLP 2021 paper "Does Vision-and-Language Pretraining Improve Lexical Grounding?" | Apr 20, 2023 | |
Jupyter Notebook | 15 | Cross-lingual Language Model (XLM) pretraining and Model-Agnostic Meta-Learning (MAML) for fast adaptation of deep networks | Jun 06, 2022 | |
Python | 55 | Code repo for ACL22 paper "DeepStruct: Pretraining of Language Models for Structure Prediction" | May 08, 2023 | |
Python | 44 | Implementation of COCO-LM, Correcting and Contrasting Text Sequences for Language Model Pretraining, in Pytorch | Jul 28, 2022 | |
Python | 32 | A general representation modal across vision, audio, language modalities. | May 19, 2023 | |
None | 11 | Up-to-date Vision Language Models collection. Mainly focus on computer vision | Apr 25, 2023 | |
Python | 14166 | MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models | Apr 24, 2023 | |
None | 3 | MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models | Jul 20, 2023 | |
Python | 2 | MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models | Jul 08, 2023 | |
Python | 2 | MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models | Dec 12, 2023 | |
Python | 2 | Code and models for Molecule-Morphology Contrastive Pretraining (MoCoP) | May 29, 2023 | |
Python | 6 | Language Model Decomposition: Quantifying the Dependency and Correlation of Language Models | Apr 19, 2023 | |
None | 4 | A curated list of foundation models for vision and language tasks | Apr 09, 2023 | |
Python | 6 | An RL-Friendly Vision-Language Model for Minecraft | Apr 21, 2023 | |
None | 2 | Dynamics and Perception Dataset of AutoDRIVE Ecosystem's "Nigel" Vehicle | Jun 07, 2023 | |
Jupyter Notebook | 6 | Pretraining a large language model from scratch with your own custom domain data and Amazon … | Mar 06, 2023 | |
Python | 22 | Task Residual for Tuning Vision-Language Models (CVPR 2023) | Apr 17, 2023 | |
Python | 134 | Natural language processing & computer vision models optimized for AWS | Jul 06, 2022 |