VALOR

Codes and Models for VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset

Stars: 214
Forks: 15
Language: Python
Last Updated: Feb 29, 2024

Similar Repos

Repo	Language	Stars	Description	Updated At
VAST	None	2	Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset	May 30, 2023
VL-CheckList	Python	80	Evaluating Vision & Language Pretraining Models with Objects, Attributes and Relations.	Apr 16, 2023
CLAP	Python	12	Contrastive Language-Audio Pretraining	May 06, 2022
CLAP	Python	25	Contrastive Language-Audio Pretraining	Jun 14, 2022
CLAP	Python	2	Contrastive Language-Audio Pretraining	Apr 10, 2023
awesome-vision-language-pretraining	None	14	Awesome Vision-Language Pretraining Papers	Apr 23, 2023
Awesome-VLP-and-Efficient-Transformer	None	11	Vision-Language Pretraining & Efficient Transformer Papers.	Jun 12, 2022
FLM	None	9	Accelerating Vision-Language Pretraining with Free Language Modeling (CVPR 2023)	Apr 06, 2023
BriVL	Python	180	Bridging Vision and Language Model	Jul 06, 2022
audio-dataset	Python	9	Audio Dataset for training CLAP and other models	Jun 19, 2022
AdverseDrive	Jupyter Notebook	22	Attacking Vision based Perception in End-to-end Autonomous Driving Models	May 15, 2022
XLM	Python	2652	PyTorch original implementation of Cross-lingual Language Model Pretraining.	Aug 09, 2022
XLM	Python	2	PyTorch original implementation of Cross-lingual Language Model Pretraining.	Feb 19, 2020
TinyViT-model-zoo	None	2	The model zoo of TinyViT: Fast Pretraining Distillation for Small Vision Transformers	Jul 22, 2022
VLPCook	Jupyter Notebook	2	Official implementation of VLPCook: Vision and Structured-Language Pretraining for Cross-Modal Food Retrieval	Apr 24, 2023
slovo	Python	13	Slovo: Russian Sign Language Dataset and Models	May 27, 2023
slovo	Python	2	Slovo: Russian Sign Language Dataset and Models	Jul 23, 2023
pretraining-with-human-feedback	Python	105	Code accompanying the paper Pretraining Language Models with Human Preferences	May 17, 2023
Pretraining_T5_custom_dataset	Python	15	Continue Pretraining T5 on custom dataset based on available pretrained model checkpoints	Sep 16, 2022
flamingo-mini	Python	108	Implementation of the deepmind Flamingo vision-language model, based on Hugging Face language models and ready …	May 04, 2023
ViCHA	Jupyter Notebook	32	[BMVC22] Official Implementation of ViCHA: "Efficient Vision-Language Pretraining with Visual Concepts and Hierarchical Alignment"	Apr 20, 2023
FLM	Python	9	All-in-one repository for Fine-tuning & Pretraining (Large) Language Models	Apr 11, 2023
COCO-LM	Python	87	[NeurIPS 2021] COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining	Aug 02, 2022
BioBART	Python	24	BioBART: Pretraining and Evaluation of A Biomedical Generative Language Model [ACL-BioNLP 2022]	Oct 09, 2022
olm-datasets	Python	137	Pipeline for pulling and processing online language model pretraining data from the web	Apr 23, 2023
baichuan-7B	Python	3447	A large-scale 7B pretraining language model developed by BaiChuan-Inc.	Jun 19, 2023
Deepfake_Model_Attribution	Python	11	Codes and Dataset (DFDM) for Face-swap Deepfakes Model Attribution	May 24, 2023
wip-lambada-lm	Python	9	LSTM language model on LAMBADA dataset	Jul 26, 2022
awesome-vision-language-pretraining-papers	None	904	Recent Advances in Vision and Language PreTrained Models (VL-PTMs)	Sep 09, 2022
awesome-vision-language-pretraining-papers	None	2	Recent Advances in Vision and Language PreTrained Models (VL-PTMs)	Jul 25, 2022
language-and-perception	TeX	2	Situated Language and Perception Research Group	Apr 27, 2024
nlxgpt	Python	31	NLX-GPT: A Model for Natural Language Explanations in Vision and Vision-Language Tasks, CVPR 2022 (Oral)	Apr 26, 2023
vlm_lexical_grounding	Python	11	PyTorch code for the Findings of EMNLP 2021 paper "Does Vision-and-Language Pretraining Improve Lexical Grounding?"	Apr 20, 2023
meta_XLM	Jupyter Notebook	15	Cross-lingual Language Model (XLM) pretraining and Model-Agnostic Meta-Learning (MAML) for fast adaptation of deep networks	Jun 06, 2022
deepstruct	Python	55	Code repo for ACL22 paper "DeepStruct: Pretraining of Language Models for Structure Prediction"	May 08, 2023
coco-lm-pytorch	Python	44	Implementation of COCO-LM, Correcting and Contrasting Text Sequences for Language Model Pretraining, in Pytorch	Jul 28, 2022
ONE-PEACE	Python	32	A general representation model across vision, audio, language modalities.	May 19, 2023
up-to-date-Vision-Language-Models	None	11	Up-to-date Vision Language Models collection. Mainly focus on computer vision	Apr 25, 2023
MiniGPT-4	Python	14166	MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models	Apr 24, 2023
MiniGPT-4	None	3	MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models	Jul 20, 2023
MiniGPT-4	Python	2	MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models	Jul 08, 2023
MiniGPT-4	Python	2	MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models	Dec 12, 2023
mocop	Python	2	Code and models for Molecule-Morphology Contrastive Pretraining (MoCoP)	May 29, 2023
lmd	Python	6	Language Model Decomposition: Quantifying the Dependency and Correlation of Language Models	Apr 19, 2023
Awesome-Foundation-Models	None	4	A curated list of foundation models for vision and language tasks	Apr 09, 2023
CLIP4MC	Python	6	An RL-Friendly Vision-Language Model for Minecraft	Apr 21, 2023
AutoDRIVE-Nigel-Dataset	None	2	Dynamics and Perception Dataset of AutoDRIVE Ecosystem's "Nigel" Vehicle	Jun 07, 2023
train-bert-from-scratch-on-sagemaker	Jupyter Notebook	6	Pretraining a large language model from scratch with your own custom domain data and Amazon …	Mar 06, 2023
TaskRes	Python	22	Task Residual for Tuning Vision-Language Models (CVPR 2023)	Apr 17, 2023
deep-learning-models	Python	134	Natural language processing & computer vision models optimized for AWS	Jul 06, 2022