Stars: 30
Forks: 4
Language: Jupyter Notebook
Last Updated: Jan 23, 2024
Similar Repos
Language | Stars | Description | Updated At |
---|---|---|---|
Python | 1687 | Large-scale pretraining for dialogue | Aug 07, 2022 | |
Python | 3 | E2S2: Encoding-enhanced sequence-to-sequence pretraining for language understanding and generation | Apr 12, 2023 | |
Python | 3447 | A large-scale 7B pretraining language model developed by BaiChuan-Inc. | Jun 19, 2023 | |
Python | 4 | XLNet: Generalized Autoregressive Pretraining for Language Understanding | Nov 29, 2020 | |
Python | 3 | XLNet: Generalized Autoregressive Pretraining for Language Understanding | Sep 26, 2019 | |
Python | 25 | XLNet: Generalized Autoregressive Pretraining for Language Understanding | Jul 27, 2022 | |
Python | 5932 | XLNet: Generalized Autoregressive Pretraining for Language Understanding | Oct 08, 2022 | |
None | 9 | xCodeEval: A Large Scale Multilingual Multitask Benchmark for Code Understanding, Generation, Translation and Retrieval | May 22, 2023 | |
Python | 7 | Large Language Model Text Generation Inference | May 02, 2023 | |
Python | 2 | Large Language Model Text Generation Inference | Sep 18, 2023 | |
Shell | 14 | 🇮🇹 Italian BERT and ELECTRA models (incl. evaluation) | Feb 26, 2023 | |
JavaScript | 5 | [DEPRECATED] PostCSS plugin for writing Italian Stylesheets 🇮🇹 | Apr 26, 2022 | |
None | 6 | A large scale dataset for Image Captioning in Italian | Nov 02, 2022 | |
None | 11 | A large scale dataset for Video Captioning in Italian | May 16, 2023 | |
None | 21 | A large scale dataset for Question Answering in Italian | Sep 15, 2022 | |
Jupyter Notebook | 8 | Masked Auto-Encoding for Large Scale Pretraining of Video Data | Feb 17, 2023 | |
None | 103 | Understanding large language models | Apr 22, 2023 | |
Python | 41 | Code for text augmentation method leveraging large-scale language models | Aug 17, 2022 | |
Python | 84 | Implementation for "Large-scale Pretraining for Visual Dialog" https://arxiv.org/abs/1912.02379 | Jul 07, 2022 | |
Jupyter Notebook | 2 | Natural Language Understanding (Text Summarization) task for autonomous email title generation. | Aug 13, 2020 | |
Lua | 872 | Task generation for testing text understanding and reasoning | Apr 10, 2023 | |
Python | 112 | The human face subset of LAION-400M for large-scale face pretraining. | May 16, 2023 | |
Python | 7 | Large-Scale Scene Understanding Challenge at CVPR 2017 | Sep 07, 2022 | |
Python | 13 | Hephaestus: A large scale multitask dataset towards InSAR understanding | Jun 16, 2022 | |
Python | 9 | All-in-one repository for Fine-tuning & Pretraining (Large) Language Models | Apr 11, 2023 | |
Python | 68 | Evaluation suite for large-scale language models. | Jul 07, 2022 | |
None | 13 | [FG 2021🎈] A small-scale face image dataset with large-scale facial attributes for text-to-face generation and … | Aug 10, 2022 | |
Python | 231 | A Large Scale Text Summarization Dataset | Sep 15, 2022 | |
C++ | 9 | distill large scale web page text | Jul 14, 2023 | |
JavaScript | 16 | Code and Dataset for Memeify: A Large-scale Meme Generation System | Apr 29, 2023 | |
None | 4 | Material for AI Workshop on Natural Language Understanding and Generation | Aug 11, 2022 | |
R | 40 | Materials for RCC workshop, "Large-scale data analysis in R." | Jul 19, 2022 | |
Python | 5 | 🇮🇹 Emoji country flags for language codes and LCID's | Dec 04, 2022 | |
Python | 107 | Finetuning large language models for GDScript generation. | Apr 24, 2023 | |
Python | 1790 | A large-scale face dataset for face parsing, recognition, generation and editing. | Apr 25, 2023 | |
Python | 141 | a Fast, Flexible, Extensible and Easy-to-use NLP Large-scale Pretraining and Multi-task Learning Framework. | Jul 30, 2022 | |
Python | 2 | Large scale web corpus of Austronesian text. | Jan 17, 2022 | |
Python | 87 | [NeurIPS 2021] COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining | Aug 02, 2022 | |
Python | 8 | Generating Word2Vec (Text embeddings) - for large scale heterogenous networks | Sep 22, 2021 | |
Python | 140 | [CVPR 2021] A large-scale face image dataset that allows text-to-image generation, text-guided image manipulation, sketch-to-image … | Mar 31, 2023 | |
Python | 14166 | MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models | Apr 24, 2023 | |
None | 3 | MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models | Jul 20, 2023 | |
Python | 2 | MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models | Jul 08, 2023 | |
Python | 2 | MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models | Dec 12, 2023 | |
Python | 5050 | Official implementations for various pre-training models of ERNIE-family, covering topics of Language Understanding & Generation, … | Aug 08, 2022 | |
Python | 889 | COYO-700M: Large-scale Image-Text Pair Dataset | Apr 23, 2023 | |
None | 2 | A Large-scale Vietnamese News Text Classification Corpus | Dec 12, 2023 | |
None | 6 | Code for paper "PLoG: Table-to-Logic Pretraining for Logical Table-to-Text Generation" | Oct 30, 2022 | |
None | 54 | A large-scale complex question answering evaluation of ChatGPT and similar large-language models | Mar 21, 2023 | |
None | 3 | A large-scale complex question answering evaluation of ChatGPT and similar large-language models | May 07, 2023 |