Stars
34
Forks
4
Language
None
Last Updated
Jun 19, 2022
Similar Repos
Repo | Language | Stars | Description | Updated At |
---|---|---|---|---|
Python | 1060 | A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models | Oct 16, 2022 | |
Python | 3 | A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models | Apr 20, 2021 | |
None | 2 | A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models | Nov 05, 2021 | |
None | 2 | A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models | Nov 15, 2023 | |
Python | 2 | Large scale web corpus of Austronesian text. | Jan 17, 2022 | |
None | 673 | Large-scale Pre-training Corpus for Chinese 100G 中文预训练语料 | Apr 28, 2023 | |
None | 2 | A Large-scale Vietnamese News Text Classification Corpus | Dec 12, 2023 | |
Jupyter Notebook | 2 | Mini-Luotuo: A Diverse Herd of Distilled Chinese Models from Large-Scale Instructions | May 20, 2023 | |
Julia | 86 | Large scale Gaussian Mixture Models | May 21, 2023 | |
None | 5 | Chinese Mandarin Ngrams Counts from large-scale corpora | Aug 29, 2022 | |
None | 12 | A Large-Scale Chinese Legal Case Retrieval Dataset | May 10, 2023 | |
None | 7202 | 大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP | Aug 12, 2022 | |
None | 18 | 大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP | Jul 28, 2022 | |
Python | 53 | T2Ranking: A large-scale Chinese benchmark for passage ranking. | May 20, 2023 | |
Python | 68 | Evaluation suite for large-scale language models. | Jul 07, 2022 | |
Python | 1903 | GPT2 for Chinese chitchat/用于中文闲聊的GPT2模型(实现了DialoGPT的MMI思想) | Aug 13, 2022 | |
Python | 15 | GPT2 for Chinese chitchat/用于中文闲聊的GPT2模型(实现了DialoGPT的MMI思想) | Mar 21, 2023 | |
Python | 2 | GPT2 for Chinese chitchat/用于中文闲聊的GPT2模型(实现了DialoGPT的MMI思想) | Nov 22, 2022 | |
Python | 446 | GitHub Typo Corpus: A Large-Scale Multilingual Dataset of Misspellings and Grammatical Errors | Aug 31, 2022 | |
Python | 24 | Large scale unannotated Korean corpus for unsupervised tasks. (e.g. Language modeling) | Jun 21, 2022 | |
Python | 26 | ReCO: A Large Scale Chinese Reading Comprehension Dataset on Opinion | Jun 20, 2022 | |
Python | 433 | A Large-Scale Chinese Cross-Domain Task-Oriented Dialogue Dataset | Oct 15, 2022 | |
Python | 334 | Large-scale, Informative, and Diverse Multi-round Chat Data (and Models) | Apr 24, 2023 | |
Python | 188 | EVA: Large-scale Pre-trained Chit-Chat Models | Oct 16, 2022 | |
Python | 156 | Exploring Visual Prompts for Adapting Large-Scale Models | Apr 29, 2023 | |
Python | 161 | Large-scale pretrained models for goal-directed dialog | Aug 08, 2022 | |
Python | 364 | A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation | Aug 02, 2022 | |
None | 54 | A large-scale complex question answering evaluation of ChatGPT and similar large-language models | Mar 21, 2023 | |
None | 3 | A large-scale complex question answering evaluation of ChatGPT and similar large-language models | May 07, 2023 | |
Python | 250 | CoVoST: A Large-Scale Multilingual Speech-To-Text Translation Corpus (CC0 Licensed) | Jul 28, 2022 | |
Python | 4 | A large labeled corpus for Application Privacy Policy in Chinese to train named entity recognition … | May 18, 2023 | |
Python | 755 | A Python library for creating and simulating large-scale brain models | May 14, 2023 | |
Python | 9 | FlagAI (Fast LArge-scale General AI models) is a fast, easy-to-use and extensible toolkit for large-scale … | Apr 24, 2023 | |
Python | 1243 | FlagAI (Fast LArge-scale General AI models) is a fast, easy-to-use and extensible toolkit for large-scale … | May 10, 2023 | |
Python | 1633 | Language Understanding Evaluation benchmark for Chinese: datasets, baselines, pre-trained models,corpus and leaderboard | Sep 12, 2022 | |
Python | 2 | Language Understanding Evaluation benchmark for Chinese: datasets, baselines, pre-trained models,corpus and leaderboard | Feb 26, 2023 | |
Scala | 4 | Training Large Scale Statistical Machine Translation Models on Spark | Aug 18, 2019 | |
Python | 3 | Training Large-scale Text Embedding Models with 🤗 Transformers | Aug 26, 2023 | |
Python | 52 | Script to interact with the DialoGPT models. | Mar 11, 2023 | |
None | 3 | Rethinking RGB-D Salient Object Detection: Models, Datasets, and Large-Scale Benchmarks | Feb 19, 2021 | |
Python | 13 | Title2Event: Benchmarking Open Event Extraction with a Large-scale Chinese Title Dataset | May 08, 2023 | |
Python | 53 | Yet Another Chinese Learner Corpus | Apr 13, 2023 | |
Python | 39 | Corpus creator for Chinese Wikipedia | Apr 25, 2022 | |
Python | 715 | Collections of Chinese NLP corpus | Oct 10, 2022 | |
None | 5 | Collections of Chinese NLP corpus | Sep 05, 2022 | |
Python | 96 | Traditional Machine Learning Models for Large-Scale Datasets in PyTorch. | Apr 05, 2023 | |
C | 2 | EnKF code for DA with large-scale layered geophysical models. | Jan 06, 2019 | |
Python | 41 | Code for text augmentation method leveraging large-scale language models | Aug 17, 2022 | |
C | 32 | EnKF code for DA with large-scale layered geophysical models. | May 02, 2023 | |
Python | 99 | PIE: A Large-Scale Dataset and Models for Pedestrian Intention Estimation and Trajectory Prediction | Nov 18, 2022 |