|
None |
2 |
Cross-language English-Arabic corpus derived from WikiQA |
May 29, 2023 |
|
Jupyter Notebook |
2 |
The official repository for the The Project Dialogism Novel Corpus, a dataset of annotated quotations … |
Aug 03, 2022 |
|
None |
2 |
Reference Corpus of Novels for Sentiment Analysis |
Mar 21, 2024 |
|
Jupyter Notebook |
5 |
The IIT Bombay English-Hindi Parallel Corpus |
Mar 18, 2022 |
|
None |
2 |
Small Japanese-English Subtitle Corpus |
Feb 26, 2023 |
|
Python |
14 |
Searching in-memory corpus with Corpus Query Language (CQL) |
Feb 15, 2023 |
|
Python |
14 |
The New York Times English-Chinese parallel corpus |
May 02, 2023 |
|
None |
3 |
corpus of English and Frech collocations |
May 14, 2022 |
|
Shell |
8 |
English conversation corpus for conversational TTS. |
Apr 02, 2023 |
|
None |
35 |
Xlit-Crowd: Hindi-English Transliteration Corpus |
May 19, 2023 |
|
Python |
2 |
Train model based on Wikipedia English corpus with gensim package |
May 13, 2023 |
|
None |
3 |
Cherokee English Corpus material for NLP research |
Jan 12, 2022 |
|
JavaScript |
3 |
translated italian-language corpus |
Jul 14, 2022 |
|
Python |
6 |
A Cantonese-English parallel corpus extracted from words.hk |
Apr 15, 2023 |
|
Python |
71 |
A large parallel corpus of English and Japanese |
May 07, 2023 |
|
HTML |
77 |
Text corpus calculation in Javascript. Supports Chinese, English. |
Apr 28, 2023 |
|
Python |
5 |
The experiment with applying NLP to correction of definite/indefinite articles in English text corpus |
Nov 22, 2020 |
|
JavaScript |
230 |
:closed_book: The largest English-language thesaurus |
Sep 10, 2022 |
|
None |
128 |
DrupalConsole English Language |
Oct 19, 2022 |
|
None |
3 |
A text corpus collection for the DroppedText language. |
Aug 28, 2022 |
|
Python |
8 |
The unified corpus building environment for Language Models. |
Jan 13, 2022 |
|
Jupyter Notebook |
4 |
Corpus reader extension for the Classical Language Toolkit |
Apr 09, 2023 |
|
JavaScript |
3 |
An educational language with English-like syntax |
Sep 24, 2018 |
|
Python |
163 |
chinese and english corpus process script, python, c++, java |
Apr 23, 2023 |
|
None |
15 |
English Lemma Database - Compiled by Referencing British National Corpus |
Aug 17, 2022 |
|
Perl |
2 |
Texts from Corpus of Middle English Prose and Verse |
Feb 15, 2022 |
|
Python |
4 |
PASTRIE: A Corpus of Prepositions Annotated with Supersense Tags in Reddit International English |
Dec 02, 2021 |
|
Python |
3 |
KB RiR project to Collect a corpus of Dutch novels 1800-2000 and Investigate Canonicity |
Apr 17, 2022 |
|
HTML |
2 |
An English language translation of the Niddesa |
Feb 08, 2023 |
|
Python |
15 |
Co-reference resolution for the English language. |
Jun 15, 2022 |
|
None |
2 |
A dependency treebank based on the Russian Error-Annotated English Learner Corpus (REALEC) |
Sep 04, 2021 |
|
None |
44 |
MultilingualShareGPT, the free multi-language corpus for LLM training |
Apr 12, 2023 |
|
Shell |
5 |
Text corpus the of Tlingit language for linguistic research. |
May 21, 2023 |
|
None |
2 |
A YouTube speech corpus to study Asian North American English. |
Feb 22, 2023 |
|
None |
7 |
Unraid English Language Repo |
Jan 08, 2022 |
|
JavaScript |
150 |
English (natural language) parser |
Sep 10, 2022 |
|
JavaScript |
7 |
English language writing assistant |
Jul 25, 2022 |
|
JavaScript |
10 |
English language blog website |
Feb 20, 2023 |
|
Makefile |
2 |
Taiwan English Language Setter |
May 26, 2023 |
|
JavaScript |
47 |
Language-annotated Abstraction and Reasoning Corpus |
Apr 24, 2023 |
|
Python |
6 |
Indonesian corpus for Natural Language Processing |
Dec 14, 2019 |
|
None |
20 |
Microsoft Speech Language Translation (MSLT) Corpus |
Apr 24, 2023 |
|
None |
2 |
Combining Reactive.jl with the Gtk toolkit |
Oct 29, 2021 |
|
Python |
2 |
Preprocessing and downloading scripts for the Santa Barbara Corpus of Spoken American English (SBCSAE). |
Feb 13, 2023 |
|
C |
2 |
English language support for the mimic TTS system |
Feb 12, 2022 |
|
E |
2 |
English script samples for the angle programming language |
Mar 17, 2020 |
|
HTML |
2 |
An English guide to the YAYA programming language |
Jan 31, 2023 |
|
None |
6 |
Corpus Creation for Sentiment Analysis in Code-Mixed Tamil-English Text |
Apr 30, 2023 |
|
None |
3 |
A Hmong language corpus derived from the soc.culture.hmong Usenet group |
Jan 30, 2023 |
|
None |
2 |
The corpus and models for Burmese (Myanmar language) Sentence Tokenization |
Apr 30, 2023 |