Stars
6
Forks
2
Language
Shell
Last Updated
Jun 30, 2023
Similar Repos
Repo | Language | Stars | Description | Updated At |
---|---|---|---|---|
R | 2 | Corpus linguistic analysis for Corpus Workbench corpora | Feb 07, 2022 | |
None | 3 | A text corpus collection for the DroppedText language. | Aug 28, 2022 | |
HTML | 5 | linguistic data, text, and code relating to the Gothic language | Jun 20, 2022 | |
Python | 2 | Text pre-processing for downstream linguistic analyses | Oct 07, 2022 | |
Java | 2 | A text management tool for linguistic purposes... | Sep 07, 2018 | |
None | 3 | Cherokee English Corpus material for NLP research | Jan 12, 2022 | |
Python | 8 | The unified corpus building environment for Language Models. | Jan 13, 2022 | |
Jupyter Notebook | 4 | Corpus reader extension for the Classical Language Toolkit | Apr 09, 2023 | |
Python | 6 | Discover archetypes in your text corpus using Watson Natural Language Understanding. | May 15, 2021 | |
None | 14 | Arabic vocalized text corpus | Nov 30, 2022 | |
Perl | 44 | Kyoto University Text Corpus | Apr 26, 2023 | |
HTML | 2 | The NENA corpus in plain-text markup | Jan 21, 2022 | |
JavaScript | 3 | translated italian-language corpus | Jul 14, 2022 | |
Python | 6 | Indonesian corpus for Natural Language Processing | Dec 14, 2019 | |
None | 44 | MultilingualShareGPT, the free multi-language corpus for LLM training | Apr 12, 2023 | |
Jupyter Notebook | 5 | A corpus of research software used in COVID-19 research. | Dec 23, 2021 | |
Python | 5 | Functions for extracting commonly used linguistic features from text. | Aug 20, 2022 | |
Python | 28 | Language data store and linguistic query API | Mar 20, 2023 | |
Python | 14 | Searching in-memory corpus with Corpus Query Language (CQL) | Feb 15, 2023 | |
TypeScript | 3 | Linguistic utilities for Interslavic language: https://medzuslovjansky.github.io/js-utils/ | Nov 14, 2023 | |
Jupyter Notebook | 2 | Probing language models for linguistic features in their representations | Oct 05, 2023 | |
Python | 20 | Cookpad Parsed Corpus: a dataset of linguistically annotated recipes (Linguistic Annotation Workshop 2020) | Aug 09, 2022 | |
JavaScript | 11 | Text mining on the Royal Library newspaper corpus | May 24, 2022 | |
Python | 74 | Corpus of auto-labeled text for the cyber security domain | May 14, 2023 | |
None | 2 | The corpus and models for Burmese (Myanmar language) Sentence Tokenization | Apr 30, 2023 | |
Python | 19 | Annotated corpus + evaluation metrics for text anonymisation | Jul 22, 2022 | |
HTML | 2 | Corpus of Raw text for Classical Hindi | Nov 22, 2020 | |
Jupyter Notebook | 5 | Language Model for Historic Dutch (Delpher Corpus) | May 31, 2022 | |
None | 7 | Myanmar Sign Language Corpus for Emergency Domain | Aug 26, 2022 | |
PHP | 7 | Multilingual text corpus integrated with machine-readable dictionary (DICTionary + cORPUS). | Dec 20, 2022 | |
Python | 2 | A research prototype for the Dyna language | Jan 19, 2024 | |
Python | 15 | A Corpus for Research on Robust Automatic Speech Recognition and Natural Language Understanding of Air … | May 07, 2023 | |
Jupyter Notebook | 7 | A corpus of English-language novels combining the ~250 novels of the Corpus of English Novels … | Dec 30, 2022 | |
Python | 8 | Building and Using A Seed Corpus for the Human Language Project | Apr 30, 2019 | |
None | 2 | A corpus of supersense-annotated adpositions and case markers in German natural-language text. | Jun 06, 2022 | |
HTML | 12 | Multilingual parallel corpus,and tools for preprocessing text | Jul 26, 2022 | |
JavaScript | 37 | A Serverless Text Annotation Tool for Corpus Development | May 05, 2023 | |
None | 16 | Public domain corpus of Catalan text | Feb 26, 2023 | |
Shell | 2 | Language models baseline Kaldi script for TORGO corpus | Apr 21, 2022 | |
JavaScript | 47 | Language-annotated Abstraction and Reasoning Corpus | Apr 24, 2023 | |
None | 20 | Microsoft Speech Language Translation (MSLT) Corpus | Apr 24, 2023 | |
R | 7 | Text Mining for Psychological Research | Aug 03, 2022 | |
Jupyter Notebook | 38 | Text Summarization for Research Papers | Sep 17, 2022 | |
HTML | 43 | R-package for text mining with the Corpus Workbench (CWB) as backend | May 11, 2023 | |
JavaScript | 9 | 👩🔬 A web-based, open-access platform for linguistic research on old indic texts | Apr 11, 2023 | |
Java | 4 | Code for the GGPOnc corpus - A Corpus of German Medical Text with Rich Metadata … | Sep 30, 2022 | |
Python | 75 | The Definition Extraction From Text corpus and relevant formatting scripts | Jun 02, 2022 | |
Python | 197 | UA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrainian Language | Jul 21, 2022 | |
R | 2 | Supplementary materials for "Corpus linguistic and experimental studies on the meaning-preserving hypothesis in Indonesian voice … | Feb 10, 2022 | |
CWeb | 5 | Language research sketchbook | Apr 22, 2023 |