Stars
54
Forks
5
Language
Perl
Last Updated
Apr 25, 2024
Similar Repos
Language | Stars | Description | Updated At |
---|---|---|---|
Python | 72 | Kyoto University Web Document Leads Corpus | Apr 10, 2023 | |
JavaScript | 5 | An annotation tool for the Kyoto University Corpus | Apr 11, 2023 | |
Jupyter Notebook | 2 | Thai text summarization in NLP System course at Chulalongkorn University using thaigov-corpus | Aug 09, 2021 | |
None | 9 | Tokyo Metropolitan University Paraphrase Corpus (TMUP) | Jan 07, 2023 | |
None | 14 | Arabic vocalized text corpus | Nov 30, 2022 | |
Python | 9 | Nanyang Technological University - Multilingual Corpus (STB subcorpora) | Mar 11, 2019 | |
PHP | 7 | Multilingual text corpus integrated with machine-readable dictionary (DICTionary + cORPUS). | Dec 20, 2022 | |
Python | 74 | Repository for the Georgetown University Multilayer Corpus (GUM) | Apr 02, 2023 | |
None | 16 | Public domain corpus of Catalan text | Feb 26, 2023 | |
C++ | 192 | The Kyoto Text Analysis Toolkit for word segmentation and pronunciation estimation, etc. | Jan 10, 2023 | |
C++ | 2 | The Kyoto Text Analysis Toolkit for word segmentation and pronunciation estimation, etc. | Nov 10, 2023 | |
Python | 2 | Large scale web corpus of Austronesian text. | Jan 17, 2022 | |
Python | 16 | Generate poetry based on text corpus input | Aug 02, 2022 | |
Python | 19 | Annotated corpus + evaluation metrics for text anonymisation | Jul 22, 2022 | |
HTML | 2 | Corpus of Raw text for Classical Hindi | Nov 22, 2020 | |
HTML | 2 | The NENA corpus in plain-text markup | Jan 21, 2022 | |
C++ | 3 | Kyoto Cabinet bindings for Node.JS | Jun 05, 2017 | |
None | 2 | Kyoto summer2autumn dataset official repository | Feb 22, 2024 | |
HTML | 12 | Multilingual parallel corpus, and tools for preprocessing text | Jul 26, 2022 | |
None | 3 | A text corpus collection for the DroppedText language. | Aug 28, 2022 | |
Python | 14 | Recognize bio-medical entities from a text corpus | Sep 01, 2022 | |
Python | 31 | Answer questions on a given corpus of text. | Jun 12, 2022 | |
None | 5 | Word2vec models trained on an Estonian text corpus. | Feb 16, 2021 | |
HTML | 77 | Text corpus calculation in Javascript. Supports Chinese, English. | Apr 28, 2023 | |
None | 112 | lists of text corpus and more (mainly Japanese) | Apr 19, 2023 | |
JavaScript | 37 | A Serverless Text Annotation Tool for Corpus Development | May 05, 2023 | |
JavaScript | 11 | Text mining on the Royal Library newspaper corpus | May 24, 2022 | |
Python | 2 | early chinese text corpus sourced from kanseki repository | Dec 10, 2021 | |
Python | 2 | Electronic Text Corpus of Syntactically Annotated Neo-Sumerian | May 17, 2023 | |
None | 2 | A Large-scale Vietnamese News Text Classification Corpus | Dec 12, 2023 | |
Python | 9 | Kyoto Tycoon client library for Python. | Oct 22, 2021 | |
Clojure | 5 | Kyoto Cabinet client implementation for Clojure | Oct 24, 2014 | |
JavaScript | 2 | kyoto-tycoon session store for connect | Apr 16, 2014 | |
Go | 16 | Go bindings for Kyoto Cabinet library. | Feb 11, 2021 | |
Go | 36 | Go bindings for Kyoto Cabinet library. | Oct 31, 2021 | |
OCaml | 6 | OCaml bindings for kyoto cabinet DBM | Nov 13, 2019 | |
C++ | 3 | Use Kyoto Cabinet like python dictionary | Jun 05, 2021 | |
Go | 3 | Kyoto-U PandA API for Golang | Feb 19, 2022 | |
Python | 6 | Wikipedia text corpus for self-supervised NLP model training | Apr 26, 2022 | |
Jupyter Notebook | 3 | Exploratory Text Analytics project on corpus of political texts | Jun 13, 2022 | |
Shell | 5 | Text corpus of the Tlingit language for linguistic research. | May 21, 2023 | |
Python | 4 | RDFLib Store backed by Kyoto Cabinet (Python2.6+) | Dec 01, 2021 | |
C++ | 2 | MessagePack-RPC Server Plugin for Kyoto Tycoon | Aug 16, 2021 | |
GLSL | 3 | TouchDesigner Workshop in MTRL Kyoto on 20.5.2018 | Jun 21, 2021 | |
R | 7 | Hult University Student Repository "Text Analytics" | Dec 12, 2021 | |
Rust | 4 | Rust library for text feminization using open corpus linguistics data | Nov 23, 2021 | |
Python | 75 | The Definition Extraction From Text corpus and relevant formatting scripts | Jun 02, 2022 | |
JavaScript | 3 | For presentation of audio/video/text corpus of Kofan texts. | Jan 28, 2023 | |
Jupyter Notebook | 211 | Text corpus for Bahasa Malaysia, https://malaya.readthedocs.io/en/latest/Dataset.html | Apr 12, 2023 | |
Jupyter Notebook | 7 | Multi-way parallel text corpus of 5 key Ugandan languages. | Apr 19, 2023 |