|
TeX |
7 |
Stefanowitsch: Corpus linguistics |
Oct 19, 2022 |
|
Rust |
4 |
Rust library for text feminization using open corpus linguistics data |
Nov 23, 2021 |
|
Python |
2 |
Python tool for corpus linguistics |
Apr 05, 2018 |
|
C |
5 |
Alignment functions for corpus linguistics |
Nov 02, 2022 |
|
TeX |
4 |
Metadata header format for computational and corpus linguistics |
Jul 24, 2017 |
|
R |
3 |
Corpus Linguistics slides, labs, assignments and data |
Nov 25, 2022 |
|
Python |
96 |
Japanese text8 corpus for word embedding. |
Jul 17, 2022 |
|
Python |
2 |
(unofficial) niconico Japanese news corpus |
Oct 03, 2019 |
|
None |
2 |
Small Japanese-English Subtitle Corpus |
Feb 26, 2023 |
|
Python |
2 |
Japanese Livedoor news corpus for huggingface datasets |
Feb 11, 2024 |
|
Python |
70 |
Laboro BERT Japanese: Japanese BERT Pre-Trained With Web-Corpus |
Aug 25, 2022 |
|
Python |
2 |
Example code for Japanese Realistic Textual Entailment Corpus |
Dec 27, 2021 |
|
None |
58 |
Japanese IOB2 tagged corpus for Named Entity Recognition. |
Jul 25, 2022 |
|
Python |
198 |
Tutorial to train fastText with Japanese corpus |
Jul 10, 2022 |
|
Python |
2 |
(A reboot of) a Pinax-based platform for crowd-sourced collaborative corpus linguistics |
Feb 17, 2019 |
|
GDScript |
5 |
A set of GUI tools for hobbyist audio series production |
Jun 30, 2021 |
|
Java |
2 |
twitter-corpus-tools |
Jan 28, 2023 |
|
Python |
71 |
A large parallel corpus of English and Japanese |
May 07, 2023 |
|
None |
112 |
Lists of text corpora and more (mainly Japanese) |
Apr 19, 2023 |
|
Jupyter Notebook |
11 |
Natural language processing on Buddhist texts in Pāli and Sanskrit, mostly corpus linguistics. |
Apr 21, 2022 |
|
Python |
66 |
Japanese Realistic Textual Entailment Corpus (NLP 2020, LREC 2020) |
Jul 16, 2022 |
|
Rust |
57 |
Rust programming, in Japanese |
Apr 09, 2023 |
|
Python |
36 |
The corpus of Japanese spam messages of invitation Mama Katu. |
Aug 06, 2022 |
|
Python |
3 |
For evaluating speech recognition system using the Corpus of Spontaneous Japanese (CSJ) |
Jul 22, 2022 |
|
Python |
10 |
Scripts for creating a Japanese-English parallel corpus and training NMT models |
May 23, 2023 |
|
Python |
4 |
Scraper, Twitter streaming API collectors and nltk scripts, used in IEM2201D Corpus Linguistics Research Project |
Dec 22, 2014 |
|
None |
3 |
Corpus for speech-to-text translation in Japanese-English based on CoVoST 2 |
Feb 02, 2023 |
|
Nix |
9 |
Declarative infrastructure for my hobbyist datacenter. |
Aug 14, 2022 |
|
C++ |
22 |
mirakc-tools for Japanese TV broadcast contents |
Jan 28, 2023 |
|
CoffeeScript |
14 |
Tools for Japanese romanization, verb deinflection, etc. |
Jan 17, 2023 |
|
HTML |
12 |
Multilingual parallel corpus, and tools for preprocessing text |
Jul 26, 2022 |
|
None |
11 |
The Japanese translation for "The Rust Performance Book" |
Oct 08, 2022 |
|
Rust |
3 |
Lightweight Japanese IME written in Rust |
Jul 27, 2022 |
|
Rust |
69 |
Japanese Morphological Analysis written in Rust |
Apr 02, 2023 |
|
Rust |
4 |
tools for rust |
Jul 12, 2023 |
|
Rust |
275 |
A hobbyist microkernel written in Rust, featuring a capability-based system similar to seL4. |
May 05, 2023 |
|
Python |
23 |
tools for creating computer-generated, corpus-driven graded readers |
Jan 28, 2023 |
|
Shell |
15 |
Japanese translation of rust-lang-nursery/nomicon |
Feb 04, 2022 |
|
Rust |
2 |
A Japanese Sentence Tokenizer written in Rust. |
Jun 28, 2022 |
|
Python |
7 |
some basic scripts for linguistics |
Nov 28, 2022 |
|
Pascal |
5 |
A set of tools for parsing and studying Japanese |
Jan 25, 2022 |
|
C |
163 |
liballoc - a memory allocator for hobbyist operating systems |
Apr 20, 2023 |
|
C |
4 |
Libunix - a unix abstraction for hobbyist operating systems |
Sep 16, 2022 |
|
R |
2 |
Supplementary materials for "Corpus linguistic and experimental studies on the meaning-preserving hypothesis in Indonesian voice … |
Feb 10, 2022 |
|
Rust |
2 |
Performance Tools for Rust |
Jun 16, 2022 |
|
None |
2 |
SMTP tools for Rust. |
Jan 31, 2020 |
|
Rust |
17 |
A quiz/training app for Japanese learners. Written in Rust. |
Mar 22, 2023 |
|
Python |
2 |
Tools to manage and convert GiellaLT corpus files |
Nov 09, 2022 |
|
Scala |
3 |
The Percy Bysshe Shelley Manuscript Corpus and Tools |
Sep 24, 2017 |
|
Rust |
2 |
A small Rust library to conjugate Japanese words |
Jul 21, 2022 |