|
Python |
67 |
Open morphology for Finnish |
Aug 02, 2022 |
|
Jupyter Notebook |
4 |
Curated collection of open source Irish language NLP datasets |
May 07, 2023 |
|
None |
2 |
Igbo Datasets for NLP |
May 23, 2022 |
|
Python |
30 |
NLP Datasets for Indonesian |
Aug 09, 2022 |
|
None |
24 |
datasets for NLP research |
Mar 30, 2022 |
|
Python |
3 |
Readers for NLP Datasets |
Dec 16, 2021 |
|
Python |
4 |
Open Finnish Two-Level morphological analyzer |
Jan 01, 2023 |
|
R |
12 |
Algorithms for Finnish open goverment data |
Jan 19, 2022 |
|
Python |
2 |
Open Assistant dataset translated to Finnish |
May 03, 2023 |
|
Python |
2 |
datasets useful for Tibetan NLP |
Aug 18, 2022 |
|
None |
4 |
Papers, Datasets, Codes about Clinical NLP |
Mar 16, 2022 |
|
Python |
6 |
Text pre-processing for NLP datasets |
Jun 22, 2022 |
|
None |
28 |
List some datasets in NLP field. |
May 19, 2022 |
|
C# |
263 |
My NLP datasets for Russian language |
Oct 07, 2022 |
|
None |
2 |
Datasets for NLP tasks in Luxembourgish |
Nov 09, 2022 |
|
HTML |
4 |
NLP Datasets Collection, for self-use. |
Sep 19, 2022 |
|
R |
10 |
Finnish Meteorological Institute open data API R client |
Apr 17, 2023 |
|
None |
70 |
NLP NER datasets video/music/book bio |
May 08, 2022 |
|
None |
2 |
Audio Datasets | Open Source Audiot Datasets |
May 30, 2023 |
|
Python |
49 |
The Finnish dependency parsing pipeline being developed by the Turku NLP group. Documentation: |
Jun 24, 2022 |
|
Jupyter Notebook |
6 |
Status open datasets |
Dec 02, 2021 |
|
None |
18 |
open traffic datasets |
Apr 03, 2022 |
|
None |
15 |
Open Neuroimaging Datasets |
Dec 20, 2022 |
|
Python |
9 |
Open broad-coverage corpus for Finnish named entity recognition. |
Apr 19, 2022 |
|
Python |
12 |
Open Vietnamese NLP Resources |
Oct 29, 2021 |
|
None |
2 |
Open Source NLP data |
Mar 13, 2022 |
|
CSS |
3 |
Open Source NLP Annotator |
Jan 10, 2022 |
|
None |
5 |
Finnish data |
May 03, 2022 |
|
Jupyter Notebook |
3 |
Open source Coiled datasets |
Jun 28, 2022 |
|
Jupyter Notebook |
23 |
Open Datasets example notebooks |
Aug 01, 2022 |
|
None |
54 |
open-source audio datasets |
Aug 23, 2022 |
|
HTML |
1615 |
Datasets, SOTA results of every fields of Chinese NLP |
Aug 11, 2022 |
|
Python |
55 |
Datasets collection and standardization for NLP extreme multitask learning |
Apr 26, 2023 |
|
Python |
2 |
Datasets for Core NLP tasks in Southeast Asian languages |
Jun 16, 2022 |
|
Python |
242 |
The tool to make NLP datasets ready to use |
Aug 15, 2022 |
|
None |
3 |
creating this repo to host some turkish nlp datasets |
Nov 09, 2022 |
|
Jupyter Notebook |
4 |
DQI (Data Quality Index) calculates quality of NLP datasets. |
Dec 26, 2022 |
|
None |
3 |
NLP360: Curated list of NLP datasets, libraries and articles |
Nov 14, 2022 |
|
Shell |
2 |
Baseline Finnish models trained with Finnish Parliament Speech corpus |
Mar 21, 2023 |
|
R |
7 |
Reboot of the Finnish Meteorological Institute open data API R client |
Jan 22, 2023 |
|
HTML |
5 |
open source datasets for machine learning, the dinosaur datasets |
Aug 07, 2021 |
|
HTML |
2 |
Finnish OmegaT Localisation |
Nov 02, 2022 |
|
CSS |
3 |
Finnish Proposition Bank |
Jun 24, 2022 |
|
None |
18 |
Every Finnish word |
Jun 20, 2022 |
|
Adblock Filter List |
34 |
Finnish Easylist addition. |
Sep 09, 2022 |
|
None |
9 |
Finnish stopwords collection |
Nov 11, 2022 |
|
None |
10 |
Open-source 3D Model datasets |
Oct 03, 2022 |
|
Jupyter Notebook |
22 |
A collection of open datasets |
May 30, 2022 |
|
None |
22 |
Running list of Open Datasets |
Dec 16, 2022 |
|
Jupyter Notebook |
16 |
Colab Compatible FastAI notebooks for NLP and Computer Vision Datasets |
Apr 15, 2022 |