Stars
5
Forks
1
Language
Jupyter Notebook
Last Updated
Dec 19, 2023
Similar Repos
Repo | Language | Stars | Description | Updated At |
---|---|---|---|---|
Java | 2 | Parsing dumps of wiktionary | Nov 21, 2021 | |
Emacs Lisp | 3 | Translate words using wiktionary | May 09, 2022 | |
Python | 9 | Etymological graphs based on Wiktionary dumps | Aug 22, 2022 | |
Python | 20 | Extract corpora from Wikipedia dumps | Oct 25, 2022 | |
Jupyter Notebook | 81 | Interactive visualization of Wiktionary words and etymologies. | Apr 13, 2023 | |
Python | 4 | Wiktionary skill to define words | Jan 06, 2022 | |
Python | 4 | extract protobuf structures from compiled applications and dumps | Dec 20, 2022 | |
Python | 43 | Tools to manipulate and extract data from wikipedia dumps | Jan 10, 2023 | |
Python | 11 | Extract plain text from Arabic Wikipedia dumps. | Feb 19, 2021 | |
Python | 5 | Extract news stories from LexisNexis Bulk API dumps | Jan 24, 2024 | |
Python | 2 | Easily extract words from a corpus. | Mar 31, 2021 | |
Jupyter Notebook | 2 | To extract words from the excel files | Jul 19, 2022 | |
Python | 2 | Extract text from papers PDFs and abstracts, and remove uninformative words. | Apr 27, 2023 | |
Shell | 62 | Extract individual (natural-language) words from source code | Aug 01, 2022 | |
R | 2 | Extracting Morphemes from Wiktionary Entries | Oct 27, 2021 | |
Ruby | 7 | Extract, process and import Discogs monthly XML Data Dumps | Oct 14, 2022 | |
PHP | 13 | Class to extract relevant words from a given text | Sep 26, 2022 | |
Python | 2 | A small tool to extract words from any file | Oct 02, 2013 | |
Python | 2 | A Python tool that translates words by using the Wiktionary API. | May 24, 2022 | |
Python | 11 | Extract MFCCs from videos and make bag-of-audio-words (BOAW) representations. | May 09, 2021 | |
Python | 2 | API based on FastApi and Tesseract to extract words from scanned documents | Sep 23, 2022 | |
Nim | 133 | Extract a plain text corpus from MediaWiki XML dumps, such as Wikipedia. | Jun 13, 2022 | |
JavaScript | 9 | Provides a two-way map between the words and phonemes listed in the CMU Pronouncing Dictionary. | Jul 14, 2022 | |
Python | 23 | Extract data from German Wiktionary XML files. Allows you to add your own extraction methods … | Apr 15, 2023 | |
Python | 20 | Inflecting Finnish words (verb inflection, comparatives, cases, possessive suffixes, clitics) using Wiktionary-compatible declensions and conjugations | Aug 10, 2022 | |
HTML | 2 | Read / Parse article from wiktionary to JSON | Sep 19, 2021 | |
Python | 5 | Scraping grapheme-to-phoneme data from Wiktionary | Jul 09, 2022 | |
JavaScript | 2 | Book/Text Summarizer - Extract most frequent words and phrases | Apr 18, 2022 | |
Python | 4 | These scripts extract articles from wikipedia and wikivoyage into csv, gpx, osm or sql dumps … | May 02, 2019 | |
Jupyter Notebook | 2 | Utility to extract interesting words from documents, and store their counts and co-occurring documents and … | Sep 06, 2022 | |
Python | 2 | Proof-of-concept OpenCV project to extract characters from Wordbase screenshots and generate suitable words | Jun 20, 2017 | |
Python | 5 | Scrapes some Finnish word definitions from English Wiktionary. | Dec 16, 2021 | |
C | 2 | Example signal/protocol dumps (from git://sigrok.org/sigrok-dumps) | Jan 30, 2022 | |
Python | 2 | Pull Chinese character information from Wiktionary and begin to analyze it. | Feb 14, 2023 | |
Shell | 2 | Restoring Mysql and Mongodb dumps from s3 | Aug 29, 2018 | |
Clojure | 3 | A modern latin dictionary, built with what I could extract from Whitaker's WORDS. | May 23, 2016 | |
Python | 2 | Golchin is a telegram bot that extract English words from file and translate it to … | Sep 06, 2022 | |
Python | 3 | Wiktionary for machines (and exacting people). | Mar 07, 2023 | |
Jupyter Notebook | 2 | Maori word list from Wiktionary with information about borrowings from English | Dec 29, 2021 | |
Python | 2 | Tool for extracting IPA pronunciations from Wiktionary XML dump | Nov 13, 2022 | |
Python | 9 | 📖 A list of 4262 German abbreviations from Wiktionary | Jul 23, 2021 | |
JavaScript | 26 | Parses FTP dumps from FedBizOpps. | Jan 28, 2023 | |
Python | 41 | Dump objects from .NET dumps. | Jul 27, 2022 | |
JavaScript | 3 | Dumps data from mcbe packets | Nov 03, 2022 | |
Java | 4 | A small tool use tesseract-ocr engine to extract words from screenshot, AWT GUI | Aug 17, 2020 | |
TypeScript | 3 | parse words from uploaded text file and count each words | Mar 29, 2022 | |
Rust | 2 | Crawl Wiktionary Courteously yet Correctly and Persistently | Jul 09, 2022 | |
None | 7 | Extracts the french-to-english translations from the French Wiktionary | Nov 23, 2021 | |
Java | 2 | A program with grafic interface used to extract multiple meaning words starting from another word | Jan 23, 2014 | |
JavaScript | 3 | Extracts server access keys from 3DS and WiiU dumps | Nov 05, 2021 |