Stars
181
Forks
71
Language
Java
Last Updated
Sep 07, 2022
Similar Repos
Repo | Language | Stars | Description | Updated At |
---|---|---|---|---|
Python | 2 | Convert Chinese to zhuyin(bopomofo) or pinyin via text segmentation and dictionaries | Sep 06, 2022 | |
C++ | 2 | Deep Learning Chinese Word Segment | Apr 04, 2018 | |
C++ | 2081 | Deep Learning Chinese Word Segment | Aug 12, 2022 | |
None | 4 | Chinese Word Segment using Seq2Seq Framework. | Aug 09, 2018 | |
Python | 7 | Convert Chinese text to Pinyin or Jyutping | Mar 10, 2023 | |
None | 18 | Frequency dictionaries - one word per line simple text files | Apr 06, 2023 | |
Jupyter Notebook | 11 | Chinese Clinical Text Named Entity Recognition | Jul 05, 2022 | |
Python | 84 | A Chinese word segment model based on BERT, F1-Score 97% | Jul 06, 2022 | |
JavaScript | 4 | Check if text contains a Chinese word. | Apr 29, 2016 | |
Emacs Lisp | 4 | processing chinese text using word frequency data | Jul 02, 2020 | |
JavaScript | 77 | Converts from Chinese characters to pinyin, between simplified and traditional, and does word segmentation. | Apr 15, 2023 | |
Python | 3 | Application and implementation of Chinese text automatic summarization system and click bait recognition based on … | Aug 11, 2020 | |
Python | 5 | Simple optical character recognition (OCR) for Chinese text. | Oct 27, 2020 | |
JavaScript | 3 | Extracting n-grams from text and display in beautiful D3 word cloud. | May 29, 2021 | |
Python | 4 | Create handwritten word embeddings from a text recognition Seq2Seq system. | Sep 11, 2022 | |
Python | 189 | This repository contains datasets and baselines for benchmarking Chinese text recognition. | Jul 06, 2022 | |
Python | 60 | A hack to replace Pride & Prejudice text with closest word2vec model word, and visualize … | Feb 12, 2022 | |
Python | 60 | Word Cloud for Chinese Text Corpus (中文词云制作) | Feb 26, 2023 | |
None | 2 | [Journal of Chinese Informatics]A text semantic matching model based on multi-knowledge about shape, pinyin and … | May 28, 2022 | |
Python | 52 | Improved Text recognition algorithms on different text domains like scene text, handwritten, document, Chinese/English, even … | Apr 25, 2023 | |
Jupyter Notebook | 2 | Dictionaries for OCR text recognition of federal sources of Switzerland in German, French and Italian … | Mar 28, 2020 | |
Python | 185 | An implementation of CRNN (CNN+LSTM+warpCTC) on MxNet for chinese text recognition | Oct 13, 2022 | |
Jupyter Notebook | 9 | NLP for classifying text. Using word Word2Vec word embedding and a neural net with bidirectional … | Apr 05, 2022 | |
Python | 18 | Multi Class Text (Feedback) Classification using CNN, GRU Network and pre trained Word2Vec embedding, word … | Feb 12, 2022 | |
Jupyter Notebook | 2 | Train from scratch word embeddings for hindi text (using word2vec python library) and 3D visualize … | Mar 28, 2022 | |
None | 7 | A Discourse-Level Named Entity Recognition and Relation Extraction Dataset for Chinese Literature Text | Jun 08, 2022 | |
None | 355 | A Discourse-Level Named Entity Recognition and Relation Extraction Dataset for Chinese Literature Text | Oct 19, 2022 | |
Python | 9 | Google Summer of Code 2018 Project: Automatic Speech Recognition for Speech-to-Text on Chinese | Aug 23, 2022 | |
JavaScript | 94 | Markov Chain combined with word vector embedding (word2vec) and part-of-speech tagging, for context-aware text generation. … | Oct 07, 2022 | |
Java | 2 | Java library for Chinese text match using Pinyin - 用于各类汉语拼音匹配问题的 Java 库 | Mar 11, 2020 | |
Java | 36 | Java library for Chinese text match using Pinyin - 用于各类汉语拼音匹配问题的 Java 库 | Jul 29, 2022 | |
Jupyter Notebook | 978 | Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text … | Sep 07, 2022 | |
Python | 2 | Internet Chinese emergency text classification, incident element recognition and incident sentence relation judgment system based … | Feb 11, 2022 | |
Python | 13 | Code implementation for our DAS, 2020 paper titled "Fused Text Recogniser and Deep Embeddings Improve … | Aug 19, 2022 | |
Jupyter Notebook | 7 | Deep Learning Project for the Latin Language using a character based RNN to generate text … | Jun 09, 2022 | |
Jupyter Notebook | 3 | text tokenization, part of speech, named entity recognition, vector space model, word embedding, text classification/clustering, … | May 31, 2022 | |
None | 102 | The SCUT-EPT Dataset for the research of offline handwritten Chinese text recognition (HCTR) in educational … | Mar 31, 2023 | |
Python | 57 | End-to-end model training and deployment reference for handwritten Chinese text recognition, and can also be … | Aug 02, 2022 | |
PHP | 1194 | "結巴"中文分詞:做最好的 PHP 中文分詞、中文斷詞組件。 / "Jieba" (Chinese for "to stutter") Chinese text segmentation: built to be … | Sep 12, 2022 | |
HTML | 11 | Flask website integrated with Tesseract-OCR for reading multiple images, extracting text from them, and saving … | Oct 12, 2022 | |
Python | 1471 | Documents, papers and codes related to Natural Language Processing, including Topic Model, Word Embedding, Named … | Sep 09, 2022 | |
Python | 2 | This module will help to convert your voice (speech) into text using Speech Recognition Library. … | Dec 15, 2021 | |
Jupyter Notebook | 4 | Multilingual text recognition using machine learning ,I in this project i have extracted text from … | Sep 21, 2021 | |
Python | 3 | In this Repository, there is a Detailed Explanation of Text Detection from Images. Text Detection … | Mar 27, 2023 | |
Jupyter Notebook | 5 | Research oriented, developing word embeddings for binary text-polarity classifier based on movie reviews using BoW, … | Jul 28, 2022 | |
Python | 4 | NLP pipeline (Reading text from files .doc, .docx, .pdf, .txt; Basic text cleaning; Tokenization; Stemming; … | Jul 23, 2022 | |
Java | 2 | This is pipeline module for cerating bengali word clusters. Word clusters can be a very … | Mar 04, 2023 | |
Python | 29 | This is a novel project for mathematical knowledge entity recognition. The algorithm is mainly modeled … | Dec 13, 2022 | |
Jupyter Notebook | 56 | 自然语言处理相关实验实现 some experiment of natural language processing, Like text classification, named entity recognition, pos-tags, segment, … | Jul 15, 2022 | |
Python | 43 | Machine Learning APIs for common use cases, include: General OCR (Simplified/Traditional Chinese), Custom OCR, Image … | Aug 04, 2022 |