Stars
6
Forks
1
Language
Python
Last Updated
Dec 26, 2023
Similar Repos
Repo | Language | Stars | Description | Updated At |
---|---|---|---|---|
JavaScript | 79 | Find and fix bothersome punctuation and formatting errors in English texts. | Feb 17, 2023 | |
Ruby | 28 | Repository of synonyms, protected words, stop words, and localizations | Jun 13, 2022 | |
PHP | 18 | A stop-watch and timer with start/stop/pause features and minimal human-friendly formatting. | Jul 29, 2021 | |
Go | 3 | Natural Language Processing (NLP) Tokenization Libary designed for English. Fast, Lean, Customizable. Tokenizes text, replaces … | Feb 07, 2023 | |
Python | 4 | Basic Text Mining and NLP operations such as Tokenization, Portuguese POS Tagging, Stopword Removal among … | May 04, 2019 | |
R | 491 | Pipeable steps for feature engineering and data preprocessing to prepare for modeling | Apr 25, 2023 | |
PHP | 2 | Integrates WordPress stop-words (and related filters) with ElasticPress mapping. | Jun 17, 2021 | |
Python | 87 | GPT-2 Discord Bot and Steps to Train Something Like You | Mar 04, 2023 | |
Python | 3 | Predicting next set of words with BERT, GPT, and XLNET | Jun 10, 2022 | |
Ruby | 3 | Contains stop words lists and methods for extracting keywords from strings | May 14, 2021 | |
Dart | 3 | Data package containing currencies and information about them that helps formatting. | Dec 31, 2022 | |
JavaScript | 6 | Tokenizes sentences containing a mix of Chinese and English words. | Nov 07, 2022 | |
Jupyter Notebook | 4 | There are 8 different text files of ebooks which are available freely on http://www.gutenberg.org/ . … | Dec 26, 2022 | |
C++ | 4 | Implemented Preprocessing steps, Feature Extraction techniques and Naive Bayes Classifier in C++. Moreover, we have … | Apr 15, 2023 | |
JavaScript | 154 | NPM package for creating a keyword array from a string and excluding stop words. | Oct 02, 2022 | |
Python | 2 | A tokenization and text manipulation command line tool for removing various symbols & replace words … | May 14, 2019 | |
Python | 3 | Censor a range of words and phrases with ease, stop people from bypassing censors and … | Mar 19, 2023 | |
Jupyter Notebook | 2 | Spam detection and removal in Twitter using Machine Learning techniques containing text pre-processing techniques and … | Aug 03, 2022 | |
Python | 2 | A plugin containing xblocks and apps supporting GPT and other LLM use on edX | Aug 22, 2023 | |
Java | 3 | A basic Docx to PDF converter. Supports text, tables (without formatting) and images. It's based … | Apr 22, 2022 | |
Python | 4 | An Anki add-on to convert user highlighted words to Anki decks containing cards of sentences … | Apr 30, 2023 | |
Java | 155 | A small Java library for simple text analysis - counting strings, identifying languages, and removing … | Mar 10, 2023 | |
JavaScript | 3 | Detect swear words, and handle strings containing them. Smart Detection helps to detect words using … | Apr 10, 2023 | |
None | 25 | Dictionaries of names, surnames, acronyms and it's extensions, stop-words, etc., which I gathered for different … | Jan 21, 2023 | |
Python | 25 | A Repository for maintaining various human skeleton preprocessing steps in numpy and tensorflow along with … | Sep 13, 2022 | |
None | 27 | Dataset for Natural Language Processing using a corpus of medical transcriptions and custom-generated clinical stop … | Aug 21, 2022 | |
Python | 2 | This respository contains preprocessing steps taken while making a classifier for OCT - Optical Coherence … | May 07, 2023 | |
PHP | 149 | ⚡ GUI for editing LLM vector embeddings. No more blind chunking. Upload content in any … | Jul 10, 2023 | |
HTML | 4 | Page containing information for all formatting options usable in Markdown files, issues, PRs and the … | Apr 11, 2023 | |
HTML | 9 | Building a Natural language Responsive system by Preprocessing(removing stop words , numbers, urls and stemmming … | Feb 13, 2020 | |
PHP | 215 | Text formatting library that supports BBCode, HTML and other markup via plugins. Handles emoticons, censors … | Oct 17, 2022 | |
Python | 2 | A tool to evaluate the performance of various machine learning algorithms and preprocessing steps to … | Apr 19, 2024 | |
Python | 3 | GPT-3 based Question Answering System that reads text from PDF, DOCX, or TXT files and … | Apr 30, 2023 | |
Python | 2 | A python app that searches for words containing specific letters in the Holy Quran and … | Nov 02, 2021 | |
TypeScript | 2 | Chat with an AI that's powered by GPT-j. Talk with it, set parameters, ask questions, … | Mar 06, 2022 | |
Python | 17 | A NARS implemented as a GPT model prompted to invoke reasoning steps, with NARS-based memory … | Apr 24, 2023 | |
Jupyter Notebook | 3 | In this Project the concept of Topic Modeling has been Implemented, basic NLP preprocessing has … | Dec 23, 2020 | |
Jupyter Notebook | 2 | Goal:Understand about customers coming up in mall. Performed steps of data profiling, data preprocessing and … | Jan 18, 2022 | |
PHP | 12 | This is a repo containing all code and steps taken to download, setup the process … | Apr 04, 2023 | |
Jupyter Notebook | 2 | This project is based on a Basic Text summarizer implemented with the concept of Cosine … | Sep 01, 2020 | |
C++ | 58 | Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It … | May 09, 2022 | |
Jupyter Notebook | 2 | Text preprocessing by removing stop words, URLs and Numbers which will not give much meaning … | Dec 10, 2021 | |
Go | 3 | The parser can read given folder with *.json files, filtering and qualifying input data with … | May 05, 2023 | |
C++ | 2 | A repo containing microcontroller code and some steps on how I turned my "dumb" overhead … | Oct 27, 2022 | |
Java | 11 | Preprocessing for NIST Special Dataset 19 (uppercase single-character handwritten characters A..Z). Converts to same formatting … | Oct 22, 2022 | |
Jupyter Notebook | 2 | Text preprocessing is one of the most important tasks in Natural Language Processing (NLP). For … | Sep 09, 2022 | |
HTML | 8 | This project is based on a Basic Text summarizer implemented with the concept of Cosine … | Jul 22, 2022 | |
Jupyter Notebook | 2 | Built a practical Multi-Factor Backtesting Framework from scratch based on Huatai Security's(One of China's largest … | Aug 07, 2022 | |
Dart | 2 | A flutter app that can generate Chinese lyrics, poems and prose based on your given … | Jan 07, 2023 | |
Python | 19 | HistoClean is a tool for the preprocessing and augmentation of images used in deep learning … | May 07, 2022 |