Stars
211
Forks
46
Language
Ruby
Last Updated
Sep 26, 2022
Similar Repos
Repo | Language | Stars | Description | Updated At |
---|---|---|---|---|
Jupyter Notebook | 6 | Extracting text, images and annotations from pdf files. | Nov 29, 2021 | |
Java | 7 | Project for extracting reference strings from PDF publications. | Mar 28, 2022 | |
Python | 7 | Jina Executor used for extracting images and text as chunks from PDF data | Jul 27, 2022 | |
Python | 2 | NLP Pdf Minning Extracting text from pdf | May 30, 2021 | |
Python | 4 | A minimal CLI tool for extracting highlighted text from PDF files. | Mar 06, 2022 | |
C | 25 | cli for extracting text from PDF files | Sep 03, 2022 | |
Python | 26 | Extracting text from images, removing grids from images, removing background and extracting useful text using … | Sep 29, 2022 | |
C++ | 3 | A fast and accurate command line tool for extracting text from PDF files. | May 31, 2023 | |
None | 2 | Extracting text from pdf file - C# | Feb 22, 2022 | |
Python | 2 | Experiment in extracting text from PDF | Jun 15, 2015 | |
Rust | 2 | A tool for extracting text from text | Sep 26, 2021 | |
Swift | 9 | Command line tool for extracting text from images using Apple's Vision framework. | Aug 29, 2022 | |
Python | 21 | Tool for extracting examination questions from PDF documents | May 27, 2022 | |
HTML | 4 | A text extractor for extracting text from HTML, PDF, Image and other files. | Sep 27, 2022 | |
Python | 2 | A Python tool for extracting Metadata from PDF files. | Jul 02, 2022 | |
Python | 3 | This Package is for extracting tables, table_image, and text from pdf files. | Nov 12, 2020 | |
C | 28 | Tool for reading mhtml files and extracting images and text into separate files. | Mar 07, 2022 | |
Ruby | 30 | Tool for extracting plain text from wikipedia data | Mar 14, 2021 | |
Python | 3 | Tool for extracting and sorting links from a text file. | Sep 23, 2014 | |
Java | 31 | A tool for extracting arbitrary tables from untagged PDF documents | Oct 02, 2022 | |
Python | 52 | A software for extracting text from scanned images of printed text documents | Jul 01, 2022 | |
Go | 4 | A simple Go package for extracting text from a PDF file | Jun 08, 2021 | |
Python | 6 | An image processing tool for extracting Sudokus from images | Feb 10, 2021 | |
CSS | 2 | Extracting titles and paragraphs from PDF | Oct 13, 2020 | |
HTML | 3 | A small library for extracting main text content from web pages | Jan 06, 2017 | |
Java | 8 | This is a java software for extracting unit strings from plain text and from table … | Dec 09, 2019 | |
Python | 12 | A Python script for extracting encoded text from PNG images | Jul 21, 2022 | |
Python | 13 | A tool for extracting plain text from Wikipedia dumps | Jun 16, 2022 | |
Python | 3118 | A tool for extracting plain text from Wikipedia dumps | Oct 07, 2022 | |
Python | 3 | A tool for extracting text from speech on videos | Oct 23, 2022 | |
Python | 2 | A tool for extracting plain text from Wikipedia dumps | Jun 25, 2023 | |
Python | 12 | A Django + PyPDF2 application extracting PDF pages, merging and replacing PDF files online. | Oct 04, 2022 | |
Ruby | 3 | A simple MacRuby tool for extracting highlighted passages from PDF files | Feb 15, 2015 | |
C# | 2 | Tool for extracting embedded images from SSRS .rdl report files. | Apr 12, 2021 | |
JavaScript | 3 | Extract an array of pages/text from a pdf. | Oct 26, 2018 | |
HTML | 4 | Go tool for extracting text from specially tagged Go comments | Apr 07, 2021 | |
Python | 9 | Extracting text from pdf files and translate the text into designated language by calling google … | Sep 25, 2022 | |
Java | 31 | DyAnnotationExtractor is software for extracting annotations (highlighted text and comments) from e-documents like PDF. | Sep 30, 2022 | |
Python | 6 | An API for extracting metadata, text, section titles, figures, and references from a PDF file | Jun 29, 2021 | |
Python | 2 | a tool for extracting text in a PDF file and generating a ready-to-print index | Mar 17, 2022 | |
Jupyter Notebook | 2 | python module for extracting texts from URL and PDF | May 20, 2021 | |
Python | 2 | Tool for searching pdfs withthin google and extracting pdf metadata | Aug 30, 2021 | |
Shell | 15 | OwncloudOCR uses tesseract OCR and OCRmyPDF for reading text from images and images in PDF … | Jul 31, 2021 | |
Python | 9 | A tool for extracting/injecting layout images. | Dec 14, 2020 | |
Jupyter Notebook | 5 | Code for extracting images from PDFs | Jun 22, 2021 | |
Python | 31 | Extracting addresses from text | Apr 07, 2022 | |
R | 2 | Extracting features from text | Mar 06, 2021 | |
JavaScript | 61 | Node.js module for rendering pdf pages to images, svgs, html files, text files and json … | Sep 19, 2022 | |
Python | 2 | Python tool for extracting the plain text and hyperlinks from Common Crawl database. | Nov 22, 2019 | |
Elixir | 3 | 🔎 A fast and flexible keyword parser for extracting words, phrases & simple patterns from … | Sep 13, 2022 |