|
CoffeeScript |
28 |
CoffeeScript lib for PDF OCR and text extraction |
Dec 24, 2018 |
|
Makefile |
4 |
Docker container for the kraken OCR engine |
Feb 10, 2021 |
|
HTML |
5 |
Docker container to render PDF files using puppeteer |
Sep 25, 2023 |
|
Jupyter Notebook |
2 |
Make editable Persian(Farsi) PDF from Non-Editable ones, using OCR. |
Jun 21, 2023 |
|
HTML |
108 |
🏭 PDF text extraction pipeline: self-hosted, local-first, Docker-based |
Oct 10, 2022 |
|
None |
7 |
Using PDFPlumber for PDF data extraction |
Sep 25, 2022 |
|
Dockerfile |
2 |
Docker setup of Camelot: PDF Table Extraction |
Jan 02, 2024 |
|
C++ |
4 |
A PDF to text converter using a MLP-based OCR. |
Jan 09, 2021 |
|
Visual Basic |
24 |
Batch convert PDF files to text under Windows, using several text extraction methods or OCR |
Oct 07, 2021 |
|
None |
7 |
Tabula data table PDF extraction for Docker http://tabula.technology/ |
Jun 29, 2022 |
|
Shell |
9 |
OCR recognition with Ocropus in a Docker container |
Oct 19, 2022 |
|
CSS |
2 |
Image To PDF converter using OCR |
Dec 03, 2020 |
|
JavaScript |
7 |
Docker container to Export HTML to PDF/PNG (using Google Chrome) |
May 18, 2022 |
|
Jupyter Notebook |
2 |
Convert voting log pdf by using OCR |
Jan 04, 2024 |
|
Python |
7 |
Extracting data from Image-based PDF files using OCR to JSON files |
Oct 09, 2022 |
|
Python |
5 |
PDF parser using pdfminer and pytesseract for OCR support |
Aug 02, 2022 |
|
JavaScript |
2 |
Using OCR to make digital cutups |
Apr 19, 2022 |
|
CSS |
3 |
ocr-docker is small, Flask powerd web app, helps us to extract text from images and … |
Aug 18, 2022 |
|
Go |
13 |
Make minimum, reproducible Docker container for Go application |
Mar 22, 2023 |
|
None |
33 |
Using ChatGPT and PDF OCR to investigate documentation |
Apr 15, 2023 |
|
HTML |
19 |
PDF OCR using Pure Javascript by tesseract.js api |
Sep 19, 2022 |
|
C |
2 |
PDF Text Extraction |
Oct 09, 2015 |
|
Shell |
6 |
grep word in pdf or image based on OCR |
Jun 03, 2020 |
|
Jupyter Notebook |
5 |
Qubitrics -ML Intern assignment(Text Extraction from image using OCR) |
Aug 10, 2021 |
|
Makefile |
2 |
Make a persistent FreeIPA docker container PDQ |
Feb 03, 2017 |
|
Python |
2 |
ocr,pdf转docx,pdf to docx |
Apr 18, 2023 |
|
Python |
54 |
Python library to extract text from PDF, and default to OCR when text extraction fails. |
Dec 22, 2021 |
|
Dockerfile |
17 |
Make docker compose wait for container dependencies being ready |
Jul 29, 2022 |
|
Python |
349 |
Python script to do PDF OCR conversion using Tesseract |
Aug 19, 2022 |
|
C# |
6 |
Convert image and pdf to text using Window OCR |
Oct 10, 2022 |
|
Dockerfile |
4 |
Docker container to run PDF manipulation utitilies (pdftk, ghostscript...). |
Jan 13, 2022 |
|
C++ |
2 |
SfTesseract is a PDF OCR processer based on Tesseract engine |
Apr 19, 2022 |
|
Shell |
8 |
Alpine-based docker container for chrooted ftp |
Aug 13, 2022 |
|
JavaScript |
33 |
Docker container for Markdown based Raneto Knowledgebase |
Mar 21, 2023 |
|
Java |
5 |
Simplified PDF Data Extraction |
Dec 01, 2021 |
|
Python |
630 |
Simple PDF text extraction |
Oct 16, 2022 |
|
Python |
2 |
MCQ extraction from PDF |
Aug 22, 2021 |
|
JavaScript |
2 |
PDF Automation - OCR Text Recognition |
Jul 27, 2022 |
|
JavaScript |
2 |
OCR Engine/PDF text parser |
Jun 27, 2021 |
|
None |
2 |
Camelot: PDF Table Extraction for Humans |
Mar 23, 2023 |
|
Python |
3281 |
Camelot: PDF Table Extraction for Humans |
Oct 14, 2022 |
|
Python |
5 |
PDF image analysis and selective text extraction using tesseract |
Sep 18, 2020 |
|
Jupyter Notebook |
2 |
Entity Extraction from PDF files using spacy NLP model |
Jun 20, 2023 |
|
Shell |
4 |
Docker container to make building gRPC proxies for Tyk easier. |
May 26, 2019 |
|
Shell |
11 |
Full text extraction using the Open Source Tesseract OCR software https://code.google.com/p/tesseract-ocr/ and imagemagick |
Jan 03, 2022 |
|
Ruby |
308 |
Adds text to PDF files using the cuneiform OCR software |
Sep 09, 2022 |
|
C# |
3 |
Checking out various OCR text extraction stacks. |
Jan 23, 2020 |
|
C |
13 |
Examples for using the ROOT Docker container. |
Jun 29, 2022 |
|
HTML |
7 |
Example Docker Container, based upon `nginx` |
Oct 28, 2021 |
|
Dockerfile |
2 |
Alpine Linux based Rspamd docker container |
Apr 27, 2023 |