Stars: 30
Forks: 12
Language: Python
Last Updated: Dec 15, 2021
Similar Repos
Language | Stars | Description | Updated At |
---|---|---|---|
Python | 3 | Create PDFs and plain text from hOCR documents | May 22, 2022 | |
Ruby | 100 | Create PDFs from Jekyll pages & documents. | Oct 10, 2022 | |
Python | 7 | Helpers to create .csv files of word-level bounding boxes from text-based pdfs, or from hocr … | May 31, 2022 | |
Python | 36 | Create small, searchable PDFs from scanned documents | Apr 25, 2023 | |
Python | 2 | Script to extract plain text from pdfs and recover broken sentences | Aug 14, 2022 | |
Python | 12 | Create realistic looking handwritten text PDFs from text files. | Dec 28, 2021 | |
Python | 2 | create plain text transcript from subtitles file | Aug 13, 2019 | |
Python | 10 | :rocket:Parse PDFs, Word and Excel documents. Read, Create, Merge/Combine, Extract data from office documents. | Aug 20, 2022 | |
Ruby | 2 | Break Apart Documents into Images, Text, Pages and PDFs | Aug 31, 2021 | |
Ruby | 813 | Break Apart Documents into Images, Text, Pages and PDFs | Oct 08, 2022 | |
JavaScript | 4 | Quickly create Google Calendar events from plain text. | May 24, 2023 | |
JavaScript | 10 | Create Jira tickets from a plain text format | May 15, 2023 | |
JavaScript | 4 | Find text and create bookmarks in PDFs | Oct 06, 2022 | |
Python | 8 | minimalist wrapper around ocropus for generating hOCR documents from images | Jul 09, 2019 | |
Java | 6 | Create PDF files from images + optional HOCR support | Nov 11, 2020 | |
Rich Text Format | 3 | Library for extracting plain text from documents (files) for further processing (indexing and searching) | Jun 05, 2020 | |
Python | 2 | Python command line tool to quickly create PDFs from text content. | May 13, 2017 | |
Python | 2 | Create PDFs from Zendesk forums | May 24, 2017 | |
Ruby | 6 | Mutate URLs and hyperlinks in HTML and plain text documents with ease | Feb 04, 2020 | |
Ruby | 3 | Texas provides an easy way to create PDFs from LaTeX documents using ERb templates. | Aug 13, 2019 | |
None | 2 | A GUI to produce PDFs from scanned documents | Jan 25, 2020 | |
Dockerfile | 458 | Run LibreOffice in AWS Lambda to create PDFs & convert documents | Oct 01, 2022 | |
HTML | 3 | Clean up text copied from PDFs | May 14, 2022 | |
JavaScript | 2 | Create an FTS index of text inside PDFs | Mar 17, 2022 | |
JavaScript | 2 | Create readable PDFs from web pages | Oct 03, 2020 | |
Ruby | 23 | Slaw is a lightweight library for rendering and generating Akoma Ntoso acts from plain text … | May 06, 2022 | |
Haskell | 18 | A prettyprinting library designed for laying out plain text documents | Jul 18, 2022 | |
Go | 4 | Turn pdf documents into plain text for searching, indexing, etc. | Jun 05, 2020 | |
Python | 108 | Create quizzes in QTI format for Canvas from Markdown-based plain text | Oct 09, 2022 | |
Python | 95 | Create a Gephi Citation Graph based on Text Analysis of PDFs from Zotero | Oct 16, 2022 | |
TeX | 2 | Plain TeX and documents for upTeX | Jan 29, 2022 | |
Ruby | 2 | Provides a Jruby wrapper for Apache PDFBox library to extract plain text from PDF documents. | Nov 18, 2020 | |
JavaScript | 108 | Extract text from pdfs that contain searchable pdf text | Sep 24, 2022 | |
Ruby | 56 | Create PDFs from Cucumber features and scenarios for printing | Sep 13, 2021 | |
Shell | 19 | Extracts plain text from docx files | May 31, 2021 | |
C | 6 | Extract plain text from pdf files. | Jun 29, 2022 | |
Rust | 4 | Extract Bible references from plain text | Jun 16, 2022 | |
Jupyter Notebook | 2 | Extract tables from Plain-Text Files. | Sep 10, 2023 | |
R | 20 | :no_entry: ARCHIVED :no_entry: Extract Text from 'PDFs' | Jun 23, 2022 | |
None | 103 | Create Microsoft Documents automatically using Text and Template files | Aug 05, 2022 | |
HTML | 5 | Create PDFs from a variety of formats. | Feb 14, 2023 | |
C | 134 | scan paper documents 📄 from a scanner 🖨️ as PDFs to Google Drive for full-text … | Sep 04, 2022 | |
Python | 69 | Simple, Pythonic extraction of text, shapes and images from PDFs | Oct 14, 2022 | |
Scala | 198 | Create PDFs from Scala using plain old HTML and CSS. Uses wkhtmltopdf on the back-end … | Oct 12, 2022 | |
Scala | 2 | Create PDFs from Scala using plain old HTML and CSS. Uses wkhtmltopdf on the back-end … | May 13, 2024 | |
Python | 2 | Extract text from papers PDFs and abstracts, and remove uninformative words. | Apr 27, 2023 | |
C# | 5 | Extracts plain text data from given RTF formatted text. | Jun 16, 2021 | |
Python | 19 | Get text from documents format | Sep 08, 2021 | |
JavaScript | 9 | Tool to convert Google Docs Documents into Markdown and Org Mode plain text formats. | May 06, 2023 | |
PHP | 7 | Transform PDFs and other documents into Algolia records | Jan 07, 2021 |