Stars: 6
Forks: 1
Language: PHP
Last Updated: Feb 15, 2020
Similar Repos
Language | Stars | Description | Updated At |
---|---|---|---|
Python | 16 | Open Access PDF harvester, metadata aggregator and full-text ingester | Sep 30, 2022 | |
Java | 10 | Extract text content from doc, docx, pdf, rtf, txt, and html files | Mar 12, 2022 | |
Ruby | 478 | Read text and metadata from files and documents (.doc, .docx, .pages, .odt, .rtf, .pdf) | Oct 05, 2022 | |
C# | 4 | Extract text from PDF with Metadata | Aug 19, 2021 | |
C | 7 | A simple file reader that can extract content from .pdf, .doc, .ppt, .xls... files without other tools | Jun 01, 2020 | |
PHP | 2 | A basic PDF generator which takes in user input through a text-box and puts the … | Jul 04, 2019 | |
Jupyter Notebook | 2 | Python script to convert text to Excel, PDF, DOC files | Sep 04, 2021 | |
PHP | 67 | Old PHP scripts to read text content from different binary formats: PDF, DOC, PPT, RTF … | Apr 29, 2022 | |
JavaScript | 3 | Node.js: upload images/PDF/text files to server | May 10, 2020 | |
Python | 4 | Full text search in your pdf documents. | Jun 02, 2022 | |
Python | 18 | A full text and metadata extractor for CKAN | May 24, 2022 | |
Go | 987 | Converts PDF, DOC, DOCX, XML, HTML, RTF, etc to plain text | Aug 24, 2022 | |
Go | 2 | Converts PDF, DOC, DOCX, XML, HTML, RTF, etc to plain text | Sep 18, 2021 | |
Python | 2 | Use Python to search text in doc, xls, pdf, txt files | May 07, 2020 | |
Shell | 24 | Some useful Nemo Actions and Shell Scripts with zenity GUIs: 1. Sandwich PDF Maker (OCR, … | Aug 19, 2022 | |
Python | 6 | Easy to use text extractor, from PDF, DOC, DOCX and other documents, including if necessary … | May 08, 2022 | |
Java | 2 | Minecraft mod that expands upon Echo Shards and other 1.19 content | Jun 08, 2022 | |
PHP | 28 | Allows users to upload, manage, search, and download publications, documents, and similar content (PDF, Power-Point, … | Apr 13, 2023 | |
Java | 27 | Convert text content in PDF to EXCEL format. | Aug 25, 2022 | |
Python | 3 | This app allows users to upload CSV or PDF files, or enter text and ask … | Apr 06, 2023 | |
Clojure | 3 | OpenCompany Search Service - full-text search of content | Apr 29, 2023 | |
C | 6 | A Python extension to extract content and metadata from PDF files efficiently | Aug 31, 2021 | |
Python | 2 | News, full-text, and article metadata extraction in Python 3 | Jul 08, 2018 | |
Jupyter Notebook | 5 | Processing pipeline for smart metadata extraction from full-text publications | Feb 08, 2021 | |
Python | 2 | News, full-text, and article metadata extraction in Python 3 | Nov 09, 2022 | |
PHP | 2 | Make Joomla/Wordpress/Drupal work nicely with Varnish. Send purge requests upon content change. | Nov 09, 2019 | |
Java | 12 | A library enabling easy Lucene indexing of PDF text and metadata | Sep 19, 2021 | |
PHP | 2 | WordPress plugin that enhances full text search with Mroonga | Jan 11, 2022 | |
Python | 938 | Extract text, metadata and references (pdf, url, doi, arxiv) from PDF. Optionally download all referenced … | Oct 05, 2022 | |
JavaScript | 5 | Convert images to PDF, extract PDF text and execute commands depending on the content | Jul 06, 2019 | |
Java | 2 | Simple full text index of a directory with PDF files | Dec 26, 2015 | |
PHP | 2 | WordPress plugin to expose text content of searchable PDFs to the WordPress search box | Jul 05, 2022 | |
Python | 30 | Telegram bot to upload manga/comic to Telegram as PDF/ZIP or Folder (each chapter) \| Uploaded … | May 07, 2023 | |
Vue | 2 | Full stack application to upload and display image or text posts | Jun 25, 2023 | |
Java | 2 | Extract PDF text page by page and bulk upload to Elasticsearch | Nov 12, 2021 | |
HTML | 4 | A text extractor for extracting text from HTML, PDF, Image and other files. | Sep 27, 2022 | |
JavaScript | 2 | Script to migrate full-text content from Elasticsearch to S3 | May 12, 2024 | |
None | 13 | Automatically updated WordPress composer package (full version, with wp-content and themes) | Jun 06, 2022 | |
JavaScript | 19 | A PDF collection reader with built-in full-text search engine | Feb 26, 2022 | |
None | 3 | The full text and metadata of thousands of climate laws and policies | Apr 07, 2023 | |
Python | 12209 | News, full-text, and article metadata extraction in Python 3. Advanced docs: | Oct 18, 2022 | |
Python | 2 | News, full-text, and article metadata extraction in Python 3. Advanced docs: | Oct 16, 2022 | |
HTML | 2 | News, full-text, and article metadata extraction in Python 3. Advanced docs: | Apr 13, 2024 | |
Ruby | 2 | Extract text from ugly PLS slides and condense the content to a nice markdown doc | Sep 06, 2017 | |
Python | 7 | [WIP] Download full text PDF and supplemental materials for each PubMed ID. | Feb 24, 2022 | |
C# | 6 | Recognize page content of a PDF as text using Tesseract and Ghostscript. | Sep 16, 2022 | |
Python | 119 | Python DJVU to PDF converter which preserves OCR text and bookmark metadata (e.g. TOC) | Oct 10, 2022 | |
Python | 2 | Python DJVU to PDF converter which preserves OCR text and bookmark metadata (e.g. TOC) | Jan 01, 2022 | |
HTML | 21 | Content for SuttaCentral, including texts both legacy and bilara, parallels, structure, and other metadata. | Jun 12, 2022 | |
Java | 4 | PDF-Zensor can be used to censor PDF-files. As such it strips annotations and metadata as … | Sep 07, 2022 |