pdfx

Extract text, metadata and references (pdf, url, doi, arxiv) from PDF. Optionally download all referenced PDFs.

Stars

1013

Forks

118

Language

Python

Last Updated

Feb 06, 2024

Similar Repos