trafilatura

Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments

Stars

2934

Forks

223

Language

Python

Last Updated

May 16, 2024

Similar Repos