An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "html-parsing" keyword

View the packages on the pypi.org package registry that are tagged with the "html-parsing" keyword.

Top 3.8% on pypi.org
htmldate 1.9.3 💰
Fast and robust extraction of original and updated publication dates from URLs and web pages.
58 versions - Latest release: 9 months ago - 5 dependent packages - 50 dependent repositories - 4.52 million downloads last month - 142 stars on GitHub - 1 maintainer
scrapling 0.3.6 💰
Scrapling is an undetectable, powerful, flexible, high-performance Python library that makes Web ...
29 versions - Latest release: about 21 hours ago - 19.2 thousand downloads last month - 7,383 stars on GitHub - 1 maintainer
kryptone 6.0.0
Kryptone is a hight level web scapper dedicated to marketers and wrapped around the Selenium libr...
2 versions - Latest release: 8 months ago - 12 downloads last month - 0 stars on GitHub - 1 maintainer
Top 2.7% on pypi.org
justext 3.0.2 💰
Heuristic based boilerplate removal tool
8 versions - Latest release: 7 months ago - 7 dependent packages - 43 dependent repositories - 1.33 million downloads last month - 797 stars on GitHub - 1 maintainer
Top 3.4% on pypi.org
breadability 0.1.20
Port of Readability HTML parser in Python
21 versions - Latest release: over 11 years ago - 2 dependent packages - 212 dependent repositories - 67 thousand downloads last month - 204 stars on GitHub - 1 maintainer
scrapy-beautifulsoup 0.0.2
Simple Scrapy middleware to process non-well-formed HTML with BeautifulSoup
2 versions - Latest release: about 9 years ago - 1 dependent repositories - 62 downloads last month - 21 stars on GitHub - 1 maintainer
django-janitor 0.5.1
django-janitor allows you to use bleach to clean HTML stored in a Model's field.
11 versions - Latest release: almost 8 years ago - 2 dependent repositories - 41 downloads last month - 6 stars on GitHub - 1 maintainer
procyclingstats 0.2.7
A Python API wrapper for procyclingstats.com
17 versions - Latest release: 17 days ago - 1 dependent repositories - 1.01 thousand downloads last month - 51 stars on GitHub - 1 maintainer
typed-soup 0.1.6 💰
A type-safe wrapper around BeautifulSoup and related HTML parsing utilities
6 versions - Latest release: 4 months ago - 92 downloads last month - 0 stars on GitHub - 1 maintainer
tana2tree 1.1.19
Parses Tanagra description into usable formats.
30 versions - Latest release: almost 5 years ago - 1 dependent repositories - 32 downloads last month - 0 stars on GitHub - 1 maintainer
beautifulsoup4-slurp 0.0.2
Slurp packages Beautifulsoup4 into command line.
2 versions - Latest release: over 10 years ago - 7 dependent repositories - 51 downloads last month - 8 stars on GitHub - 1 maintainer