Ecosyste.ms: Packages
An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.
pypi.org "html-parsing" keyword
Top 2.7% on pypi.org
7 versions - Latest release: 7 days ago - 7 dependent packages - 43 dependent repositories - 538 thousand downloads last month - 682 stars on GitHub - 1 maintainer
justext 3.0.1 💰
Heuristic based boilerplate removal tool7 versions - Latest release: 7 days ago - 7 dependent packages - 43 dependent repositories - 538 thousand downloads last month - 682 stars on GitHub - 1 maintainer
Top 3.8% on pypi.org
54 versions - Latest release: about 1 month ago - 5 dependent packages - 50 dependent repositories - 1.1 million downloads last month - 106 stars on GitHub - 1 maintainer
htmldate 1.8.1
Fast and robust extraction of original and updated publication dates from URLs and web pages.54 versions - Latest release: about 1 month ago - 5 dependent packages - 50 dependent repositories - 1.1 million downloads last month - 106 stars on GitHub - 1 maintainer
procyclingstats 0.1.10
A Python API wrapper for procyclingstats.com9 versions - Latest release: about 2 months ago - 1 dependent repositories - 147 downloads last month - 43 stars on GitHub - 1 maintainer
tana2tree 1.1.19
Parses Tanagra description into usable formats.30 versions - Latest release: over 3 years ago - 1 dependent repositories - 162 downloads last month - 0 stars on GitHub - 1 maintainer
scrapy-beautifulsoup 0.0.2
Simple Scrapy middleware to process non-well-formed HTML with BeautifulSoup2 versions - Latest release: over 7 years ago - 1 dependent repositories - 811 downloads last month - 20 stars on GitHub - 1 maintainer
django-janitor 0.5.1
django-janitor allows you to use bleach to clean HTML stored in a Model's field.11 versions - Latest release: over 6 years ago - 2 dependent repositories - 50 downloads last month - 6 stars on GitHub - 1 maintainer
Top 3.4% on pypi.org
21 versions - Latest release: about 10 years ago - 2 dependent packages - 212 dependent repositories - 225 thousand downloads last month - 203 stars on GitHub - 1 maintainer
breadability 0.1.20
Port of Readability HTML parser in Python21 versions - Latest release: about 10 years ago - 2 dependent packages - 212 dependent repositories - 225 thousand downloads last month - 203 stars on GitHub - 1 maintainer
beautifulsoup4-slurp 0.0.2
Slurp packages Beautifulsoup4 into command line.2 versions - Latest release: almost 9 years ago - 7 dependent repositories - 97 downloads last month - 8 stars on GitHub - 1 maintainer
Related Keywords
python
4
web-scraping
3
text-extraction
2
html-extraction
2
html
2
parsing
2
beautifulsoup4
1
data-structures
1
python3
1
tanagra
1
scrapy
1
beautifulsoup
1
django
1
whitelist
1
bookie
1
breadability
1
content
1
HTML
1
readability
1
readable
1
html-extractor
1
text-mining
1
bookmarks
1
cli-utilities
1
netscape
1
html-parser
1
datetime
1
date-parser
1
entity-extraction
1
metadata-extraction
1
webarchives
1
date
1
information-extraction
1
lxml
1
metadata
1
natural-language-processing
1
nlp
1
time
1
webscraping
1
cycling
1
cycling-stats
1
procyclingstats
1
scraper
1
sports-analytics
1
python-package
1