An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "html-parser" keyword

allscrape 0.1.4
Easy web scraping with built-in Cloudflare bypass
1 version - Latest release: 6 months ago - 15 downloads last month - 1 maintainer
wizardhtml 1.0.1
WHATWG-compliant HTML5 toolkit: DFA tokenizer, spec-guided tree builder, DOM, configurable serial...
2 versions - Latest release: 8 months ago - 28 downloads last month - 1 stars on GitHub - 1 maintainer
sec-parser 0.58.1
Parse SEC EDGAR HTML documents into a tree of elements that correspond to the visual structure of...
88 versions - Latest release: almost 2 years ago - 1 dependent package - 1 dependent repositories - 79.3 thousand downloads last month - 237 stars on GitHub - 1 maintainer
webpage-reader 0.0.4
Reads a webpage and extracts the information out of it, based on the HTML5 tags/classes
2 versions - Latest release: over 7 years ago - 1 dependent repositories - 14 downloads last month - 0 stars on GitHub - 1 maintainer
llmparser 0.1.0
Extract structured, LLM-ready content from any website — adaptive engine, no LLMs required
1 version - Latest release: 2 months ago - 21 downloads last month - 1 maintainer
pyxml3 0.0.4
Pure python3 Alternative to stdlib xml.etree with HTML support
4 versions - Latest release: over 2 years ago - 1 dependent repositories - 3.23 thousand downloads last month - 1 stars on GitHub - 1 maintainer
hap 1.3.2
A simple HTML scraping tool
16 versions - Latest release: over 7 years ago - 1 dependent repositories - 61 downloads last month - 1 stars on GitHub - 1 maintainer
newsatlas 0.1.0
A smart news list and article content extraction library
1 version - Latest release: 3 months ago - 25 downloads last month - 1 maintainer
bookmarkdown 0.1.1
Parse your browser's exported HTML bookmark file to Markdown.
2 versions - Latest release: over 4 years ago - 1 dependent repositories - 21 downloads last month - 18 stars on GitHub - 1 maintainer
apifier 2.1.0
A web parser for tabular and/or paginated data
5 versions - Latest release: about 9 years ago - 1 dependent repositories - 36 downloads last month - 6 stars on GitHub - 1 maintainer
qarsmac 0.0.3
Cliente Python da API de Qualidade do Ar da SMAC.
3 versions - Latest release: 12 months ago - 18 downloads last month - 0 stars on GitHub - 1 maintainer
bubt-routinepy 1.0.0
An unofficial Python wrapper of the BUBT Routine API + a robust web scraper and PDF extractor for...
1 version - Latest release: 11 months ago - 16 downloads last month - 0 stars on GitHub - 1 maintainer
dedoc 2.6.1
Extract content and logical tree structure from textual documents
33 versions - Latest release: 4 months ago - 1.21 thousand downloads last month - 592 stars on GitHub - 1 maintainer
parse-utils 1.3.6
Page Parser Utils For scraping, List index update
16 versions - Latest release: over 4 years ago - 1 dependent repositories - 51 downloads last month - 2 stars on GitHub - 1 maintainer
parse-utils-yogen48 0.0.5
Page Parser Utils For scraping
3 versions - Latest release: over 6 years ago - 1 dependent repositories - 12 downloads last month - 2 stars on GitHub - 1 maintainer
alterx 0.0.4
A powerful file processing toolkit for batch transformations of HTML, JSON, TOML, XML, and YAML f...
2 versions - Latest release: 10 months ago - 71 downloads last month - 0 stars on GitHub - 1 maintainer
muslimnamesgenerator 0.1
MuslimNamesGenerator is application to generate and search muslim names.
1 version - Latest release: almost 7 years ago - 1 dependent repositories - 20 downloads last month - 3 stars on GitHub - 1 maintainer
semantic-dom-ssg 0.2.0
Machine-readable web semantics for AI agents. O(1) lookup, deterministic navigation, token-effici...
2 versions - Latest release: 3 months ago - 58 downloads last month - 1 maintainer
trace-scraper 1.0.0
High-performance async web scraper with Playwright and httpx. Scrape JavaScript-rendered pages, c...
1 version - Latest release: 4 months ago - 21 downloads last month - 1 maintainer
yirabot 1.0.9
YiraBot: Simplifying Web Scraping for All. A user-friendly tool for developers and enthusiasts, o...
20 versions - Latest release: about 2 years ago - 109 downloads last month - 19 stars on GitHub - 1 maintainer
pyhtmltext 0.1.0
Usefull tool for extracting text and sentences from html
6 versions - Latest release: over 3 years ago - 41 downloads last month - 1 stars on GitHub - 1 maintainer
dompy-parser 0.1.1
JavaScript Dom Api for Python, Html Parser and a Web scraping library
2 versions - Latest release: over 6 years ago - 1 dependent repositories - 16 downloads last month - 3 stars on GitHub - 1 maintainer
harser 0.0.3
Easy way for HTML parsing and building XPath.
4 versions - Latest release: over 9 years ago - 1 dependent repositories - 18 downloads last month - 138 stars on GitHub - 1 maintainer
Top 2.7% on pypi.org
justext 3.0.2 💰
Heuristic based boilerplate removal tool
8 versions - Latest release: about 1 year ago - 7 dependent packages - 43 dependent repositories - 4.44 million downloads last month - 797 stars on GitHub - 1 maintainer
Top 8.3% on pypi.org
advancedhtmlparser 9.0.2
A Powerful HTML Parser/Scraper/Validator/Formatter that constructs a modifiable, searchable DOM t...
51 versions - Latest release: about 3 years ago - 2 dependent repositories - 5.31 thousand downloads last month - 102 stars on GitHub - 1 maintainer
fast-scrape 0.2.5
High-performance HTML parsing library for Python
13 versions - Latest release: about 1 month ago - 3.98 thousand downloads last month - 3 stars on GitHub - 1 maintainer
htmtl 0.2.0
A templating language that is both a superset and subset of HTML.
2 versions - Latest release: over 1 year ago - 13 downloads last month - 4 stars on GitHub - 1 maintainer
Top 3.4% on pypi.org
pywebcopy 7.0.2
Python library to clone/archive pages or sites from the Internet.
18 versions - Latest release: almost 4 years ago - 2 dependent packages - 40 dependent repositories - 3.27 thousand downloads last month - 620 stars on GitHub - 1 maintainer
pydhtmlparser 2.2.3 💰
Python HTML/XML parser for easy web scraping.
28 versions - Latest release: about 6 years ago - 8 dependent repositories - 44 downloads last month - 6 stars on GitHub - 1 maintainer
pgreaper 1.0.0a2
A simple, flexible, and robust wrapper around the Postgres COPY command. Supports loading CSV/JSO...
2 versions - Latest release: over 8 years ago - 1 dependent repositories - 31 downloads last month - 12 stars on GitHub - 1 maintainer