An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "web-scraping" keyword

View the packages on the pypi.org package registry that are tagged with the "web-scraping" keyword.

gallery-thief 1.1.0
Simple python package for scraping images from different search engines by prompt.
6 versions - Latest release: over 2 years ago - 40 downloads last month - 0 stars on GitHub - 1 maintainer
Top 0.4% on pypi.org
scrapy 2.14.0
A high-level Web Crawling and Web Scraping framework
106 versions - Latest release: 2 days ago - 136 dependent packages - 2,753 dependent repositories - 3.63 million downloads last month - 52,897 stars on GitHub - 4 maintainers
ragoon 0.0.15 💰
RAGoon : High level library for batched embeddings generation, blazingly-fast web-based RAG and q...
15 versions - Latest release: about 1 year ago - 65 downloads last month - 0 stars on GitHub - 1 maintainer
anime-api-scraper 1.0.1
Robust scraper for extracting information from anime sites, starting with AnimeFlv
2 versions - Latest release: 5 months ago - 24 downloads last month - 0 stars on GitHub - 1 maintainer
minner 0.1.4
Scrapy, a fast high-level web crawling & scraping framework for Python.
13 versions - Latest release: about 1 year ago - 52 downloads last month - 52,897 stars on GitHub - 1 maintainer
cesail 0.2.3
A comprehensive web automation and DOM parsing platform with AI-powered agents
6 versions - Latest release: 4 months ago - 26 downloads last month - 0 stars on GitHub - 1 maintainer
hdistill 1.0.0
CLI tool for parsing HTML
9 versions - Latest release: over 4 years ago - 1 dependent repository - 36 downloads last month - 0 stars on GitHub - 1 maintainer
scrapfly-sdk 0.8.23
Scrapfly SDK for Scrapfly
43 versions - Latest release: 8 months ago - 2 dependent repositories - 98.5 thousand downloads last month - 48 stars on GitHub - 1 maintainer
parse-utils-yogen48 0.0.5
Page Parser Utils For scraping
3 versions - Latest release: over 6 years ago - 1 dependent repository - 12 downloads last month - 2 stars on GitHub - 1 maintainer
tiny-web-crawler 0.5.0
A simple and efficient web crawler in Python.
10 versions - Latest release: over 1 year ago - 70 downloads last month - 62 stars on GitHub - 1 maintainer
py-easy-scrap 0.1.3
A useful package for web scraping with Selenium
4 versions - Latest release: over 2 years ago - 150 downloads last month - 1 star on GitHub - 1 maintainer
crawl4ai-news-fetcher 0.1.0
A specialized news content fetcher with redirect resolution built on crawl4ai
1 version - Latest release: 2 months ago - 13 downloads last month - 1 maintainer
selectorllm 0.0.2
Add your description here
2 versions - Latest release: 2 months ago - 20 downloads last month - 1 maintainer
tokopaedi-async 0.1.2
High-performance Async Python scraper for Tokopedia (Fork of tokopaedi)
3 versions - Latest release: about 1 month ago - 56 downloads last month
thisisapogreq 21.3.3 💰
Faster & simpler requests replacement for Python
1 version - Latest release: almost 4 years ago - 1 dependent repository - 6 downloads last month - 1,115 stars on GitHub - 1 maintainer
gg-scrape 0.1.8
A little Python CLI app that provides a summary of League champion build information.
7 versions - Latest release: almost 5 years ago - 1 dependent repository - 34 downloads last month - 6 stars on GitHub - 1 maintainer
fodio 1.0.1
A scraping library made to be simple and asynchronous
1 version - Latest release: over 7 years ago - 1 dependent repository - 8 downloads last month - 3 stars on GitHub - 1 maintainer
markupever 0.3.2
The fast, most optimal, and correct HTML & XML parsing library.
8 versions - Latest release: 2 months ago - 2.23 thousand downloads last month - 25 stars on GitHub - 1 maintainer
crawl4ai-mcp-sse-stdio 1.1.0
MCP (Model Context Protocol) server for Crawl4AI - Universal web crawling and data extraction
10 versions - Latest release: 4 months ago - 64 downloads last month - 3 stars on GitHub - 1 maintainer
Top 1.9% on pypi.org
google-search-results 2.4.2
Scrape and search localized results from Google, Bing, Baidu, Yahoo, Yandex, Ebay, Homedepot, you...
41 versions - Latest release: almost 3 years ago - 113 dependent packages - 2,025 dependent repositories - 775 thousand downloads last month - 459 stars on GitHub - 3 maintainers
multi-webbing 0.3.0
A multi-threaded library for web scraping in Python, built upon the Python threading. Supports sel...
3 versions - Latest release: almost 5 years ago - 1 dependent repository - 19 downloads last month - 3 stars on GitHub - 1 maintainer
twat-search 2.7.0
Web search plugin for twat
20 versions - Latest release: 10 months ago - 93 downloads last month - 0 stars on GitHub - 1 maintainer
proxar 0.7.0
A Python client for fetching public proxies from multiple sources.
8 versions - Latest release: 6 months ago - 30 downloads last month - 0 stars on GitHub - 1 maintainer
scrapeai 0.3.0
A Python library to scrape web data using LLMs and Selenium
2 versions - Latest release: over 1 year ago - 17 downloads last month - 0 stars on GitHub - 1 maintainer
Top 1.3% on pypi.org
curl-cffi 0.14.0 💰
libcurl ffi bindings for Python, with impersonation support.
83 versions - Latest release: 23 days ago - 56 dependent packages - 155 dependent repositories - 11.5 million downloads last month - 1,838 stars on GitHub - 1 maintainer
openintel 1.0.0
A comprehensive tool to detect technology stacks used by websites
1 version - Latest release: 6 months ago - 10 downloads last month - 4 stars on GitHub - 1 maintainer
Top 1.7% on pypi.org
trafilatura 2.0.0 💰
Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction...
50 versions - Latest release: about 1 year ago - 71 dependent packages - 63 dependent repositories - 1.9 million downloads last month - 5,095 stars on GitHub - 1 maintainer
whosyouragent 2.0.1
Self updating package for generating random user agent strings.
22 versions - Latest release: about 1 year ago - 6 dependent packages - 1 dependent repository - 185 downloads last month - 1 star on GitHub - 1 maintainer
comic-scraper 0.9.0
Scrapes comics and manga and creates CBZ (or PDF) files for offline reading
5 versions - Latest release: almost 8 years ago - 40 downloads last month - 19 stars on GitHub - 1 maintainer
drissionpage-expend 1.0.1
DrissionPage XHR request extension library, supporting multiple data types and request methods
2 versions - Latest release: 6 months ago - 23 downloads last month - 1 maintainer
kaggle-discussion-extractor 1.3.0
A professional-grade tool for extracting and analyzing discussions from Kaggle competitions
17 versions - Latest release: 2 months ago - 119 downloads last month - 1 stars on GitHub - 1 maintainer
ssai 1.0.10
SuperSummarizeAI is a versatile Python tool designed to extract and summarize textual content. Wh...
9 versions - Latest release: over 2 years ago - 44 downloads last month - 13 stars on GitHub - 1 maintainer
scsapi 0.1.1
A third-party API to connect to, browse, and interact with SpeedCubeShop.
3 versions - Latest release: over 6 years ago - 1 dependent repository - 15 downloads last month - 0 stars on GitHub - 1 maintainer
vodscrepe 2.1.0
https://vods.co/ vod scraper
24 versions - Latest release: almost 5 years ago - 1 dependent repository - 102 downloads last month - 2 stars on GitHub - 1 maintainer
firecrawl-simple-client 0.1.3
Python client for Firecrawl-Simple
4 versions - Latest release: about 1 year ago - 22 downloads last month - 1 star on GitHub - 1 maintainer
sitemap-mcp-server 0.1.3
Sitemap MCP is a Model Context Protocol (MCP) server for fetching, parsing, analyzing, and visual...
3 versions - Latest release: 9 months ago - 118 downloads last month - 5 stars on GitHub - 1 maintainer
tourist 0.2.5
Tourist Framework
26 versions - Latest release: 2 days ago - 458 downloads last month - 3 stars on GitHub - 1 maintainer
mcp-server-xfetch 0.1.1
Enhanced MCP fetch server (fetch on steroids) that works through xfetch.ai service. Unlocks acces...
3 versions - Latest release: 9 months ago - 15 downloads last month - 1 maintainer
Top 8.5% on pypi.org
facebook-page-scraper 5.0.6
Python package to scrape the front end of Facebook pages with no limitations
29 versions - Latest release: over 1 year ago - 2 dependent repositories - 754 downloads last month - 261 stars on GitHub - 1 maintainer
undoom-douyin-data-analysis 0.1.3
Douyin data analysis MCP server - provides collection, analysis, and export of Douyin video and user data
4 versions - Latest release: 5 months ago - 105 downloads last month - 4 stars on GitHub - 1 maintainer
vinted-api-kit 0.1.0
Lightweight asynchronous Python client library for accessing Vinted API and scraping item data.
2 versions - Latest release: 5 months ago - 550 downloads last month - 4 stars on GitHub - 1 maintainer
httpz-scanner 2.1.9
Hyper-fast HTTP Scraping Tool
27 versions - Latest release: 11 months ago - 239 downloads last month - 4 stars on GitHub - 1 maintainer
digidownload 1.0.5
API to download books from digi4school.at.
8 versions - Latest release: over 2 years ago - 65 downloads last month - 1 star on GitHub - 1 maintainer
souperscraper 1.0.2
A simple web scraper base combining Beautiful Soup and Selenium
3 versions - Latest release: over 1 year ago - 67 downloads last month - 3 stars on GitHub - 1 maintainer
scrapy-qfm 2.11.2
A high-level Web Crawling and Web Scraping framework
2 versions - Latest release: about 1 year ago - 22 downloads last month - 59,287 stars on GitHub - 1 maintainer
bf-scrapy-base 0.0.6
Custom development built on top of Scrapy
6 versions - Latest release: 4 months ago - 20 downloads last month - 59,287 stars on GitHub - 1 maintainer
Top 9.5% on pypi.org
pylab-utils 0.5
python utility tools
2 versions - Latest release: over 4 years ago - 1 dependent repository - 16 downloads last month - 59,287 stars on GitHub - 1 maintainer
aminer-scrapy 2.11.1
A high-level Web Crawling and Web Scraping framework
5 versions - Latest release: almost 2 years ago - 68 downloads last month - 52,303 stars on GitHub - 1 maintainer
scrapy-hls 0.1
scrapy integration for m3u8 files
1 version - Latest release: over 4 years ago - 1 dependent repository - 9 downloads last month - 52,248 stars on GitHub - 1 maintainer
pyrua 1.0.0
Professional Random User-Agent Generator for web scraping and browser simulation
2 versions - Latest release: about 1 month ago - 169 downloads last month - 1 maintainer
bostas 0.0.2
Tool for social media automation
2 versions - Latest release: almost 5 years ago - 1 dependent repository - 17 downloads last month - 3 stars on GitHub - 1 maintainer
nudecrawler 0.3.28
Crawl telegra.ph searching for nudes!
48 versions - Latest release: over 1 year ago - 202 downloads last month - 334 stars on GitHub - 1 maintainer
webshotr 1.0.0 💰
A simple and fast website screenshot tool - WebShotr
1 version - Latest release: 7 months ago - 10 downloads last month - 3 stars on GitHub - 1 maintainer
thordata-sdk 1.0.1
The Official Python SDK for Thordata - AI Data Infrastructure & Proxy Network.
12 versions - Latest release: 3 days ago - 737 downloads last month - 1 star on GitHub - 1 maintainer
campbells 0.3.0
A condensed web scraping library.
8 versions - Latest release: over 2 years ago - 1 dependent repository - 53 downloads last month - 0 stars on GitHub - 1 maintainer
ucsdasiga 1.0.0a7
A simple API wrapper and web scraper for the UCSD Associated Students (AS) instructor grade archive.
4 versions - Latest release: over 2 years ago - 29 downloads last month - 1 star on GitHub - 1 maintainer
soupsavvy 1.1.0
Powerful and flexible web scraping Search Engine
19 versions - Latest release: 2 months ago - 278 downloads last month - 9 stars on GitHub - 1 maintainer
changedetection.io 0.51.4 💰
Website change detection and monitoring service, detect changes to web pages and send alerts/noti...
170 versions - Latest release: about 1 month ago - 13.2 thousand downloads last month - 29,215 stars on GitHub - 1 maintainer
reddit-multimodal-crawler 1.3.2
A scraper which will scrape out multimedia data from reddit.
4 versions - Latest release: about 3 years ago - 15 downloads last month - 11 stars on GitHub - 1 maintainer
idealista-scraper 1.0.0
Production web scraper for Idealista real estate listings
1 version - Latest release: about 1 month ago - 14 downloads last month - 1 maintainer
cbbpy 2.1.2
A Python-based web scraper for NCAA basketball.
15 versions - Latest release: 12 months ago - 2.25 thousand downloads last month - 11 stars on GitHub - 1 maintainer
Top 5.2% on pypi.org
sbase 4.45.8
A complete web automation framework for end-to-end testing.
727 versions - Latest release: 8 days ago - 2 dependent repositories - 6.16 thousand downloads last month - 4,950 stars on GitHub - 1 maintainer
selenium-base 4.45.8
A complete web automation framework for end-to-end testing.
680 versions - Latest release: 8 days ago - 29.4 thousand downloads last month - 4,950 stars on GitHub - 1 maintainer
pastebin-bisque 1.0.0
Scrape all public Pastebin pastes from a user.
1 version - Latest release: over 2 years ago - 26 downloads last month - 32 stars on GitHub - 1 maintainer
flyto-core 1.8.14
Atomic workflow automation modules for git-native automation
50 versions - Latest release: 3 days ago - 4.46 thousand downloads last month - 0 stars on GitHub - 1 maintainer
crewai-olostep 0.1.1
CrewAI tools for web scraping and research using Olostep API
2 versions - Latest release: about 1 month ago - 48 downloads last month - 1 maintainer
sentinel-core 0.1.7
Self-Healing Knowledge Graph for RAG Pipelines - pip-installable library
8 versions - Latest release: 3 days ago - 94 downloads last month - 1 maintainer
haystack-brightdata 0.1.0
Bright Data integration for Haystack - web scraping, SERP API, and data extraction from 45+ websites
1 version - Latest release: 3 days ago - 1 maintainer
tarzi 0.1.9
Rust-native lite search for AI applications
10 versions - Latest release: 3 days ago - 322 downloads last month - 2 stars on GitHub - 1 maintainer
whizoai 1.0.0
Official WhizoAI SDK for Python - Enterprise-grade web scraping API client
1 version - Latest release: 3 months ago - 9 downloads last month - 1 maintainer
py-easy-scrape 0.1.6
A useful package for web scraping with Selenium
4 versions - Latest release: over 2 years ago - 20 downloads last month - 1 star on GitHub - 1 maintainer
owl-browser 1.2.4
AI-first browser automation SDK with on-device vision model and natural language selectors
4 versions - Latest release: 4 days ago - 321 downloads last month - 1 maintainer
scrapling 0.3.14 💰
Scrapling is an undetectable, powerful, flexible, high-performance Python library that makes Web ...
37 versions - Latest release: 4 days ago - 17.2 thousand downloads last month - 8,331 stars on GitHub - 1 maintainer
Top 7.7% on pypi.org
wayback-machine-scraper 1.0.8
A command-line utility for scraping Wayback Machine snapshots from archive.org.
6 versions - Latest release: almost 5 years ago - 4 dependent repositories - 159 downloads last month - 454 stars on GitHub - 2 maintainers
torcrawl 1.35
A Python script to crawl and extract (regular or onion) webpages through the Tor network.
3 versions - Latest release: 4 days ago - 203 downloads last month - 399 stars on GitHub - 1 maintainer
pyvigate 0.0.3 💰
A brief description of what your package does
4 versions - Latest release: almost 2 years ago - 23 downloads last month - 26 stars on GitHub - 1 maintainer
scrapy-rnet 0.0.5
A blazing-fast Python HTTP Client with TLS/HTTP2 fingerprint
5 versions - Latest release: 7 months ago - 55 downloads last month - 306 stars on GitHub - 1 maintainer
Top 2.8% on pypi.org
finvizfinance 1.3.0 💰
Finviz Finance. Information downloader.
63 versions - Latest release: 4 days ago - 14 dependent packages - 86 dependent repositories - 76.4 thousand downloads last month - 781 stars on GitHub - 1 maintainer
crypto-exchange-news-crawler 0.1.9
Cryptocurrency exchange announcement news crawler for major crypto exchanges
14 versions - Latest release: 7 months ago - 29 downloads last month - 4 stars on GitHub - 1 maintainer
detect-expert-client 1.0.0
Python client for detect.expert DNS checking service with Cloudflare bypass
1 version - Latest release: 4 days ago - 1 maintainer
bluemoss 1.0.0
bluemoss enables you to easily scrape websites.
18 versions - Latest release: about 2 years ago - 61 downloads last month - 7 stars on GitHub - 1 maintainer
openatlas 1.0.7
An autonomous browser agent with web search and interactive browsing capabilities
8 versions - Latest release: 3 months ago - 506 downloads last month - 1 maintainer
scrapy-beautifulsoup 0.0.2
Simple Scrapy middleware to process non-well-formed HTML with BeautifulSoup
2 versions - Latest release: over 9 years ago - 1 dependent repository - 57 downloads last month - 21 stars on GitHub - 1 maintainer
tabulate-html 0.1.1
A robust HTML table parser for Python with full rowspan/colspan and multi-header support
1 version - Latest release: 5 days ago - 1 maintainer
webrover 0.1.12
Generate high-quality datasets from web content for AI training
11 versions - Latest release: about 1 year ago - 23 downloads last month - 6 stars on GitHub - 1 maintainer
flamingtext 0.0.7
Unofficial API of flamingtext.com
7 versions - Latest release: almost 3 years ago - 22 downloads last month - 3 stars on GitHub - 1 maintainer
torvend 0.0.1
A framework for public torrent vendor scrapers
2 versions - Latest release: about 8 years ago - 1 dependent repository - 64 downloads last month - 1 star on GitHub - 1 maintainer
vigorish 0.7.0
Hybrid Python/Node.js web scraper for Major League Baseball (MLB) data.
48 versions - Latest release: over 3 years ago - 1 dependent repository - 46 downloads last month - 2 stars on GitHub - 1 maintainer
Top 6.7% on pypi.org
basketball_reference_web_scraper 4.15.4
A Basketball Reference client that generates data by scraping the website
48 versions - Latest release: 5 months ago - 3 dependent repositories - 4.22 thousand downloads last month - 401 stars on GitHub - 1 maintainer
scrape-and-ntfy 0.1.2
An extremely customizable web scraper with a modular notification system and persistent storage v...
3 versions - Latest release: over 1 year ago - 17 downloads last month - 1 star on GitHub - 1 maintainer
manga-scraper 0.12
Download Manga into chapterwise PDF files
1 version - Latest release: about 4 years ago - 1 dependent repository - 18 downloads last month - 4 stars on GitHub - 1 maintainer
substack2md 0.1.1
A CAPTCHA-safe Python scraper with Cloudflare bypass that downloads Substack posts and converts t...
2 versions - Latest release: 7 months ago - 30 downloads last month - 0 stars on GitHub - 1 maintainer
pywebber 5.0
Common tools employed in web development
3 versions - Latest release: over 7 years ago - 1 dependent repository - 26 downloads last month - 1 star on GitHub - 1 maintainer
Top 7.0% on pypi.org
tableauscraper 0.1.29
Python library to scrape data from Tableau viz
39 versions - Latest release: about 4 years ago - 8 dependent repositories - 18.7 thousand downloads last month - 136 stars on GitHub - 1 maintainer
ambi-alert 0.0.2
This is a reverse search tool. Agentic Alerting
1 version - Latest release: 11 months ago - 23 downloads last month - 2 stars on GitHub - 1 maintainer
manga-down 0.1.2
Python package to download manga available on mangareader and mangapanda
3 versions - Latest release: almost 5 years ago - 1 dependent repository - 16 downloads last month - 5 stars on GitHub - 1 maintainer
scrapely-client 1.0.1
Python client for Scrapely browser automation service - simple, intuitive web scraping and automa...
2 versions - Latest release: about 1 month ago - 1 maintainer
googlesearch-tool 1.1.3
A Python library for performing Google searches with support for dynamic query parameters, result...
9 versions - Latest release: 8 months ago - 52 downloads last month - 1 maintainer
pinscrape 5.0.0
Pinterest | a simple data scraper for pinterest
20 versions - Latest release: 4 months ago - 1 dependent repository - 683 downloads last month - 113 stars on GitHub - 1 maintainer
html2rss-ai 0.3.2
🚀 AI-powered web scraping with modern CSS support. Extract content from any website using GPT-4, ...
4 versions - Latest release: 6 months ago - 18 downloads last month - 1 maintainer