pypi.org "web-scraping" keyword
View the packages on the pypi.org package registry that are tagged with the "web-scraping" keyword.
gallery-thief 1.1.0
Simple python package for scraping images from different search engines by prompt.6 versions - Latest release: over 2 years ago - 40 downloads last month - 0 stars on GitHub - 1 maintainer
Top 0.4% on pypi.org
106 versions - Latest release: 2 days ago - 136 dependent packages - 2,753 dependent repositories - 3.63 million downloads last month - 52,897 stars on GitHub - 4 maintainers
scrapy 2.14.0
A high-level Web Crawling and Web Scraping framework106 versions - Latest release: 2 days ago - 136 dependent packages - 2,753 dependent repositories - 3.63 million downloads last month - 52,897 stars on GitHub - 4 maintainers
ragoon 0.0.15 💰
RAGoon : High level library for batched embeddings generation, blazingly-fast web-based RAG and q...15 versions - Latest release: about 1 year ago - 65 downloads last month - 0 stars on GitHub - 1 maintainer
anime-api-scraper 1.0.1
Scraper robusto para extraer información de sitios de anime, comenzando con AnimeFlv2 versions - Latest release: 5 months ago - 24 downloads last month - 0 stars on GitHub - 1 maintainer
minner 0.1.4
Scrapy, a fast high-level web crawling & scraping framework for Python.13 versions - Latest release: about 1 year ago - 52 downloads last month - 52,897 stars on GitHub - 1 maintainer
cesail 0.2.3
A comprehensive web automation and DOM parsing platform with AI-powered agents6 versions - Latest release: 4 months ago - 26 downloads last month - 0 stars on GitHub - 1 maintainer
hdistill 1.0.0
CLI tool for parsing HTML9 versions - Latest release: over 4 years ago - 1 dependent repositories - 36 downloads last month - 0 stars on GitHub - 1 maintainer
scrapfly-sdk 0.8.23
Scrapfly SDK for Scrapfly43 versions - Latest release: 8 months ago - 2 dependent repositories - 98.5 thousand downloads last month - 48 stars on GitHub - 1 maintainer
parse-utils-yogen48 0.0.5
Page Parser Utils For scraping3 versions - Latest release: over 6 years ago - 1 dependent repositories - 12 downloads last month - 2 stars on GitHub - 1 maintainer
tiny-web-crawler 0.5.0
A simple and efficient web crawler in Python.10 versions - Latest release: over 1 year ago - 70 downloads last month - 62 stars on GitHub - 1 maintainer
py-easy-scrap 0.1.3
A useful package for web scraping with Selenium4 versions - Latest release: over 2 years ago - 150 downloads last month - 1 stars on GitHub - 1 maintainer
crawl4ai-news-fetcher 0.1.0
A specialized news content fetcher with redirect resolution built on crawl4ai1 version - Latest release: 2 months ago - 13 downloads last month - 1 maintainer
selectorllm 0.0.2
Add your description here2 versions - Latest release: 2 months ago - 20 downloads last month - 1 maintainer
tokopaedi-async 0.1.2
High-performance Async Python scraper for Tokopedia (Fork of tokopaedi)3 versions - Latest release: about 1 month ago - 56 downloads last month
thisisapogreq 21.3.3 💰
Faster & simpler requests replacement for Python1 version - Latest release: almost 4 years ago - 1 dependent repositories - 6 downloads last month - 1,115 stars on GitHub - 1 maintainer
gg-scrape 0.1.8
A little Python CLI app that provides a summary of League champion build information.7 versions - Latest release: almost 5 years ago - 1 dependent repositories - 34 downloads last month - 6 stars on GitHub - 1 maintainer
fodio 1.0.1
A scraping library made to be simple and asynchronous1 version - Latest release: over 7 years ago - 1 dependent repositories - 8 downloads last month - 3 stars on GitHub - 1 maintainer
markupever 0.3.2
The fast, most optimal, and correct HTML & XML parsing library.8 versions - Latest release: 2 months ago - 2.23 thousand downloads last month - 25 stars on GitHub - 1 maintainer
crawl4ai-mcp-sse-stdio 1.1.0
MCP (Model Context Protocol) server for Crawl4AI - Universal web crawling and data extraction10 versions - Latest release: 4 months ago - 64 downloads last month - 3 stars on GitHub - 1 maintainer
Top 1.9% on pypi.org
41 versions - Latest release: almost 3 years ago - 113 dependent packages - 2,025 dependent repositories - 775 thousand downloads last month - 459 stars on GitHub - 3 maintainers
google-search-results 2.4.2
Scrape and search localized results from Google, Bing, Baidu, Yahoo, Yandex, Ebay, Homedepot, you...41 versions - Latest release: almost 3 years ago - 113 dependent packages - 2,025 dependent repositories - 775 thousand downloads last month - 459 stars on GitHub - 3 maintainers
multi-webbing 0.3.0
A multi-threaded libary for web scraping in python, built upon the python threading. Supports sel...3 versions - Latest release: almost 5 years ago - 1 dependent repositories - 19 downloads last month - 3 stars on GitHub - 1 maintainer
twat-search 2.7.0
Web search plugin for twat20 versions - Latest release: 10 months ago - 93 downloads last month - 0 stars on GitHub - 1 maintainer
proxar 0.7.0
A Python client for fetching public proxies from multiple sources.8 versions - Latest release: 6 months ago - 30 downloads last month - 0 stars on GitHub - 1 maintainer
scrapeai 0.3.0
A Python library to scrape web data using LLMs and Selenium2 versions - Latest release: over 1 year ago - 17 downloads last month - 0 stars on GitHub - 1 maintainer
Top 1.3% on pypi.org
83 versions - Latest release: 23 days ago - 56 dependent packages - 155 dependent repositories - 11.5 million downloads last month - 1,838 stars on GitHub - 1 maintainer
curl-cffi 0.14.0 💰
libcurl ffi bindings for Python, with impersonation support.83 versions - Latest release: 23 days ago - 56 dependent packages - 155 dependent repositories - 11.5 million downloads last month - 1,838 stars on GitHub - 1 maintainer
openintel 1.0.0
A comprehensive tool to detect technology stacks used by websites1 version - Latest release: 6 months ago - 10 downloads last month - 4 stars on GitHub - 1 maintainer
Top 1.7% on pypi.org
50 versions - Latest release: about 1 year ago - 71 dependent packages - 63 dependent repositories - 1.9 million downloads last month - 5,095 stars on GitHub - 1 maintainer
trafilatura 2.0.0 💰
Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction...50 versions - Latest release: about 1 year ago - 71 dependent packages - 63 dependent repositories - 1.9 million downloads last month - 5,095 stars on GitHub - 1 maintainer
whosyouragent 2.0.1
Self updating package for generating random user agent strings.22 versions - Latest release: about 1 year ago - 6 dependent packages - 1 dependent repositories - 185 downloads last month - 1 stars on GitHub - 1 maintainer
comic-scraper 0.9.0
Scraps comics,mangas and creates cbz (/pdf) files for offline reading5 versions - Latest release: almost 8 years ago - 40 downloads last month - 19 stars on GitHub - 1 maintainer
drissionpage-expend 1.0.1
DrissionPage XHR请求扩展库,支持多种数据类型和请求方式2 versions - Latest release: 6 months ago - 23 downloads last month - 1 maintainer
kaggle-discussion-extractor 1.3.0
A professional-grade tool for extracting and analyzing discussions from Kaggle competitions17 versions - Latest release: 2 months ago - 119 downloads last month - 1 stars on GitHub - 1 maintainer
ssai 1.0.10
SuperSummarizeAI is a versatile Python tool designed to extract and summarize textual content. Wh...9 versions - Latest release: over 2 years ago - 44 downloads last month - 13 stars on GitHub - 1 maintainer
scsapi 0.1.1
A third-party API to connect to, browse, and interact with SpeedCubeShop.3 versions - Latest release: over 6 years ago - 1 dependent repositories - 15 downloads last month - 0 stars on GitHub - 1 maintainer
vodscrepe 2.1.0
https://vods.co/ vod scraper24 versions - Latest release: almost 5 years ago - 1 dependent repositories - 102 downloads last month - 2 stars on GitHub - 1 maintainer
firecrawl-simple-client 0.1.3
Python client for Firecrawl-Simple4 versions - Latest release: about 1 year ago - 22 downloads last month - 1 stars on GitHub - 1 maintainer
sitemap-mcp-server 0.1.3
Sitemap MCP is a Model Context Protocol (MCP) server for fetching, parsing, analyzing, and visual...3 versions - Latest release: 9 months ago - 118 downloads last month - 5 stars on GitHub - 1 maintainer
tourist 0.2.5
Tourist Framework26 versions - Latest release: 2 days ago - 458 downloads last month - 3 stars on GitHub - 1 maintainer
mcp-server-xfetch 0.1.1
Enhanced MCP fetch server (fetch on steroids) that works through xfetch.ai service. Unlocks acces...3 versions - Latest release: 9 months ago - 15 downloads last month - 1 maintainer
Top 8.5% on pypi.org
29 versions - Latest release: over 1 year ago - 2 dependent repositories - 754 downloads last month - 261 stars on GitHub - 1 maintainer
facebook-page-scraper 5.0.6
Python package to scrap facebook's pages front end with no limitations29 versions - Latest release: over 1 year ago - 2 dependent repositories - 754 downloads last month - 261 stars on GitHub - 1 maintainer
undoom-douyin-data-analysis 0.1.3
抖音数据分析 MCP 服务器 - 提供抖音视频和用户数据的采集、分析和导出功能4 versions - Latest release: 5 months ago - 105 downloads last month - 4 stars on GitHub - 1 maintainer
vinted-api-kit 0.1.0
Lightweight asynchronous Python client library for accessing Vinted API and scraping item data.2 versions - Latest release: 5 months ago - 550 downloads last month - 4 stars on GitHub - 1 maintainer
httpz-scanner 2.1.9
Hyper-fast HTTP Scraping Tool27 versions - Latest release: 11 months ago - 239 downloads last month - 4 stars on GitHub - 1 maintainer
digidownload 1.0.5
API to download books from digi4school.at.8 versions - Latest release: over 2 years ago - 65 downloads last month - 1 stars on GitHub - 1 maintainer
souperscraper 1.0.2
A simple web scraper base combining Beautiful Soup and Selenium3 versions - Latest release: over 1 year ago - 67 downloads last month - 3 stars on GitHub - 1 maintainer
scrapy-qfm 2.11.2
A high-level Web Crawling and Web Scraping framework2 versions - Latest release: about 1 year ago - 22 downloads last month - 59,287 stars on GitHub - 1 maintainer
bf-scrapy-base 0.0.6
基于scrapy的二次开发6 versions - Latest release: 4 months ago - 20 downloads last month - 59,287 stars on GitHub - 1 maintainer
Top 9.5% on pypi.org
2 versions - Latest release: over 4 years ago - 1 dependent repositories - 16 downloads last month - 59,287 stars on GitHub - 1 maintainer
pylab-utils 0.5
python utility tools2 versions - Latest release: over 4 years ago - 1 dependent repositories - 16 downloads last month - 59,287 stars on GitHub - 1 maintainer
aminer-scrapy 2.11.1
A high-level Web Crawling and Web Scraping framework5 versions - Latest release: almost 2 years ago - 68 downloads last month - 52,303 stars on GitHub - 1 maintainer
scrapy-hls 0.1
scrapy integration for m3u8 files1 version - Latest release: over 4 years ago - 1 dependent repositories - 9 downloads last month - 52,248 stars on GitHub - 1 maintainer
pyrua 1.0.0
Professional Random User-Agent Generator for web scraping and browser simulation2 versions - Latest release: about 1 month ago - 169 downloads last month - 1 maintainer
bostas 0.0.2
Tool for social media automation2 versions - Latest release: almost 5 years ago - 1 dependent repositories - 17 downloads last month - 3 stars on GitHub - 1 maintainer
nudecrawler 0.3.28
Crawl telegra.ph searching for nudes!48 versions - Latest release: over 1 year ago - 202 downloads last month - 334 stars on GitHub - 1 maintainer
webshotr 1.0.0 💰
A simple and fast website screenshot tool - WebShotr1 version - Latest release: 7 months ago - 10 downloads last month - 3 stars on GitHub - 1 maintainer
thordata-sdk 1.0.1
The Official Python SDK for Thordata - AI Data Infrastructure & Proxy Network.12 versions - Latest release: 3 days ago - 737 downloads last month - 1 stars on GitHub - 1 maintainer
campbells 0.3.0
A condensed web scraping library.8 versions - Latest release: over 2 years ago - 1 dependent repositories - 53 downloads last month - 0 stars on GitHub - 1 maintainer
ucsdasiga 1.0.0a7
A simple API wrapper and web scraper for the UCSD Associated Students (AS) instructor grade archive.4 versions - Latest release: over 2 years ago - 29 downloads last month - 1 stars on GitHub - 1 maintainer
soupsavvy 1.1.0
Powerful and flexible web scraping Search Engine19 versions - Latest release: 2 months ago - 278 downloads last month - 9 stars on GitHub - 1 maintainer
changedetection.io 0.51.4 💰
Website change detection and monitoring service, detect changes to web pages and send alerts/noti...170 versions - Latest release: about 1 month ago - 13.2 thousand downloads last month - 29,215 stars on GitHub - 1 maintainer
reddit-multimodal-crawler 1.3.2
A scraper which will scrape out multimedia data from reddit.4 versions - Latest release: about 3 years ago - 15 downloads last month - 11 stars on GitHub - 1 maintainer
idealista-scraper 1.0.0
Production web scraper for Idealista real estate listings1 version - Latest release: about 1 month ago - 14 downloads last month - 1 maintainer
cbbpy 2.1.2
A Python-based web scraper for NCAA basketball.15 versions - Latest release: 12 months ago - 2.25 thousand downloads last month - 11 stars on GitHub - 1 maintainer
Top 5.2% on pypi.org
727 versions - Latest release: 8 days ago - 2 dependent repositories - 6.16 thousand downloads last month - 4,950 stars on GitHub - 1 maintainer
sbase 4.45.8
A complete web automation framework for end-to-end testing.727 versions - Latest release: 8 days ago - 2 dependent repositories - 6.16 thousand downloads last month - 4,950 stars on GitHub - 1 maintainer
selenium-base 4.45.8
A complete web automation framework for end-to-end testing.680 versions - Latest release: 8 days ago - 29.4 thousand downloads last month - 4,950 stars on GitHub - 1 maintainer
pastebin-bisque 1.0.0
Scrape all public Pastebin pastes from a user.1 version - Latest release: over 2 years ago - 26 downloads last month - 32 stars on GitHub - 1 maintainer
flyto-core 1.8.14
Atomic workflow automation modules for git-native automation50 versions - Latest release: 3 days ago - 4.46 thousand downloads last month - 0 stars on GitHub - 1 maintainer
crewai-olostep 0.1.1
CrewAI tools for web scraping and research using Olostep API2 versions - Latest release: about 1 month ago - 48 downloads last month - 1 maintainer
sentinel-core 0.1.7
Self-Healing Knowledge Graph for RAG Pipelines - pip-installable library8 versions - Latest release: 3 days ago - 94 downloads last month - 1 maintainer
haystack-brightdata 0.1.0
Bright Data integration for Haystack - web scraping, SERP API, and data extraction from 45+ websites1 version - Latest release: 3 days ago - 1 maintainer
tarzi 0.1.9
Rust-native lite search for AI applications10 versions - Latest release: 3 days ago - 322 downloads last month - 2 stars on GitHub - 1 maintainer
whizoai 1.0.0
Official WhizoAI SDK for Python - Enterprise-grade web scraping API client1 version - Latest release: 3 months ago - 9 downloads last month - 1 maintainer
py-easy-scrape 0.1.6
A useful package for web scraping with Selenium4 versions - Latest release: over 2 years ago - 20 downloads last month - 1 stars on GitHub - 1 maintainer
owl-browser 1.2.4
AI-first browser automation SDK with on-device vision model and natural language selectors4 versions - Latest release: 4 days ago - 321 downloads last month - 1 maintainer
scrapling 0.3.14 💰
Scrapling is an undetectable, powerful, flexible, high-performance Python library that makes Web ...37 versions - Latest release: 4 days ago - 17.2 thousand downloads last month - 8,331 stars on GitHub - 1 maintainer
Top 7.7% on pypi.org
6 versions - Latest release: almost 5 years ago - 4 dependent repositories - 159 downloads last month - 454 stars on GitHub - 2 maintainers
wayback-machine-scraper 1.0.8
A command-line utility for scraping Wayback Machine snapshots from archive.org.6 versions - Latest release: almost 5 years ago - 4 dependent repositories - 159 downloads last month - 454 stars on GitHub - 2 maintainers
torcrawl 1.35
A Python script to crawl and extract (regular or onion) webpages through TOR network.3 versions - Latest release: 4 days ago - 203 downloads last month - 399 stars on GitHub - 1 maintainer
pyvigate 0.0.3 💰
A brief description of what your package does4 versions - Latest release: almost 2 years ago - 23 downloads last month - 26 stars on GitHub - 1 maintainer
scrapy-rnet 0.0.5
A blazing-fast Python HTTP Client with TLS/HTTP2 fingerprint5 versions - Latest release: 7 months ago - 55 downloads last month - 306 stars on GitHub - 1 maintainer
Top 2.8% on pypi.org
63 versions - Latest release: 4 days ago - 14 dependent packages - 86 dependent repositories - 76.4 thousand downloads last month - 781 stars on GitHub - 1 maintainer
finvizfinance 1.3.0 💰
Finviz Finance. Information downloader.63 versions - Latest release: 4 days ago - 14 dependent packages - 86 dependent repositories - 76.4 thousand downloads last month - 781 stars on GitHub - 1 maintainer
crypto-exchange-news-crawler 0.1.9
Cryptocurrency exchange announcement news crawler for major crypto exchanges14 versions - Latest release: 7 months ago - 29 downloads last month - 4 stars on GitHub - 1 maintainer
detect-expert-client 1.0.0
Python client for detect.expert DNS checking service with Cloudflare bypass1 version - Latest release: 4 days ago - 1 maintainer
bluemoss 1.0.0
bluemoss enables you to easily scrape websites.18 versions - Latest release: about 2 years ago - 61 downloads last month - 7 stars on GitHub - 1 maintainer
openatlas 1.0.7
An autonomous browser agent with web search and interactive browsing capabilities8 versions - Latest release: 3 months ago - 506 downloads last month - 1 maintainer
scrapy-beautifulsoup 0.0.2
Simple Scrapy middleware to process non-well-formed HTML with BeautifulSoup2 versions - Latest release: over 9 years ago - 1 dependent repositories - 57 downloads last month - 21 stars on GitHub - 1 maintainer
tabulate-html 0.1.1
A robust HTML table parser for Python with full rowspan/colspan and multi-header support1 version - Latest release: 5 days ago - 1 maintainer
webrover 0.1.12
Generate high-quality datasets from web content for AI training11 versions - Latest release: about 1 year ago - 23 downloads last month - 6 stars on GitHub - 1 maintainer
flamingtext 0.0.7
Unofficial API of flamingtext.com7 versions - Latest release: almost 3 years ago - 22 downloads last month - 3 stars on GitHub - 1 maintainer
torvend 0.0.1
A framework for public torrent vendor scrapers2 versions - Latest release: about 8 years ago - 1 dependent repositories - 64 downloads last month - 1 stars on GitHub - 1 maintainer
vigorish 0.7.0
Hybrid Python/Node.js web scraper for Major League Baseball (MLB) data.48 versions - Latest release: over 3 years ago - 1 dependent repositories - 46 downloads last month - 2 stars on GitHub - 1 maintainer
Top 6.7% on pypi.org
48 versions - Latest release: 5 months ago - 3 dependent repositories - 4.22 thousand downloads last month - 401 stars on GitHub - 1 maintainer
basketball_reference_web_scraper 4.15.4
A Basketball Reference client that generates data by scraping the website48 versions - Latest release: 5 months ago - 3 dependent repositories - 4.22 thousand downloads last month - 401 stars on GitHub - 1 maintainer
scrape-and-ntfy 0.1.2
An extremely customizable web scraper with a modular notification system and persistent storage v...3 versions - Latest release: over 1 year ago - 17 downloads last month - 1 stars on GitHub - 1 maintainer
manga-scraper 0.12
Download Manga into chapterwise PDF files1 version - Latest release: about 4 years ago - 1 dependent repositories - 18 downloads last month - 4 stars on GitHub - 1 maintainer
substack2md 0.1.1
A CAPTCHA-safe Python scraper with Cloudflare bypass that downloads Substack posts and converts t...2 versions - Latest release: 7 months ago - 30 downloads last month - 0 stars on GitHub - 1 maintainer
pywebber 5.0
Common tools employed in web development3 versions - Latest release: over 7 years ago - 1 dependent repositories - 26 downloads last month - 1 stars on GitHub - 1 maintainer
Top 7.0% on pypi.org
39 versions - Latest release: about 4 years ago - 8 dependent repositories - 18.7 thousand downloads last month - 136 stars on GitHub - 1 maintainer
tableauscraper 0.1.29
Python library to scrape data from Tableau viz39 versions - Latest release: about 4 years ago - 8 dependent repositories - 18.7 thousand downloads last month - 136 stars on GitHub - 1 maintainer
ambi-alert 0.0.2
This is a reverse search tool. Agentic Alerting1 version - Latest release: 11 months ago - 23 downloads last month - 2 stars on GitHub - 1 maintainer
manga-down 0.1.2
Python package to download manga available on mangareader and mangapanda3 versions - Latest release: almost 5 years ago - 1 dependent repositories - 16 downloads last month - 5 stars on GitHub - 1 maintainer
scrapely-client 1.0.1
Python client for Scrapely browser automation service - simple, intuitive web scraping and automa...2 versions - Latest release: about 1 month ago - 1 maintainer
googlesearch-tool 1.1.3
A Python library for performing Google searches with support for dynamic query parameters, result...9 versions - Latest release: 8 months ago - 52 downloads last month - 1 maintainer
pinscrape 5.0.0
Pinterest | a simple data scraper for pinterest20 versions - Latest release: 4 months ago - 1 dependent repositories - 683 downloads last month - 113 stars on GitHub - 1 maintainer
html2rss-ai 0.3.2
🚀 AI-powered web scraping with modern CSS support. Extract content from any website using GPT-4, ...4 versions - Latest release: 6 months ago - 18 downloads last month - 1 maintainer
Related Keywords
python
240
scraper
148
scraping
106
automation
90
crawler
79
ai
58
data
57
selenium
57
bot_studio
57
playwright
47
python3
44
search
44
llm
41
webscraping
38
api
38
crawling
35
data-extraction
34
beautifulsoup
32
async
31
mcp
29
web-scraping-python
29
browser
28
requests
28
search-results
27
scrapy
25
web-scraper
25
browser-automation
25
proxy
25
framework
23
product
23
web-automation
23
web-crawler
22
html
22
web
22
keyword
21
http
21
spider
20
webdriver
19
asyncio
17
youtube
17
content-extraction
17
beautifulsoup4
16
linkedin
16
markdown
15
hacktoberfest
15
bot-detection
14
anti-detection
14
info
14
parser
14
cli
13
stealth
13
html-to-markdown
13
http-client
13
twitter
13
cloudflare-bypass
13
parsing
12
cloudflare
12
webscraper
12
ai-agents
12
scrape
11
google
11
web-crawling
11
nlp
11
machine-learning
11
selenium-python
11
web scraping
10
openai
10
model-context-protocol
10
claude
9
captcha
9
anti-bot
9
ai-scraping
9
bs4
9
testing
9
library
9
bot
9
xpath
8
user-agent
8
anthropic
8
python-scraper
8
python-library
8
chrome
8
amazon
8
research
8
download
8
video
8
scraping-framework
8
lxml
8
test-automation
7
json
7
html-parsing
7
scraping-python
7
data-mining
7
css
7
news
7
data-collection
7
pdf
7
curl
7
social-media
7
langchain
7