An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "scrapy" keyword

View the packages on the pypi.org package registry that are tagged with the "scrapy" keyword.

hoopa 0.1.18
Asynchronous crawler micro-framework based on python.
24 versions - Latest release: 5 months ago - 1 dependent repositories - 39 downloads last month - 9 stars on GitHub - 1 maintainer
Top 1.6% on pypi.org
scrapy-redis 0.9.1
Redis-based components for Scrapy.
21 versions - Latest release: over 1 year ago - 6 dependent packages - 392 dependent repositories - 12.6 thousand downloads last month - 5,641 stars on GitHub - 4 maintainers
asyncpy 1.2.0
Use asyncio and aiohttp's concatenated web crawler framework
14 versions - Latest release: about 3 years ago - 1 dependent repositories - 266 downloads last month - 107 stars on GitHub - 1 maintainer
scrapy-impersonate 1.6.1 💰
Scrapy download handler that can impersonate browser fingerprints
18 versions - Latest release: 3 months ago - 26.8 thousand downloads last month - 191 stars on GitHub - 1 maintainer
scrapy-vampire 0.1.0
utils for scrapy
1 version - Latest release: about 2 years ago - 1 dependent repositories - 1 maintainer
Top 1.5% on pypi.org
scrapy-splash 0.11.1
JavaScript support for Scrapy using Splash
15 versions - Latest release: 9 months ago - 5 dependent packages - 429 dependent repositories - 51.9 thousand downloads last month - 3,092 stars on GitHub - 4 maintainers
Top 5.0% on pypi.org
scrapydweb 1.6.0
Web app for Scrapyd cluster management, with support for Scrapy log analysis & visualization.
20 versions - Latest release: 9 months ago - 23 dependent repositories - 2.4 thousand downloads last month - 3,330 stars on GitHub - 1 maintainer
aduana 0.2.1
Bindings for Aduana library
2 versions - Latest release: over 10 years ago - 3 dependent repositories - 43 downloads last month - 55 stars on GitHub - 2 maintainers
scrapydd 0.7.5
distributed scrapy spider scheduling system
46 versions - Latest release: about 5 years ago - 1 dependent repositories - 134 downloads last month - 6 stars on GitHub - 1 maintainer
fara_principals 0.0.7
A web scraper designed to collect Foreign Principal information from fara.gov
3 versions - Latest release: over 8 years ago - 36 downloads last month - 0 stars on GitHub - 1 maintainer
scrapy-count-filter 0.2.0
Scrapy Middleware for limiting requests based on a counter.
2 versions - Latest release: almost 6 years ago - 2 dependent repositories - 15 downloads last month - 7 stars on GitHub - 1 maintainer
ax-spider 0.1.4
A simple Python crawler framework
11 versions - Latest release: over 2 years ago - 65 downloads last month - 0 stars on GitHub - 1 maintainer
scrapy-redis-httpcache 1.4.0
Cache Scrapy responses with Redis.
6 versions - Latest release: 10 months ago - 377 downloads last month - 2 stars on GitHub - 1 maintainer
Top 4.3% on pypi.org
scrapy-rotating-proxies 0.6.2
Rotating proxies for Scrapy
13 versions - Latest release: over 6 years ago - 91 dependent repositories - 14.1 thousand downloads last month - 767 stars on GitHub - 2 maintainers
scrapy-sticky-meta-params 1.0.0
A spider middleware that forwards meta params through subsequent requests.
1 version - Latest release: almost 4 years ago - 1 dependent repositories - 2.08 thousand downloads last month - 3 stars on GitHub - 1 maintainer
scrapyz 0.3.3
Scrape Easy
6 versions - Latest release: over 10 years ago - 2 dependent repositories - 27 downloads last month - 186 stars on GitHub - 1 maintainer
frontoxy 1.0.3
Distributed URLs frontier for Scrapy with RabbitMQ
4 versions - Latest release: almost 9 years ago - 1 dependent repositories - 26 downloads last month - 7 stars on GitHub - 1 maintainer
scrapy-rss 1.0.1
RSS Tools for Scrapy Framework
18 versions - Latest release: about 2 months ago - 1 dependent repositories - 232 downloads last month - 33 stars on GitHub - 1 maintainer
scrapy-mcp-middleware 0.1.0
Scrapy middleware for MCP request tracking
1 version - Latest release: 1 day ago - 1 maintainer
scrapy-kinesispipeline 0.3.9
Scrapy pipeline to store aggregated items into AWS Kinesis
17 versions - Latest release: over 6 years ago - 1 dependent repositories - 47 downloads last month - 2 stars on GitHub - 1 maintainer
scrapy-mongoengine-item 0.1.5
Scrapy extension to write scraped items using MongoEngine documents
4 versions - Latest release: over 6 years ago - 1 dependent repositories - 8 downloads last month - 2 stars on GitHub - 1 maintainer
lich_scrapy_hdfs_pipeline 0.0.1
Auto Generated by os-scrapy-cookiecutter
1 version - Latest release: over 4 years ago - 7 downloads last month - 3 stars on GitHub - 1 maintainer
portiaitempipelineutils 0.0.1
Scrapy portia pipeline which allow you to do items related stuff.
1 version - Latest release: over 6 years ago - 1 dependent repositories - 6 downloads last month - 1 maintainer
Top 3.3% on pypi.org
python-scrapyd-api 2.1.2
A Python wrapper for working with the Scrapyd API
6 versions - Latest release: over 7 years ago - 3 dependent packages - 215 dependent repositories - 4.93 thousand downloads last month - 271 stars on GitHub - 1 maintainer
scrapy-kafka 0.1.1
Kafka-based components for Scrapy
1 version - Latest release: about 10 years ago - 2 dependent repositories - 9 downloads last month - 79 stars on GitHub - 1 maintainer
scrapy-influxdb-exporter 1.4.0
Export Scrapy spider stats to InfluxDB.
17 versions - Latest release: 10 months ago - 57 downloads last month - 6 stars on GitHub - 1 maintainer
Top 6.3% on pypi.org
scrapy-random-useragent 0.2
Scrapy Middleware to set a random User-Agent for every Request.
2 versions - Latest release: over 9 years ago - 39 dependent repositories - 356 downloads last month - 202 stars on GitHub - 1 maintainer
picoscrape 1.0
This library enables the user to easily scrape images from various websites like unsplash, pexels...
1 version - Latest release: over 5 years ago - 1 dependent repositories - 6 downloads last month - 1 stars on GitHub - 1 maintainer
Top 4.4% on pypi.org
feapder 1.9.2
feapder是一款支持分布式、批次采集、数据防丢、报警丰富的python爬虫框架
178 versions - Latest release: 9 months ago - 4 dependent repositories - 28.9 thousand downloads last month - 3,368 stars on GitHub - 1 maintainer
scrapy-sentry-sdk 0.4.1
Scrapy extension for integration of Sentry SDK to Scrapy projects
6 versions - Latest release: about 5 years ago - 1 dependent repositories - 57 downloads last month - 5 stars on GitHub - 1 maintainer
weblocust 1.0.3
A more Powerful Spider System in Python based on pyspider
4 versions - Latest release: about 9 years ago - 1 dependent repositories - 27 downloads last month - 6 stars on GitHub - 1 maintainer
lich_scrapy_referrer 0.0.2
Auto Generated by os-scrapy-cookiecutter
2 versions - Latest release: almost 5 years ago - 7 downloads last month - 3 stars on GitHub - 1 maintainer
scrapy-cffi 0.2.2
An asyncio-style web scraping framework inspired by Scrapy, powered by curl_cffi.
10 versions - Latest release: 3 days ago - 258 downloads last month - 3 stars on GitHub - 1 maintainer
crypto-exchange-news-crawler 0.1.9
Cryptocurrency exchange announcement news crawler for major crypto exchanges
14 versions - Latest release: 5 months ago - 58 downloads last month - 4 stars on GitHub - 1 maintainer
airscrapy 1.0.1
Scrapy contrib for Airflow
2 versions - Latest release: 9 months ago - 36 downloads last month - 2 stars on GitHub - 1 maintainer
os-scrapy-cookiecutter 0.0.13
Cookiecutter for Scrapy
13 versions - Latest release: almost 5 years ago - 1 dependent repositories - 1.52 thousand downloads last month - 3 stars on GitHub - 1 maintainer
scrapyd-egg-checksum 0.1.2
Get the checksum of eggs in case of building distributed scrapy clusters
2 versions - Latest release: about 6 years ago - 1 dependent repositories - 19 downloads last month - 0 stars on GitHub - 1 maintainer
Top 3.5% on pypi.org
scrapy-selenium 0.0.7
Scrapy with selenium
6 versions - Latest release: almost 7 years ago - 69 dependent repositories - 12.4 thousand downloads last month - 954 stars on GitHub - 1 maintainer
scrapy-save-statistics 0.2
Scrapy Save Statistics: Save statistics extension for Scrapy
2 versions - Latest release: over 8 years ago - 1 dependent repositories - 11 downloads last month - 3 stars on GitHub - 1 maintainer
Top 5.5% on pypi.org
logparser 0.8.4
A tool for parsing Scrapy log files periodically and incrementally, designed for ScrapydWeb.
5 versions - Latest release: 10 months ago - 1 dependent package - 34 dependent repositories - 8.62 thousand downloads last month - 92 stars on GitHub - 1 maintainer
ze-the-scraper 0.0.17.dev1
Scaper to lager portal of news in Brazil.
2 versions - Latest release: over 8 years ago - 1 dependent repositories - 8 downloads last month - 5 stars on GitHub - 1 maintainer
scrapy-processors 2.0.5
Provides processors for the itemloaders package, commonly used with scrapy.
9 versions - Latest release: over 1 year ago - 24 downloads last month - 2 stars on GitHub - 1 maintainer
scrapy-kafka-redis 0.0.7
Kafka and Redis based components for Scrapy.
1 version - Latest release: over 7 years ago - 1 dependent repositories - 11 downloads last month - 46 stars on GitHub - 1 maintainer
scrapy-http-pipeline 0.2.0
Scrapy HTTP POST items pipeline
2 versions - Latest release: almost 5 years ago - 1 dependent repositories - 10 downloads last month - 1 stars on GitHub - 1 maintainer
scrapy-puppeteer 0.0.1b0
Scrapy with puppeteer
1 version - Latest release: almost 7 years ago - 1 dependent repositories - 15 downloads last month - 110 stars on GitHub - 1 maintainer
scrapy-elves 0.1.0
utils for parse html
1 version - Latest release: about 2 years ago - 1 dependent repositories - 1 maintainer
scrapy-vectors 0.2.2
Vector embeddings generation and storage for Scrapy
4 versions - Latest release: 2 months ago - 34 downloads last month - 0 stars on GitHub - 1 maintainer
scrapyrunner 0.0.10
A Python library to run Scrapy spiders directly from your code.
10 versions - Latest release: 10 months ago - 22 downloads last month - 0 stars on GitHub - 1 maintainer
thisisapogreq 21.3.3 💰
Faster & simpler requests replacement for Python
1 version - Latest release: over 3 years ago - 1 dependent repositories - 10 downloads last month - 1,115 stars on GitHub - 1 maintainer
scrapyappsearch 0.1.0
Scrapy pipeline which allow you to store multiple scrapy items in AppSearch.
1 version - Latest release: almost 5 years ago - 1 dependent repositories - 17 downloads last month - 3 stars on GitHub - 1 maintainer
Top 5.3% on pypi.org
scrapy-zyte-smartproxy 2.4.1
Scrapy middleware for Zyte Smart Proxy Manager
11 versions - Latest release: 7 months ago - 7 dependent repositories - 21.6 thousand downloads last month - 366 stars on GitHub - 1 maintainer
Top 5.3% on pypi.org
wechatsogou 4.5.4 💰
Api for wechat mp with sogou
25 versions - Latest release: over 6 years ago - 17 dependent repositories - 463 downloads last month - 6,113 stars on GitHub - 1 maintainer
sitesearcher 0.1a2
A command line tool that creates fulltext search indexes of your favourite websites on your machi...
2 versions - Latest release: about 9 years ago - 1 dependent repositories - 16 downloads last month - 1 stars on GitHub - 1 maintainer
scrapy-proxyland-middleware 1.0
Middleware to easily implement Proxyland in Scrapy projects
1 version - Latest release: about 4 years ago - 1 dependent repositories - 19 downloads last month - 1 stars on GitHub - 1 maintainer
products-crawler 0.1.9
A scrapy project for crawl product pictures and information.
10 versions - Latest release: over 3 years ago - 1 dependent repositories - 54 downloads last month - 10 stars on GitHub - 1 maintainer
Top 9.6% on pypi.org
scrapyelasticsearch 0.9.2
Scrapy pipeline which allow you to store multiple scrapy items in Elastic Search.
22 versions - Latest release: almost 6 years ago - 1 dependent repositories - 591 downloads last month - 325 stars on GitHub - 3 maintainers
scrapy-selenium-mm 0.1.1
Scrapy with selenium
4 versions - Latest release: about 2 years ago - 15 downloads last month - 951 stars on GitHub - 1 maintainer
scrapy-autoextract 0.7.0
Zyte Automatic Extraction API integration for Scrapy
11 versions - Latest release: about 4 years ago - 1 dependent repositories - 291 downloads last month - 56 stars on GitHub - 2 maintainers
scrapy-drissionpage 1.0.3
将Scrapy爬虫框架与DrissionPage网页自动化工具进行无缝集成
2 versions - Latest release: 7 months ago - 33 downloads last month - 1 maintainer
crawlib 0.1.1 💰
tool set for crawler project.
29 versions - Latest release: almost 6 years ago - 2 dependent repositories - 477 downloads last month - 1 stars on GitHub - 1 maintainer
scrapy-warcio 0.0.8
Scrapy WARC I/O
8 versions - Latest release: almost 6 years ago - 1 dependent repositories - 67 downloads last month - 22 stars on GitHub - 1 maintainer
scrachy 0.21.1
Provides an SqlAlchemy based cache storage backend, a Selenium middleware, and a few other utilit...
26 versions - Latest release: 5 days ago - 1 dependent repositories - 340 downloads last month - 1 maintainer
Top 8.1% on pypi.org
spiderkeeper 1.2.0
Admin ui for spider service
6 versions - Latest release: about 8 years ago - 5 dependent repositories - 86 downloads last month - 2,767 stars on GitHub - 1 maintainer
scrapy-selenium-middleware 0.0.5
Scrapy middleware for downloading a page html source using selenium, and interacting with the web...
5 versions - Latest release: almost 5 years ago - 1 dependent package - 1 dependent repositories - 28 downloads last month - 9 stars on GitHub - 1 maintainer
scrapy-wayback-middleware 0.3.3
Scrapy middleware for submitting URLs to the Internet Archive Wayback Machine
10 versions - Latest release: almost 4 years ago - 11 dependent repositories - 947 downloads last month - 10 stars on GitHub - 1 maintainer
scrapy-status-mailer 0.3
Scrapy Status Mailer: Status mailer extension for Scrapy
3 versions - Latest release: over 8 years ago - 1 dependent repositories - 12 downloads last month - 1 stars on GitHub - 1 maintainer
pyfeeds 2024.5.1
DIY Atom feeds in times of social media and paywalls
4 versions - Latest release: over 1 year ago - 1 dependent repositories - 47 downloads last month - 84 stars on GitHub - 2 maintainers
scrapy-html-storage 0.4.0
Scrapy downloader middleware that stores response HTML files to disk.
4 versions - Latest release: over 7 years ago - 3 dependent repositories - 13 downloads last month - 18 stars on GitHub - 1 maintainer
tkit-scrapy-mongo 0.0.0.116654862
Terry toolkit sdk for tkit_scrapy_mongo ,
1 version - Latest release: about 3 years ago - 10 downloads last month - 0 stars on GitHub - 1 maintainer
scrapy-translate 1.1
Scrapy text translation pipeline
2 versions - Latest release: 3 months ago - 18 downloads last month - 0 stars on GitHub - 1 maintainer
scrapy-xlsx 0.1.1
XLSX exporter for Scrapy
2 versions - Latest release: over 6 years ago - 6 dependent repositories - 877 downloads last month - 26 stars on GitHub - 1 maintainer
scrapybox 0.1
A Scrapy GUI
1 version - Latest release: over 9 years ago - 2 dependent repositories - 12 downloads last month - 12 stars on GitHub - 1 maintainer
crawltools 0.2.1 💰
Simple crawlers
8 versions - Latest release: over 4 years ago - 52 downloads last month - 1 stars on GitHub - 1 maintainer
scraper-factory 0.2.1
Scraping library to retrieve data from useful pages, such as Amazon wishlists
3 versions - Latest release: about 6 years ago - 1 dependent repositories - 19 downloads last month - 1 stars on GitHub - 1 maintainer
scrapy-meili-pipeline 0.1.1
A Scrapy pipeline that batches and indexes items into Meilisearch, with task tracking and index c...
1 version - Latest release: 9 days ago
bookscrape 0.0.7
Scrape and build e-books from various websites
25 versions - Latest release: over 6 years ago - 169 downloads last month - 10 stars on GitHub - 1 maintainer
scrapfly-sdk 0.8.23
Scrapfly SDK for Scrapfly
43 versions - Latest release: 6 months ago - 2 dependent repositories - 60.9 thousand downloads last month - 48 stars on GitHub - 1 maintainer
Top 10.0% on pypi.org
scrapy-zyte-api 0.31.0
Client library to process URLs through Zyte API
54 versions - Latest release: 3 months ago - 1 dependent package - 1 dependent repositories - 48.9 thousand downloads last month - 33 stars on GitHub - 1 maintainer
netkeiba 0.0.3
A Django app which crawls and imports race data from netkeiba.com
1 version - Latest release: over 6 years ago - 1 dependent repositories - 19 downloads last month - 3 stars on GitHub - 1 maintainer
Top 9.3% on pypi.org
page_finder 0.1.9
Find which links on a web page are pagination links
10 versions - Latest release: almost 9 years ago - 14 dependent repositories - 187 downloads last month - 29 stars on GitHub - 2 maintainers
scrapy-puppeteer-client 0.4.0
A library to use Puppeteer-managed browser in Scrapy spiders
20 versions - Latest release: 3 months ago - 1 dependent package - 1 dependent repositories - 167 downloads last month - 52 stars on GitHub - 2 maintainers
scrapy-slackbot 0.3.0
A Scrapy extension to send notification to the Slack channel.
2 versions - Latest release: about 3 years ago - 1 dependent repositories - 13 downloads last month - 9 stars on GitHub - 1 maintainer
scrapy-lambda 0.1.1
Scrapy pipeline which invokes a lambda with the scraped item
2 versions - Latest release: over 8 years ago - 1 dependent repositories - 14 downloads last month - 10 stars on GitHub - 1 maintainer
scrapysplashwrapper 1.11.0
Scrapy splash wrapper as a standalone library.
30 versions - Latest release: over 3 years ago - 2 dependent repositories - 449 downloads last month - 9 stars on GitHub - 1 maintainer
indoquake 0.0.5
A Latest Earthquake Detection Package Taken Based on BMKG | Meteorological, Climatological, and G...
5 versions - Latest release: about 2 years ago - 22 downloads last month - 0 stars on GitHub - 1 maintainer
scrapy-db-pipeline 1.1
persist item to the database table
2 versions - Latest release: almost 6 years ago - 1 dependent repositories - 7 downloads last month - 2 stars on GitHub - 1 maintainer
spdclient 0.0.1
Python Wrapper for Scrapyd WebService
1 version - Latest release: about 3 years ago - 8 downloads last month - 0 stars on GitHub - 1 maintainer
scrapy-scrapingbee 0.0.5
JavaScript support and proxy rotation for Scrapy with ScrapingBee
5 versions - Latest release: almost 4 years ago - 1 dependent repositories - 5.97 thousand downloads last month - 39 stars on GitHub - 1 maintainer
web-walker 3.1.5
your can crawl web pages with litte settings. based on scrapy.
43 versions - Latest release: over 8 years ago - 1 dependent repositories - 226 downloads last month - 55 stars on GitHub - 1 maintainer
Top 5.9% on pypi.org
scrapy-mongodb 0.12.0
Pipeline to MongoDB for Scrapy. Supports MongoDB replica sets
22 versions - Latest release: almost 8 years ago - 41 dependent repositories - 675 downloads last month - 356 stars on GitHub - 1 maintainer
scrapy-dynamic-spiders 1.0.0a1
Dynamically generate spider subclasses. Run crawls sequentially with crochet. Do both.
1 version - Latest release: over 5 years ago - 1 dependent repositories - 8 downloads last month - 0 stars on GitHub - 1 maintainer
spider-admin-pro 3.0.9
a spider admin based vue, scrapyd api and APScheduler
41 versions - Latest release: about 1 year ago - 1 dependent repositories - 370 downloads last month - 602 stars on GitHub - 1 maintainer
scrapy-item-ingest 0.1.2
Scrapy extension for database ingestion with job/spider tracking
3 versions - Latest release: 3 months ago - 405 downloads last month - 0 stars on GitHub
scraprom 1.0.2
Scrapy stats collector for prometheus
3 versions - Latest release: over 6 years ago - 1 dependent repositories - 40 downloads last month - 1 stars on GitHub - 1 maintainer
scrapy-ua-rotator 1.0.1
Flexible and modern User-Agent rotator middleware for Scrapy, supporting Faker, fake-useragent, a...
2 versions - Latest release: 4 months ago - 205 downloads last month - 2 stars on GitHub - 1 maintainer
Top 7.7% on pypi.org
wayback-machine-scraper 1.0.8
A command-line utility for scraping Wayback Machine snapshots from archive.org.
6 versions - Latest release: over 4 years ago - 4 dependent repositories - 197 downloads last month - 454 stars on GitHub - 2 maintainers
smallder 0.0.1
An out-of-the-box lightweight asynchronous crawler framework
1 version - Latest release: 11 months ago - 11 downloads last month - 8 stars on GitHub - 1 maintainer
scrapy-spiderdocs 0.1.3
Generate spiders md documentation based on spider docstrings.
8 versions - Latest release: over 2 years ago - 1 dependent repositories - 41 downloads last month - 1 stars on GitHub - 1 maintainer
spymanga 0.1
A lib to download manga chapters
1 version - Latest release: over 7 years ago - 1 dependent repositories - 5 downloads last month - 2 stars on GitHub - 1 maintainer
ml-scrapy 0.0.4
Components For Scrapy Project
3 versions - Latest release: almost 4 years ago - 1 dependent repositories - 12 downloads last month - 0 stars on GitHub - 1 maintainer