Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "scrapy" keyword

zcbot-scrapy-redis 0.7.3.2110.2
Redis-based components for Scrapy 2.11.0+.
3 versions - Latest release: 6 months ago - 24 downloads last month - 5,448 stars on GitHub - 1 maintainer
Top 2.7% on pypi.org
icrawler 0.6.8
A multi-thread crawler framework with many builtin image crawlers provided.
43 versions - Latest release: 10 days ago - 2 dependent packages - 86 dependent repositories - 99.9 thousand downloads last month - 813 stars on GitHub - 2 maintainers
fara_principals 0.0.7
A web scraper designed to collect Foreign Principal information from fara.gov
3 versions - Latest release: about 7 years ago - 25 downloads last month - 0 stars on GitHub - 1 maintainer
frontoxy 1.0.3
Distributed URLs frontier for Scrapy with RabbitMQ
4 versions - Latest release: over 7 years ago - 1 dependent repositories - 23 downloads last month - 7 stars on GitHub - 1 maintainer
Top 6.1% on pypi.org
scrapy-user-agents 0.1.1
Automatically pick an User-Agent for every request
2 versions - Latest release: over 5 years ago - 2 dependent packages - 110 dependent repositories - 17.7 thousand downloads last month - 23 stars on GitHub - 1 maintainer
Top 2.8% on pypi.org
scrapy-playwright 0.0.34
Playwright integration for Scrapy
34 versions - Latest release: 5 months ago - 4 dependent packages - 22 dependent repositories - 35.5 thousand downloads last month - 839 stars on GitHub - 1 maintainer
Top 3.8% on pypi.org
random-user-agent 1.0.1
A package to get random user agents based filters provided by user
5 versions - Latest release: over 5 years ago - 24 dependent packages - 94 dependent repositories - 161 thousand downloads last month - 92 stars on GitHub - 1 maintainer
Top 9.3% on pypi.org
page_finder 0.1.9
Find which links on a web page are pagination links
10 versions - Latest release: over 7 years ago - 14 dependent repositories - 490 downloads last month - 30 stars on GitHub - 2 maintainers
page_clustering 0.0.1
Online k-means clustering of web pages
1 version - Latest release: almost 8 years ago - 13 dependent repositories - 239 downloads last month - 35 stars on GitHub - 2 maintainers
elasticstats-scrapy 0.1.5
A scrapy extension to send crawl stats to elasticsearch index.
5 versions - Latest release: about 7 years ago - 1 dependent repositories - 29 downloads last month - 0 stars on GitHub - 1 maintainer
m3u8-to-mp4 0.1.11
Python downloader for saving m3u8 video to local MP4 file.
12 versions - Latest release: almost 2 years ago - 1 dependent package - 1 dependent repositories - 2.58 thousand downloads last month - 60 stars on GitHub - 1 maintainer
scrapy-scraperapi-middleware 1.0
Middleware to easily implement ScraperAPI in Scrapy projects
3 versions - Latest release: over 2 years ago - 1 dependent repositories - 155 downloads last month - 2 stars on GitHub - 1 maintainer
Top 6.0% on pypi.org
anti-useragent 1.0.10
fake pc or app browser useragent, anti useragent, and other awesome tools
10 versions - Latest release: over 1 year ago - 4 dependent packages - 2 dependent repositories - 2.45 thousand downloads last month - 230 stars on GitHub - 1 maintainer
django-scratchy 0.4.0
Manage and run Scrapy spiders in Django
8 versions - Latest release: about 4 years ago - 1 dependent repositories - 49 downloads last month - 2 stars on GitHub - 1 maintainer
scrapy-selenium2 0.0.8
Scrapy with selenium
1 version - Latest release: 3 months ago - 39 downloads last month - 887 stars on GitHub - 1 maintainer
scrapy-ai 0.0.1
AI-powered scrapy plugin
1 version - Latest release: 4 months ago - 28 downloads last month - 1 stars on GitHub - 1 maintainer
new-frontera 0.9.0 💰
A scalable frontier for web crawlers
2 versions - Latest release: 4 months ago - 9 downloads last month - 0 stars on GitHub - 1 maintainer
scrapy-qos 0.0.2
implement QOS(TokenBucket) in scrapy download middleware
2 versions - Latest release: 5 months ago - 17 downloads last month - 1 maintainer
scrapy-playwright-full 0.0.3404
Playwright integration for Scrapy
6 versions - Latest release: 17 days ago - 67 downloads last month - 839 stars on GitHub - 1 maintainer
scrapy-scraper 1.7 💰
Web crawler and scraper based on Scrapy and Playwright's headless browser.
8 versions - Latest release: 14 days ago - 39 downloads last month - 4 stars on GitHub - 1 maintainer
scrapy-aiohttp 0.1.2
Scrapy middleware for sending requests with aiohttp.
3 versions - Latest release: 6 months ago - 9 downloads last month - 1 stars on GitHub - 1 maintainer
volleystats 0.8.1
Command-line tool to scrape volleyball statistics from Data Project Web Competition websites
8 versions - Latest release: 4 months ago - 72 downloads last month - 6 stars on GitHub - 1 maintainer
boost-siper 0.9 removed
横冲直闯无回调写法的高速爬虫框架
8 versions - Latest release: 8 months ago - 480 downloads last month - 4 stars on GitHub - 1 maintainer
scrapy-crawlbase-middleware 1.0.0
Scrapy Crawlbase Proxy Middleware: Crawlbase interfacing middleware for Scrapy
1 version - Latest release: 11 months ago - 12 downloads last month - 6 stars on GitHub - 1 maintainer
scrapy-manipulate-request 0.0.2
An async scrapy request downloader middleware, support random request and response manipulation.
2 versions - Latest release: 11 months ago - 67 downloads last month - 10 stars on GitHub - 1 maintainer
scrapy-processors 2.0.5
Provides processors for the itemloaders package, commonly used with scrapy.
9 versions - Latest release: 17 days ago - 49 downloads last month - 0 stars on GitHub - 1 maintainer
scrapy-mattermostbot 1.0.0
A Scrapy extension for sending notification to Mattermost channels
1 version - Latest release: about 1 year ago - 15 downloads last month - 1 maintainer
proxyport2 1.1.0
Proxy Port SDK
3 versions - Latest release: about 1 year ago - 2 dependent packages - 45 downloads last month - 2 stars on GitHub - 1 maintainer
yugioh-scraper 0.2.0
Yu-Gi-Oh! Scraper is a project that crawls websites and APIs and extracts Yu-Gi-Oh! related data ...
5 versions - Latest release: over 1 year ago - 46 downloads last month - 1 stars on GitHub - 1 maintainer
spdclient 0.0.1
Python Wrapper for Scrapyd WebService
1 version - Latest release: over 1 year ago - 11 downloads last month - 0 stars on GitHub - 1 maintainer
scrapy-athlinks 0.0.1
Web scraper for race results hosted on Athlinks.
1 version - Latest release: over 1 year ago - 14 downloads last month - 0 stars on GitHub - 1 maintainer
herodotus 0.1.0
Package for fast integration between SQLAlchemy models and Scrapy spiders
1 version - Latest release: over 1 year ago - 643 downloads last month - 0 stars on GitHub - 1 maintainer
zhihu-crawler 1.0.3
知乎关键词搜索、热榜、用户信息、回答、专栏文章、评论等信息的抓取程序
6 versions - Latest release: about 2 years ago - 1 dependent repositories - 24 downloads last month - 26 stars on GitHub - 1 maintainer
ze 0.0.17.dev1
Scaper to lager portal of news in Brazil.
2 versions - Latest release: about 7 years ago - 1 dependent repositories - 19 downloads last month - 5 stars on GitHub - 1 maintainer
Top 5.3% on pypi.org
wechatsogou 4.5.4 💰
Api for wechat mp with sogou
25 versions - Latest release: about 5 years ago - 17 dependent repositories - 595 downloads last month - 5,783 stars on GitHub - 1 maintainer
web-walker 3.1.5
your can crawl web pages with litte settings. based on scrapy.
43 versions - Latest release: almost 7 years ago - 1 dependent repositories - 145 downloads last month - 53 stars on GitHub - 1 maintainer
weblocust 1.0.3
A more Powerful Spider System in Python based on pyspider
4 versions - Latest release: over 7 years ago - 1 dependent repositories - 25 downloads last month - 6 stars on GitHub - 1 maintainer
web-crawler-plus 0.9.14
A micro-framework to crawl the web pages with crawlers configs. It can use MongoDB, Elasticsearch...
17 versions - Latest release: about 6 years ago - 1 dependent repositories - 61 downloads last month - 31 stars on GitHub - 1 maintainer
Top 7.7% on pypi.org
wayback-machine-scraper 1.0.8
A command-line utility for scraping Wayback Machine snapshots from archive.org.
6 versions - Latest release: over 3 years ago - 4 dependent repositories - 232 downloads last month - 400 stars on GitHub - 2 maintainers
take 0.2.0
A DSL for extracting data from a web page.
9 versions - Latest release: about 9 years ago - 10 dependent repositories - 57 downloads last month - 8 stars on GitHub - 1 maintainer
stickymeta 0.0.5
Handy tools to maintain persistent meta values between requests in Scrapy spiders
3 versions - Latest release: over 7 years ago - 2 dependent repositories - 23 downloads last month - 1 stars on GitHub - 1 maintainer
spymanga 0.1
A lib to download manga chapters
1 version - Latest release: about 6 years ago - 1 dependent repositories - 8 downloads last month - 2 stars on GitHub - 1 maintainer
spydy 0.1.25
light-weight high-level web-crawling framework
25 versions - Latest release: about 3 years ago - 1 dependent repositories - 52 downloads last month - 2 stars on GitHub - 1 maintainer
spider-renderer 0.2.3
Building a modular crawler template system based on Jinja2.
12 versions - Latest release: almost 4 years ago - 1 dependent repositories - 92 downloads last month - 0 stars on GitHub - 1 maintainer
spidermanager 1.3.1
Admin ui for spider service
2 versions - Latest release: almost 3 years ago - 1 dependent repositories - 28 downloads last month - 2 stars on GitHub - 1 maintainer
spiderkeeper-deploy 0.1.3
Deploy to SpiderKeeper
2 versions - Latest release: over 5 years ago - 1 dependent repositories - 18 downloads last month - 2 stars on GitHub - 1 maintainer
sodo 1.0.1
Redis-based scheduler and Message queue Spider for Scrapy, Provide more flexible and practical ...
2 versions - Latest release: about 5 years ago - 1 dependent repositories - 15 downloads last month - 4 stars on GitHub - 1 maintainer
sentry-scrapy 0.2
Scrapy integration with Sentry SDK (unofficial)
2 versions - Latest release: over 5 years ago - 1 dependent repositories - 38 downloads last month - 7 stars on GitHub - 1 maintainer
scrongo 0.0.0
Non-blocking ItemExporter for MongoDB
1 version - Latest release: over 6 years ago - 1 dependent repositories - 8 downloads last month - 1 maintainer
Top 5.3% on pypi.org
scrapy-zyte-smartproxy 2.3.4
Scrapy middleware for Zyte Smart Proxy Manager
8 versions - Latest release: 16 days ago - 7 dependent repositories - 14.9 thousand downloads last month - 348 stars on GitHub - 1 maintainer
Top 10.0% on pypi.org
scrapy-zyte-api 0.18.2
Client library to process URLs through Zyte API
36 versions - Latest release: 30 days ago - 1 dependent package - 1 dependent repositories - 31 thousand downloads last month - 30 stars on GitHub - 1 maintainer
scrapy-xlsx 0.1.1
XLSX exporter for Scrapy
2 versions - Latest release: about 5 years ago - 6 dependent repositories - 479 downloads last month - 25 stars on GitHub - 1 maintainer
scrapy-wayback-middleware 0.3.3
Scrapy middleware for submitting URLs to the Internet Archive Wayback Machine
10 versions - Latest release: over 2 years ago - 11 dependent repositories - 1.08 thousand downloads last month - 10 stars on GitHub - 1 maintainer
scrapy-warcio 0.0.8
Scrapy WARC I/O
8 versions - Latest release: over 4 years ago - 1 dependent repositories - 44 downloads last month - 13 stars on GitHub - 1 maintainer
Top 8.4% on pypi.org
scrapy-wayback-machine 1.0.3
A Scrapy middleware for scraping Wayback Machine snapshots from archive.org.
4 versions - Latest release: about 3 years ago - 8 dependent repositories - 227 downloads last month - 107 stars on GitHub - 2 maintainers
scrapy-venom 0.1.1
Generic classes to deal with data scraping using Scrapy
2 versions - Latest release: over 8 years ago - 2 dependent repositories - 4 downloads last month - 5 stars on GitHub - 1 maintainer
Top 8.9% on pypi.org
scrapy-useragents 0.0.1
A middleware to change user-agent in request for Scrapy
1 version - Latest release: over 6 years ago - 12 dependent repositories - 5.67 thousand downloads last month - 20 stars on GitHub - 1 maintainer
scrapy-tor-proxy-rotation 0.0.4
IP Rotator for Scrapy via Tor
4 versions - Latest release: about 1 year ago - 1 dependent repositories - 37 downloads last month - 26 stars on GitHub - 1 maintainer
scrapyu 0.1.12
Scrapy utils
3 versions - Latest release: over 4 years ago - 1 dependent repositories - 34 downloads last month - 1 stars on GitHub - 1 maintainer
scrapy-time-machine 1.1.1
A downloader middleware that stores the current request chain to be crawled at another time.
3 versions - Latest release: 4 months ago - 1 dependent repositories - 96 downloads last month - 4 stars on GitHub - 2 maintainers
scrapy-ssdb 0.0.1
scrapy and ssdb
1 version - Latest release: almost 7 years ago - 1 dependent repositories - 8 downloads last month - 1 maintainer
scrapy-sqs-exporter 1.1.0
Scrapy extension for outputting scraped items to an Amazon SQS instance
6 versions - Latest release: almost 6 years ago - 1 dependent repositories - 34 downloads last month - 6 stars on GitHub - 1 maintainer
scrapy-sqspipeline 1.0.0
1 version - Latest release: about 4 years ago - 1 dependent repositories - 33 downloads last month - 0 stars on GitHub - 1 maintainer
scrapy-spiderstats-extension 0.0.2
Scrapy Spider Stats to MongoDB Extension
2 versions - Latest release: over 3 years ago - 1 dependent repositories - 20 downloads last month - 2 stars on GitHub - 1 maintainer
scrapysplashwrapper 1.11.0
Scrapy splash wrapper as a standalone library.
30 versions - Latest release: about 2 years ago - 2 dependent repositories - 162 downloads last month - 9 stars on GitHub - 1 maintainer
scrapy-spiderdocs 0.1.3
Generate spiders md documentation based on spider docstrings.
8 versions - Latest release: about 1 year ago - 1 dependent repositories - 37 downloads last month - 1 stars on GitHub - 1 maintainer
scrapysolr 0.2.0
Scrapy pipeline which allows you to store scrapy items in a Solr server.
2 versions - Latest release: about 8 years ago - 2 dependent repositories - 11 downloads last month - 19 stars on GitHub - 1 maintainer
scrapy-slackbot 0.3.0
A Scrapy extension to send notification to the Slack channel.
2 versions - Latest release: over 1 year ago - 1 dependent repositories - 93 downloads last month - 9 stars on GitHub - 1 maintainer
scrapy-sentry-sdk 0.4.1
Scrapy extension for integration of Sentry SDK to Scrapy projects
6 versions - Latest release: over 3 years ago - 1 dependent repositories - 207 downloads last month - 5 stars on GitHub - 1 maintainer
scrapy-selenium-middleware 0.0.5
Scrapy middleware for downloading a page html source using selenium, and interacting with the web...
5 versions - Latest release: over 3 years ago - 1 dependent package - 1 dependent repositories - 51 downloads last month - 9 stars on GitHub - 1 maintainer
scrapy-script 1.0.0
Run a Scrapy spider programmatically from a script or a Celery task - no project required.
1 version - Latest release: about 4 years ago - 1 dependent repositories - 16 downloads last month - 2 stars on GitHub - 1 maintainer
Top 3.5% on pypi.org
scrapy-selenium 0.0.7
Scrapy with selenium
6 versions - Latest release: over 5 years ago - 69 dependent repositories - 14.3 thousand downloads last month - 887 stars on GitHub - 1 maintainer
Top 6.2% on pypi.org
scrapyscript 1.1.5
Run a Scrapy spider programmatically from a script or a Celery task - no project required.
10 versions - Latest release: over 2 years ago - 15 dependent repositories - 3.78 thousand downloads last month - 119 stars on GitHub - 1 maintainer
Top 4.2% on pypi.org
scrapyrt 0.16.0
Put Scrapy spiders behind an HTTP API
8 versions - Latest release: 3 months ago - 75 dependent repositories - 2.1 thousand downloads last month - 815 stars on GitHub - 2 maintainers
Top 4.3% on pypi.org
scrapy-rotating-proxies 0.6.2
Rotating proxies for Scrapy
13 versions - Latest release: almost 5 years ago - 91 dependent repositories - 12.3 thousand downloads last month - 713 stars on GitHub - 2 maintainers
scrapy-rethinkdb 0.0.4
Scrapy pipeline for rethinkdb.
2 versions - Latest release: almost 10 years ago - 2 dependent repositories - 16 downloads last month - 3 stars on GitHub - 1 maintainer
scrapy-requests 0.2.0
Scrapy with requests-html
3 versions - Latest release: about 3 years ago - 1 dependent repositories - 101 downloads last month - 25 stars on GitHub - 1 maintainer
scrapy-redis-sentinel 0.7.2
Redis Cluster for Scrapy.
8 versions - Latest release: over 2 years ago - 1 dependent repositories - 107 downloads last month - 30 stars on GitHub - 1 maintainer
scrapy-redis-ironman 0.1.0
Scrapy Redis Ironman
1 version - Latest release: over 4 years ago - 1 dependent repositories - 10 downloads last month - 1 maintainer
scrapy-redis-expiredupefilter 0.2.1
A distributed crawler component based on scrapy_redis which can specify the expiration time of fi...
4 versions - Latest release: almost 5 years ago - 1 dependent repositories - 22 downloads last month - 11 stars on GitHub - 1 maintainer
scrapy-redis-bloomfilter-block-cluster 1.9.0
Scrapy Redis BloomFilter Block Cluster
10 versions - Latest release: over 4 years ago - 1 dependent repositories - 11 downloads last month - 22 stars on GitHub - 1 maintainer
scrapy-redirect 0.1.0
Restrict authorized Scrapy redirections to the website start_urls
1 version - Latest release: over 10 years ago - 2 dependent repositories - 14 downloads last month - 0 stars on GitHub - 1 maintainer
scrapy-rabbitmq-publisher 0.1.4
Scrapy Item Pipeline to send items to RabbitMQ
1 version - Latest release: over 5 years ago - 2 dependent repositories - 57 downloads last month - 13 stars on GitHub - 1 maintainer
scrapy-random-useragent-pro 1.0.0
A random user-agent for all your needs
1 version - Latest release: over 4 years ago - 1 dependent repositories - 29 downloads last month - 9 stars on GitHub - 1 maintainer
scrapy-pyppeteer 0.0.15
Pyppeteer integration for Scrapy
14 versions - Latest release: about 3 years ago - 2 dependent repositories - 527 downloads last month - 60 stars on GitHub - 1 maintainer
scrapy-puppeteer 0.0.1b0
Scrapy with puppeteer
1 version - Latest release: over 5 years ago - 1 dependent repositories - 41 downloads last month - 110 stars on GitHub - 1 maintainer
scrapy-qiniu 0.1.2
Scrapy pipeline extension for qiniu.com
3 versions - Latest release: over 8 years ago - 3 dependent repositories - 27 downloads last month - 24 stars on GitHub - 1 maintainer
scrapyproxyport 1.1.1
Proxy Port Scrapy middleware
5 versions - Latest release: about 1 year ago - 1 dependent repositories - 40 downloads last month - 3 stars on GitHub - 1 maintainer
scrapy-proxyland-middleware 1.0
Middleware to easily implement Proxyland in Scrapy projects
1 version - Latest release: over 2 years ago - 1 dependent repositories - 11 downloads last month - 1 stars on GitHub - 1 maintainer
scrapy-proxycrawl-middleware 1.2.0
Scrapy ProxyCrawl Proxy Middleware: ProxyCrawl interfacing middleware for Scrapy
5 versions - Latest release: 11 months ago - 2 dependent repositories - 201 downloads last month - 10 stars on GitHub - 1 maintainer
scrapy-promise 0.0.6
Promise-style workflow for Scrapy
1 version - Latest release: over 3 years ago - 1 dependent repositories - 13 downloads last month - 1 maintainer
scrapy-prometheus 0.4.4
Exporting scrapy stats as prometheus metrics through pushgateway service
9 versions - Latest release: over 6 years ago - 1 dependent repositories - 523 downloads last month - 6 stars on GitHub - 1 maintainer
scrapy-plus 1.0.5
scrapy 常用爬网必备工具包
6 versions - Latest release: over 3 years ago - 1 dependent repositories - 32 downloads last month - 22 stars on GitHub - 1 maintainer
scrapy-omdena-latam 0.0.2
Web Crawling application running Scrapy Tool extracting official policies
2 versions - Latest release: over 3 years ago - 1 dependent repositories - 25 downloads last month - 0 stars on GitHub - 2 maintainers
scrapy-nimbus 0.5.3
scrapy extend
18 versions - Latest release: about 7 years ago - 1 dependent repositories - 17 downloads last month - 1 maintainer
scrapy-new 0.2.1
A package providing code generation command for scrapy CLI
17 versions - Latest release: almost 4 years ago - 1 dependent repositories - 110 downloads last month - 2 stars on GitHub - 1 maintainer
scrapymysql 0.1.2
make scrapy store data into mysql easier
1 version - Latest release: about 6 years ago - 1 dependent repositories - 9 downloads last month - 2 stars on GitHub - 1 maintainer
scrapy-multifeedexporter 0.1.1
Export scraped items of different types to multiple feeds.
2 versions - Latest release: over 9 years ago - 3 dependent repositories - 13 downloads last month - 7 stars on GitHub - 1 maintainer
scrapymongodb 0.4.3
Scrapy pipeline which allow you to store scrapy items in MongoDB database.
8 versions - Latest release: about 7 years ago - 1 dependent repositories - 36 downloads last month - 101 stars on GitHub - 1 maintainer
scrapy-kafka 0.1.1
Kafka-based components for Scrapy
1 version - Latest release: almost 9 years ago - 2 dependent repositories - 14 downloads last month - 81 stars on GitHub - 1 maintainer