Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "scrapy" keyword

foot-fixtures 1.0.3
Utility to find schedule for a football club
4 versions - Latest release: almost 10 years ago - 2 dependent repositories - 18 downloads last month - 1 stars on GitHub - 1 maintainer
scrapy-redis-sentinel 0.7.2
Redis Cluster for Scrapy.
8 versions - Latest release: over 2 years ago - 1 dependent repositories - 62 downloads last month - 31 stars on GitHub - 1 maintainer
Top 2.4% on pypi.org
advertools 0.14.2
Productivity and analysis tools for online marketing
57 versions - Latest release: 4 months ago - 4 dependent packages - 30 dependent repositories - 101 thousand downloads last month - 1,073 stars on GitHub - 1 maintainer
ska-advertools 0.14.0a8 removed
Digital Marketing productivity and analysis tools.
1 version - Latest release: about 2 years ago - 693 stars on GitHub
s01.client 0.5.0
JSON-RPC 2.0 s01.worker client
1 version - Latest release: almost 13 years ago - 2 dependent repositories - 5 downloads last month - 2 maintainers
invana-bot 0.1.36
A web spider framework that can transform websites into datasets with Crawl, Transform and Index ...
38 versions - Latest release: almost 5 years ago - 1 dependent repositories - 96 downloads last month - 31 stars on GitHub - 1 maintainer
Top 8.4% on pypi.org
scrapy-wayback-machine 1.0.3
A Scrapy middleware for scraping Wayback Machine snapshots from archive.org.
4 versions - Latest release: about 3 years ago - 8 dependent repositories - 378 downloads last month - 107 stars on GitHub - 2 maintainers
scrapy-kinesispipeline 0.3.9
Scrapy pipeline to store aggregated items into AWS Kinesis
17 versions - Latest release: over 5 years ago - 1 dependent repositories - 325 downloads last month - 2 stars on GitHub - 1 maintainer
scrapy-selenium-middleware 0.0.5
Scrapy middleware for downloading a page html source using selenium, and interacting with the web...
5 versions - Latest release: over 3 years ago - 1 dependent package - 1 dependent repositories - 52 downloads last month - 9 stars on GitHub - 1 maintainer
lazy-crawler
Lazy Crawler is a Python package that simplifies web scraping tasks. It builds upon Scrapy, a pow...
3 versions - 424 downloads last month - 0 stars on GitHub - 1 maintainer
Top 1.5% on pypi.org
scrapy-splash 0.9.0
JavaScript support for Scrapy using Splash
11 versions - Latest release: over 1 year ago - 5 dependent packages - 429 dependent repositories - 134 thousand downloads last month - 3,092 stars on GitHub - 4 maintainers
spymanga 0.1
A lib to download manga chapters
1 version - Latest release: over 6 years ago - 1 dependent repositories - 4 downloads last month - 2 stars on GitHub - 1 maintainer
tweetscraper 1.2.6 💰
TweetScraper is a simple crawler/spider for Twitter Search without using API
8 versions - Latest release: about 6 years ago - 1 dependent repositories - 22 downloads last month - 977 stars on GitHub - 1 maintainer
indoquake 0.0.5
A Latest Earthquake Detection Package Taken Based on BMKG | Meteorological, Climatological, and G...
5 versions - Latest release: 9 months ago - 33 downloads last month - 0 stars on GitHub - 1 maintainer
penknife 2021.5.15
Pacote de automação de processos
3 versions - Latest release: about 3 years ago - 1 dependent repositories - 19 downloads last month - 1 maintainer
spidermanager 1.3.1
Admin ui for spider service
2 versions - Latest release: about 3 years ago - 1 dependent repositories - 22 downloads last month - 2 stars on GitHub - 1 maintainer
scrapy-cookies 0.2.5
A middleware of cookies persistence for Scrapy
7 versions - Latest release: almost 6 years ago - 1 dependent repositories - 659 downloads last month - 25 stars on GitHub - 1 maintainer
web-crawler-plus 0.9.14
A micro-framework to crawl the web pages with crawlers configs. It can use MongoDB, Elasticsearch...
17 versions - Latest release: about 6 years ago - 1 dependent repositories - 210 downloads last month - 31 stars on GitHub - 1 maintainer
scrapy-playwright-full 0.0.3404
Playwright integration for Scrapy
7 versions - Latest release: about 1 month ago - 181 downloads last month - 862 stars on GitHub - 1 maintainer
hoopa 0.0.12
Asynchronous crawler micro-framework based on python.
8 versions - Latest release: almost 3 years ago - 1 dependent repositories - 21 downloads last month - 8 stars on GitHub - 1 maintainer
sodo 1.0.1
Redis-based scheduler and Message queue Spider for Scrapy, Provide more flexible and practical ...
2 versions - Latest release: about 5 years ago - 1 dependent repositories - 11 downloads last month - 4 stars on GitHub - 1 maintainer
volleystats 0.8.1
Command-line tool to scrape volleyball statistics from Data Project Web Competition websites
8 versions - Latest release: 4 months ago - 43 downloads last month - 8 stars on GitHub - 1 maintainer
spider-renderer 0.2.3
Building a modular crawler template system based on Jinja2.
12 versions - Latest release: almost 4 years ago - 1 dependent repositories - 37 downloads last month - 0 stars on GitHub - 1 maintainer
scrapy-googlechat 1.1
Send crawl reports from Scrapy spiders to Google Chat
2 versions - Latest release: over 3 years ago - 1 dependent repositories - 18 downloads last month - 1 stars on GitHub - 1 maintainer
kslabs-scrapy-heroku 0.7.1
Utilities for running scrapy on heroku
1 version - Latest release: over 4 years ago - 1 dependent repositories - 6 downloads last month - 0 stars on GitHub - 1 maintainer
scrapy-redis-bloomfilter-block-cluster 1.9.0
Scrapy Redis BloomFilter Block Cluster
10 versions - Latest release: over 4 years ago - 1 dependent repositories - 35 downloads last month - 22 stars on GitHub - 1 maintainer
crawltools 0.2.1 💰
Simple crawlers
8 versions - Latest release: about 3 years ago - 26 downloads last month - 1 stars on GitHub - 1 maintainer
Top 1.6% on pypi.org
scrapy-redis 0.7.3
Redis-based components for Scrapy.
18 versions - Latest release: almost 2 years ago - 6 dependent packages - 392 dependent repositories - 7.44 thousand downloads last month - 5,448 stars on GitHub - 3 maintainers
asyncpy 1.2.0
Use asyncio and aiohttp's concatenated web crawler framework
14 versions - Latest release: over 1 year ago - 1 dependent repositories - 81 downloads last month - 102 stars on GitHub - 1 maintainer
scrapy-rethinkdb 0.0.4
Scrapy pipeline for rethinkdb.
2 versions - Latest release: almost 10 years ago - 2 dependent repositories - 16 downloads last month - 3 stars on GitHub - 1 maintainer
poseidonctrip 0.9.1
distributed crawler
1 version - Latest release: about 5 years ago - 1 dependent repositories - 7 downloads last month - 1 maintainer
scrapy-manipulate-request 0.0.2
An async scrapy request downloader middleware, support random request and response manipulation.
2 versions - Latest release: 12 months ago - 40 downloads last month - 10 stars on GitHub - 1 maintainer
Top 9.6% on pypi.org
scrapyelasticsearch 0.9.2
Scrapy pipeline which allow you to store multiple scrapy items in Elastic Search.
22 versions - Latest release: over 4 years ago - 1 dependent repositories - 707 downloads last month - 327 stars on GitHub - 3 maintainers
Top 2.8% on pypi.org
scrapy-playwright 0.0.34
Playwright integration for Scrapy
35 versions - Latest release: 5 months ago - 4 dependent packages - 22 dependent repositories - 36.6 thousand downloads last month - 862 stars on GitHub - 1 maintainer
scrapy-sqs-exporter 1.1.0
Scrapy extension for outputting scraped items to an Amazon SQS instance
6 versions - Latest release: almost 6 years ago - 1 dependent repositories - 30 downloads last month - 6 stars on GitHub - 1 maintainer
scrapy-autoextract 0.7.0
Zyte Automatic Extraction API integration for Scrapy
11 versions - Latest release: almost 3 years ago - 1 dependent repositories - 248 downloads last month - 54 stars on GitHub - 2 maintainers
scrapy-venom 0.1.1
Generic classes to deal with data scraping using Scrapy
2 versions - Latest release: over 8 years ago - 2 dependent repositories - 4 downloads last month - 5 stars on GitHub - 1 maintainer
scrapymongodb 0.4.3
Scrapy pipeline which allow you to store scrapy items in MongoDB database.
8 versions - Latest release: over 7 years ago - 1 dependent repositories - 30 downloads last month - 101 stars on GitHub - 1 maintainer
Top 2.7% on pypi.org
icrawler 0.6.8
A multi-thread crawler framework with many builtin image crawlers provided.
44 versions - Latest release: 26 days ago - 2 dependent packages - 86 dependent repositories - 84.9 thousand downloads last month - 824 stars on GitHub - 2 maintainers
geocrawl 0.2.2
A library to stream geocaching related entities from the official website
3 versions - Latest release: over 6 years ago - 1 dependent repositories - 12 downloads last month - 0 stars on GitHub - 1 maintainer
fishfishjump 0.2.3
FishFishJump is a solution that simply and basic for search engines and provide multiple demos th...
8 versions - Latest release: over 6 years ago - 1 dependent repositories - 27 downloads last month - 54 stars on GitHub - 1 maintainer
proxyport2 1.1.0
Proxy Port SDK
3 versions - Latest release: about 1 year ago - 2 dependent packages - 32 downloads last month - 2 stars on GitHub - 1 maintainer
scrapycouchdb 0.2
Scrapy pipeline which allow you to store scrapy items in CouchDB database.
2 versions - Latest release: over 12 years ago - 1 dependent repositories - 14 downloads last month - 17 stars on GitHub - 1 maintainer
scrapy-broadsoftxchange 1.17
Download documents and published software from Broadsoft Xchange
2 versions - Latest release: over 8 years ago - 2 dependent repositories - 18 downloads last month - 5 stars on GitHub - 1 maintainer
crappyspider 0.3
Test your site.
2 versions - Latest release: over 9 years ago - 2 dependent repositories - 14 downloads last month - 1 maintainer
abupy 0.4.0
阿布量化系统
6 versions - Latest release: over 6 years ago - 1 dependent repositories - 305 downloads last month - 11,138 stars on GitHub - 1 maintainer
scrapy-folder-tree 0.1.3 💰
A scrapy pipeline which stores files using folder trees.
1 version - Latest release: over 2 years ago - 1 dependent repositories - 17 downloads last month - 8 stars on GitHub - 1 maintainer
scrapy-itemagic 0.2.4
Scrapy item parsing tools.
11 versions - Latest release: about 9 years ago - 2 dependent repositories - 34 downloads last month - 3 stars on GitHub - 1 maintainer
scrapy-rabbitmq-publisher 0.1.4
Scrapy Item Pipeline to send items to RabbitMQ
1 version - Latest release: over 5 years ago - 2 dependent repositories - 22 downloads last month - 13 stars on GitHub - 1 maintainer
scrapysolr 0.2.0
Scrapy pipeline which allows you to store scrapy items in a Solr server.
2 versions - Latest release: about 8 years ago - 2 dependent repositories - 17 downloads last month - 19 stars on GitHub - 1 maintainer
boost-siper 0.9 removed
横冲直闯无回调写法的高速爬虫框架
8 versions - Latest release: 8 months ago - 480 downloads last month - 4 stars on GitHub - 1 maintainer
boost-spiper 1.0 removed
横冲直闯 自由奔放 无回调 无继承写法的高速爬虫框架
1 version - Latest release: 8 months ago - 4 stars on GitHub
boost-spider 1.1
横冲直闯 自由奔放 无回调 无继承写法的高速爬虫框架
2 versions - Latest release: 6 months ago - 61 downloads last month - 11 stars on GitHub - 1 maintainer
Top 6.5% on pypi.org
scrapy-cloudflare-middleware 0.0.1
A Scrapy Middleware to bypass the CloudFlare's anti-bot protection
2 versions - Latest release: over 6 years ago - 7 dependent repositories - 1.84 thousand downloads last month - 103 stars on GitHub - 1 maintainer
yugioh-scraper 0.2.0
Yu-Gi-Oh! Scraper is a project that crawls websites and APIs and extracts Yu-Gi-Oh! related data ...
5 versions - Latest release: over 1 year ago - 26 downloads last month - 1 stars on GitHub - 1 maintainer
scrapy-ssdb 0.0.1
scrapy and ssdb
1 version - Latest release: almost 7 years ago - 1 dependent repositories - 9 downloads last month - 1 maintainer
crawl-frontier 0.2.0
A flexible frontier for web crawlers
7 versions - Latest release: over 9 years ago - 2 dependent repositories - 40 downloads last month - 1,278 stars on GitHub - 3 maintainers
Top 4.4% on pypi.org
feapder 1.9.0
feapder是一款支持分布式、批次采集、数据防丢、报警丰富的python爬虫框架
175 versions - Latest release: 3 months ago - 4 dependent repositories - 84.1 thousand downloads last month - 2,596 stars on GitHub - 1 maintainer
weblocust 1.0.3
A more Powerful Spider System in Python based on pyspider
4 versions - Latest release: over 7 years ago - 1 dependent repositories - 52 downloads last month - 6 stars on GitHub - 1 maintainer
thisisapogreq 21.3.3 💰
Faster & simpler requests replacement for Python
1 version - Latest release: over 2 years ago - 1 dependent repositories - 19 downloads last month - 1,084 stars on GitHub - 1 maintainer
new-frontera 0.9.0 💰
A scalable frontier for web crawlers
2 versions - Latest release: 4 months ago - 9 downloads last month - 0 stars on GitHub - 1 maintainer
scrapy-allen 0.1.0
模块描述
1 version - Latest release: over 3 years ago - 1 dependent repositories - 11 downloads last month - 1 maintainer
Top 2.8% on pypi.org
pyspider 0.3.10
A Powerful Spider System in Python
17 versions - Latest release: about 6 years ago - 1 dependent package - 98 dependent repositories - 1.04 thousand downloads last month - 16,371 stars on GitHub - 1 maintainer
pyspider3 0.0.2
由于pyspider模块年久失修,pyspider3基于原作者github源码制作而成
1 version - Latest release: almost 3 years ago - 1 dependent repositories - 24 downloads last month - 16,371 stars on GitHub - 1 maintainer
s01.core 0.5.0
Scrapy worker core packages
1 version - Latest release: almost 13 years ago - 2 dependent repositories - 5 downloads last month - 2 maintainers
pyfeeds 2024.5.1
DIY Atom feeds in times of social media and paywalls
4 versions - Latest release: about 1 month ago - 1 dependent repositories - 149 downloads last month - 79 stars on GitHub - 2 maintainers
aioscpy 0.3.12
An asyncio + aiolibs crawler imitate scrapy framework
45 versions - Latest release: about 1 year ago - 1 dependent repositories - 92 downloads last month - 129 stars on GitHub - 1 maintainer
scraprom 1.0.2
Scrapy stats collector for prometheus
3 versions - Latest release: about 5 years ago - 1 dependent repositories - 62 downloads last month - 1 stars on GitHub - 1 maintainer
ze-the-scraper 0.0.17.dev1
Scaper to lager portal of news in Brazil.
2 versions - Latest release: about 7 years ago - 1 dependent repositories - 23 downloads last month - 5 stars on GitHub - 1 maintainer
scrapy-item 0.0.3
Item with general/unknown/dynamic fields
2 versions - Latest release: almost 4 years ago - 1 dependent repositories - 221 downloads last month - 0 stars on GitHub - 1 maintainer
scrapy-crawlbase-middleware 1.0.0
Scrapy Crawlbase Proxy Middleware: Crawlbase interfacing middleware for Scrapy
1 version - Latest release: 11 months ago - 11 downloads last month - 6 stars on GitHub - 1 maintainer
scrapy-aiohttp 0.1.2
Scrapy middleware for sending requests with aiohttp.
3 versions - Latest release: 6 months ago - 19 downloads last month - 1 stars on GitHub - 1 maintainer
ze 0.0.17.dev1
Scaper to lager portal of news in Brazil.
2 versions - Latest release: about 7 years ago - 1 dependent repositories - 27 downloads last month - 5 stars on GitHub - 1 maintainer
aduana 0.2.1
Bindings for Aduana library
2 versions - Latest release: almost 9 years ago - 3 dependent repositories - 35 downloads last month - 53 stars on GitHub - 2 maintainers
spydy 0.1.25
light-weight high-level web-crawling framework
25 versions - Latest release: about 3 years ago - 1 dependent repositories - 36 downloads last month - 2 stars on GitHub - 1 maintainer
scrapy-rotated-proxy 0.1.5
A middleware to change proxy rotated for Scrapy
12 versions - Latest release: almost 6 years ago - 8 dependent repositories - 1.04 thousand downloads last month - 24 stars on GitHub - 1 maintainer
rankcomp 0.0.2
high fided extraction of differential expression genes without considering the batch effects
1 version - Latest release: about 4 years ago - 1 dependent repositories - 12 downloads last month - 5,392 stars on GitHub - 1 maintainer
scrapfly-sdk 0.8.16
Scrapfly SDK for Scrapfly
38 versions - Latest release: about 2 months ago - 2 dependent repositories - 13.1 thousand downloads last month - 18 stars on GitHub - 1 maintainer
spider-feeder 0.3.0
spider-feeder is a library to help loading inputs to scrapy spiders.
3 versions - Latest release: over 4 years ago - 1 dependent repositories - 55 downloads last month - 14 stars on GitHub - 1 maintainer
haipproxy2 0.1.4
High aviariable proxy pool client for crawlers.
5 versions - Latest release: over 3 years ago - 1 dependent repositories - 26 downloads last month - 5,392 stars on GitHub - 1 maintainer
haipproxy 0.11.6
High aviariable proxy pool client for crawlers.
6 versions - Latest release: almost 6 years ago - 1 dependent repositories - 21 downloads last month - 5,392 stars on GitHub - 1 maintainer
scrapy-statsd 1.0.1
Publish Scrapy stats to statsd
2 versions - Latest release: about 6 years ago - 3 dependent repositories - 53 downloads last month - 7 stars on GitHub - 1 maintainer
Top 6.7% on pypi.org
django-dynamic-scraper 0.13.3
Creating Scrapy scrapers via the Django admin interface
65 versions - Latest release: almost 3 years ago - 34 dependent repositories - 192 downloads last month - 1,138 stars on GitHub - 1 maintainer
scrapy-random-ua 0.3
Scrapy Middleware to set a random User-Agent for every Request.
1 version - Latest release: over 6 years ago - 1 dependent repositories - 13 downloads last month - 1 stars on GitHub - 1 maintainer
cli-yams 1.1
Yet Another Media Scraper
2 versions - Latest release: 5 months ago - 11 downloads last month - 3 stars on GitHub - 1 maintainer
tkit-scrapy-mongo 0.0.0.116654862
Terry toolkit sdk for tkit_scrapy_mongo ,
1 version - Latest release: over 1 year ago - 14 downloads last month - 0 stars on GitHub - 1 maintainer
scrapy-twostage 0.0.4
Two stage Scrapy spider: download and extract
4 versions - Latest release: about 7 years ago - 1 dependent repositories - 16 downloads last month - 3 stars on GitHub - 1 maintainer
scrapy-podcast-rss 0.0.3
Scrapy pipeline and items to create and store RSS feeds for podcasts.
3 versions - Latest release: about 4 years ago - 1 dependent repositories - 34 downloads last month - 1 stars on GitHub - 1 maintainer
nimbus-scrapy-rabbitmq 0.0.3
nimbus_scrapy_rabbitmq
2 versions - Latest release: about 4 years ago - 1 dependent repositories - 11 downloads last month - 1 maintainer
scrapy-s3pipeline 0.7.0
Scrapy pipeline to store chunked items into Amazon S3 or Google Clous Storage bucket
8 versions - Latest release: over 3 years ago - 1 dependent repositories - 1.14 thousand downloads last month - 71 stars on GitHub - 1 maintainer
scrapy-fieldstats 0.2.0
A Scrapy extension to generate a summary of fields coverage from your scraped data.
10 versions - Latest release: about 4 years ago - 2 dependent repositories - 166 downloads last month - 17 stars on GitHub - 1 maintainer
scrapy-mongoengine-item 0.1.5
Scrapy extension to write scraped items using MongoEngine documents
4 versions - Latest release: about 5 years ago - 1 dependent repositories - 91 downloads last month - 2 stars on GitHub - 1 maintainer
pyleapo 1.0.3
A Python API for accessing your Leap Card balance, overview, and travel credit history.
1 version - Latest release: over 3 years ago - 1 dependent repositories - 14 downloads last month - 0 stars on GitHub - 1 maintainer
scraper-factory 0.2.1
Scraping library to retrieve data from useful pages, such as Amazon wishlists
3 versions - Latest release: almost 5 years ago - 1 dependent repositories - 35 downloads last month - 0 stars on GitHub - 1 maintainer
scrapy-heroku 0.7.1
Utilities for running scrapy on heroku
8 versions - Latest release: over 11 years ago - 9 dependent repositories - 34 downloads last month - 67 stars on GitHub - 1 maintainer
nimbus-scrapy 4.1.2
nimbus_scrapy
120 versions - Latest release: about 4 years ago - 1 dependent repositories - 356 downloads last month - 1 maintainer
Top 9.6% on pypi.org
scrapy-mysql-pipeline 2019.7.19
Asynchronous mysql Scrapy item pipeline
3 versions - Latest release: almost 5 years ago - 4 dependent repositories - 37 downloads last month - 48 stars on GitHub - 1 maintainer
scrapy-http-pipeline 0.2.0
Scrapy HTTP POST items pipeline
2 versions - Latest release: over 3 years ago - 1 dependent repositories - 23 downloads last month - 1 stars on GitHub - 1 maintainer
favorites-crawler 0.2.0
Crawl your personal favorite images, photo albums, comics from website. Support pixiv, yande.re f...
26 versions - Latest release: 5 months ago - 1 dependent repositories - 144 downloads last month - 16 stars on GitHub - 1 maintainer
shub-cli 2.0.1
A CLI at your hands to deal with the features of ScrapingHub.
5 versions - Latest release: over 7 years ago - 1 dependent repositories - 38 downloads last month - 16 stars on GitHub - 1 maintainer