Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "crawl" keyword

sitecrawl 1.0.5
Simple Python3 module to crawl a website and extract URLs
6 versions - Latest release: over 2 years ago - 1 dependent repositories - 42 downloads last month - 5 stars on GitHub - 1 maintainer
nudecrawler 0.3.19
Crawl telegra.ph for nude pictures and videos
38 versions - Latest release: about 1 year ago - 237 downloads last month - 265 stars on GitHub - 1 maintainer
Top 7.9% on pypi.org
scrape 0.11.3
a command-line web scraping tool
112 versions - Latest release: over 2 years ago - 9 dependent repositories - 466 downloads last month - 148 stars on GitHub - 1 maintainer
cc-net 1.0.0
Tools to download and clean Common Crawl
2 versions - Latest release: over 3 years ago - 1 dependent repositories - 3.59 thousand downloads last month - 917 stars on GitHub - 1 maintainer
crawl-requests 2.2.8
crawl_requests(like requests) can update ua and proxy automatically.
17 versions - Latest release: over 6 years ago - 99 downloads last month - 0 stars on GitHub - 1 maintainer
winzig 0.3.0
A tiny search engine for personal use.
27 versions - Latest release: 2 months ago - 282 downloads last month - 3 stars on GitHub - 1 maintainer
pithytools 0.0.1
Pithytools is a collection of command line utilities.
1 version - Latest release: about 4 years ago - 1 dependent repositories - 12 downloads last month - 5 stars on GitHub - 1 maintainer
pithy 0.0.13
Pithy is a collection of utility libraries for Python 3.
11 versions - Latest release: about 4 years ago - 8 dependent repositories - 38 downloads last month - 5 stars on GitHub - 1 maintainer
ekrhizoc 0.1.2
A simple python web crawler
4 versions - Latest release: almost 4 years ago - 1 dependent repositories - 28 downloads last month - 0 stars on GitHub - 1 maintainer
spotify2csv 0.4.2
Convert Spotify URLs to tracks info in CSV format
5 versions - Latest release: over 6 years ago - 1 dependent repositories - 38 downloads last month - 1 stars on GitHub - 1 maintainer
seolint 0.2
SEO linting tool.
2 versions - Latest release: over 12 years ago - 2 dependent repositories - 9 downloads last month - 6 stars on GitHub - 1 maintainer
scrapy-plus 1.0.5
scrapy 常用爬网必备工具包
6 versions - Latest release: over 3 years ago - 1 dependent repositories - 40 downloads last month - 23 stars on GitHub - 1 maintainer
scrapy-multifeedexporter 0.1.1
Export scraped items of different types to multiple feeds.
2 versions - Latest release: over 9 years ago - 3 dependent repositories - 8 downloads last month - 7 stars on GitHub - 1 maintainer
steam-review-scraper 0.1.0
A package to scrape game reviews from Steam.
1 version - Latest release: about 3 years ago - 1 dependent repositories - 45 downloads last month - 4 stars on GitHub - 1 maintainer
kneescrape 0.13
A simple script to crawl a website and scrape for emails and unique words to create relevant dict...
3 versions - Latest release: over 6 years ago - 1 dependent repositories - 6 downloads last month - 0 stars on GitHub - 1 maintainer
mozia-modules 1.8.3
Modules for mozia
80 versions - Latest release: over 6 years ago - 2 dependent repositories - 66 downloads last month - 1 maintainer
alltweets 0.2
A very simple Twitter crawler that can collect all friends, followers, and tweets of a specified ...
2 versions - Latest release: about 8 years ago - 2 dependent repositories - 15 downloads last month - 0 stars on GitHub - 1 maintainer
solidscraper 0.7.7
This package lets your script scrape web sites. JQuery-Like API.
4 versions - Latest release: almost 4 years ago - 1 dependent repositories - 25 downloads last month - 3 stars on GitHub - 1 maintainer
scrape-google 0.0.2
A package used to scrape top links from google
2 versions - Latest release: almost 4 years ago - 1 dependent repositories - 20 downloads last month - 0 stars on GitHub - 1 maintainer
dl-coursera 0.1.2
A simple, fast, and reliable Coursera crawling & downloading tool
3 versions - Latest release: over 4 years ago - 2 dependent repositories - 120 downloads last month - 139 stars on GitHub - 1 maintainer
proxy_pool 0.0.3
A proxy pool which you can get an avaiable proxy http server.
3 versions - Latest release: about 7 years ago - 2 dependent repositories - 30 downloads last month - 5 stars on GitHub - 1 maintainer
filelist 1.1.7
Easily list some files in a directory, and exclude others.
8 versions - Latest release: about 8 years ago - 23 dependent repositories - 60 downloads last month - 0 stars on GitHub - 1 maintainer
renfepy 2.0.0
Python library for crawl trains from renfe
10 versions - Latest release: over 1 year ago - 1 dependent repositories - 34 downloads last month - 0 stars on GitHub - 1 maintainer
lurk 0.1.3
Extract html from one or multiple urls
4 versions - Latest release: over 8 years ago - 4 dependent repositories - 308 downloads last month - 0 stars on GitHub - 1 maintainer
asyncpy 1.2.0
Use asyncio and aiohttp's concatenated web crawler framework
14 versions - Latest release: over 1 year ago - 1 dependent repositories - 81 downloads last month - 102 stars on GitHub - 1 maintainer
django-scraper 0.3.8
Django application for collecting online content following user-defined instructions
6 versions - Latest release: about 9 years ago - 5 dependent repositories - 20 downloads last month - 19 stars on GitHub - 1 maintainer
contentfetch 0.0.5
Extracting the content from the webpage
5 versions - Latest release: over 1 year ago - 30 downloads last month - 1 maintainer
fcrawler 1.0.1
Python application that can be used to copy files of a given file type from a folder directory.
4 versions - Latest release: about 3 years ago - 1 dependent repositories - 18 downloads last month - 2 stars on GitHub - 1 maintainer
simspider 1.1.0
简单的定向爬虫框架
3 versions - Latest release: almost 7 years ago - 1 dependent repositories - 8 downloads last month - 1 maintainer
lxparse 1.0.8
A library for intelligently parsing list page links and details page contents
9 versions - Latest release: over 1 year ago - 20 downloads last month - 15 stars on GitHub - 1 maintainer
facehugger 0.1.6
Extracts faces from an image
7 versions - Latest release: over 10 years ago - 2 dependent repositories - 26 downloads last month - 10 stars on GitHub - 1 maintainer
modules-for-mozia 1.1.0
Modules for mozia
1 version - Latest release: over 6 years ago - 2 dependent repositories - 12 downloads last month - 1 maintainer
emailcrawlerpy 2.0
Small tool to crawl email from given websites
1 version - Latest release: over 3 years ago - 1 dependent repositories - 6 downloads last month - 1 maintainer
fs-walker 0.0.1
Walk your file system to check duplicate or missing files
1 version - Latest release: over 4 years ago - 1 dependent repositories - 8 downloads last month - 0 stars on GitHub - 1 maintainer
pug-ann 0.0.22
# pug-ann
15 versions - Latest release: about 9 years ago - 4 dependent repositories - 63 downloads last month - 2 stars on GitHub - 1 maintainer
koreanewscrawler 1.51
Crawl the korean news
9 versions - Latest release: about 2 years ago - 1 dependent repositories - 64 downloads last month - 217 stars on GitHub - 1 maintainer
stweet 2.1.1
Package to scrap tweets
20 versions - Latest release: over 1 year ago - 1 dependent repositories - 283 downloads last month - 572 stars on GitHub - 1 maintainer
libgenapi 1.2.1
Library to search on Library genesis
10 versions - Latest release: about 6 years ago - 4 dependent repositories - 97 downloads last month - 113 stars on GitHub - 1 maintainer
alfeios 1.4
Enrich your command-line shell with Herculean cleaning capabilities
3 versions - Latest release: 8 months ago - 26 downloads last month - 0 stars on GitHub - 1 maintainer
universal-utils 0.0.1
python utils
1 version - Latest release: almost 5 years ago - 1 dependent repositories - 7 downloads last month - 1 stars on GitHub - 1 maintainer
xextract 0.1.8
Extract structured data from HTML and XML documents like a boss.
17 versions - Latest release: over 4 years ago - 1 dependent package - 4 dependent repositories - 1.14 thousand downloads last month - 50 stars on GitHub - 1 maintainer
weibo-scraper 1.0.6
Simple Weibo Scraper
9 versions - Latest release: about 6 years ago - 1 dependent repositories - 1.54 thousand downloads last month - 91 stars on GitHub - 1 maintainer
crawlmap 1.3
A python3 script to change your crawling logs to a mindmap
4 versions - Latest release: about 2 years ago - 23 downloads last month - 7 stars on GitHub - 1 maintainer
salscraper 0.2.1
A scarping tool
3 versions - Latest release: over 4 years ago - 1 dependent repositories - 14 downloads last month - 1 maintainer
mozia 1.0.0
Modules for mozia
1 version - Latest release: about 6 years ago - 1 dependent repositories - 7 downloads last month - 1 maintainer
Top 8.9% on pypi.org
scweet 0.3.3 💰
Tool for scraping Tweets
14 versions - Latest release: over 3 years ago - 1 dependent repositories - 1.43 thousand downloads last month - 978 stars on GitHub - 1 maintainer
obsidian 1.0.1
Obsidian make web crawl easier
5 versions - Latest release: over 7 years ago - 1 dependent repositories - 146 downloads last month - 5 stars on GitHub - 1 maintainer
scr 0.12.0
Command-line Utility for Web Scraping
13 versions - Latest release: almost 2 years ago - 4 dependent repositories - 307 downloads last month - 4 stars on GitHub - 1 maintainer
smockrawl 0.3.0
Smockeo API crawler
6 versions - Latest release: about 2 years ago - 1 dependent repositories - 66 downloads last month - 0 stars on GitHub - 1 maintainer
structure-spider 1.3.5
multi requests to combine a structure item.
49 versions - Latest release: over 4 years ago - 1 dependent repositories - 283 downloads last month - 29 stars on GitHub - 1 maintainer
xhs 0.2.13
xiaohongshu crawl sdk.
27 versions - Latest release: about 1 month ago - 1 dependent repositories - 805 downloads last month - 808 stars on GitHub - 1 maintainer
crawlist 0.1.0
A universal solution for web crawling lists
10 versions - Latest release: 26 days ago - 613 downloads last month - 23 stars on GitHub - 1 maintainer
easy-server-indexing 0.1.1
The following package can be integrated into a server indexing softwares which will skip the alre...
2 versions - Latest release: over 4 years ago - 1 dependent repositories - 28 downloads last month - 1 stars on GitHub - 2 maintainers
zagoload 0.5.1
Download files(http,ftp). Supports: cache, uniform access to remote and local files
2 versions - Latest release: over 7 years ago - 1 dependent repositories - 9 downloads last month - 2 stars on GitHub - 1 maintainer
concurrentfloodscraper 1.0.1
A concurrent flood web scraper.
2 versions - Latest release: over 7 years ago - 1 dependent repositories - 43 downloads last month - 0 stars on GitHub - 1 maintainer
web-walker 3.1.5
your can crawl web pages with litte settings. based on scrapy.
43 versions - Latest release: almost 7 years ago - 1 dependent repositories - 145 downloads last month - 53 stars on GitHub - 1 maintainer
tvstats 0.0.2
Scrape data of all the episodes of a Tv Series from IMDB
2 versions - Latest release: almost 9 years ago - 2 dependent repositories - 14 downloads last month - 7 stars on GitHub - 1 maintainer
scrapy-redirect 0.1.0
Restrict authorized Scrapy redirections to the website start_urls
1 version - Latest release: almost 11 years ago - 2 dependent repositories - 14 downloads last month - 0 stars on GitHub - 1 maintainer
py3spider 1.0.7
仿Scrapy实现,基于py3.4+的多线程异步网络爬虫,实例请访问https://github.com/ChenL1994/py3spider/tree/master/examples
6 versions - Latest release: almost 7 years ago - 1 dependent repositories - 20 downloads last month - 1 maintainer
pixivhack 0.1.5
Pixiv Hack is a tool to automatically crawl illustrations filtered by ratings on www.pixiv.net
1 version - Latest release: over 8 years ago - 2 dependent repositories - 12 downloads last month - 15 stars on GitHub - 1 maintainer
naverscrap 1.0.6
A Naver News Scraping tool
7 versions - Latest release: about 3 years ago - 1 dependent repositories - 23 downloads last month - 0 stars on GitHub - 1 maintainer
jike 0.5.0
Jike Metro 🚇 : Jike Python SDK
5 versions - Latest release: about 6 years ago - 1 dependent repositories - 44 downloads last month - 202 stars on GitHub - 1 maintainer
icrawl 1.0.6
iCrawl
7 versions - Latest release: over 8 years ago - 1 dependent repositories - 5 downloads last month - 1 maintainer
analyze_site 0.1.3
Utility to crawl web site looking for key words
2 versions - Latest release: over 7 years ago - 35 downloads last month - 0 stars on GitHub - 1 maintainer
ebook-crawler 2.1.8 removed 💰
This project was moved to https://pypi.org/project/lightnovel-crawler/
30 versions - Latest release: over 5 years ago - 1 dependent repositories - 48 downloads last month - 1,259 stars on GitHub - 1 maintainer