An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "crawl" keyword

View the packages on the pypi.org package registry that are tagged with the "crawl" keyword.

font-obfuscator 0.1.2
字体反爬、字体混淆工具是一个用于混淆字体文件的工具,可以将字体文件中的字形进行混淆,从而防止字体文件被直接提取出来。Font Obfuscator is an open-source Pytho...
3 versions - Latest release: 8 months ago - 139 downloads last month - 9 stars on GitHub - 1 maintainer
Top 8.9% on pypi.org
scweet 0.3.3 💰
Tool for scraping Tweets, User infos, Followers and Following
17 versions - Latest release: about 4 years ago - 1 dependent repositories - 1.63 thousand downloads last month - 1,121 stars on GitHub - 1 maintainer
weibo-scraper 1.0.6
Simple Weibo Scraper
9 versions - Latest release: almost 7 years ago - 1 dependent repositories - 403 downloads last month - 91 stars on GitHub - 1 maintainer
emailcrawlerpy 2.0
Small tool to crawl email from given websites
1 version - Latest release: about 4 years ago - 1 dependent repositories - 31 downloads last month - 1 maintainer
web-walker 3.1.5
your can crawl web pages with litte settings. based on scrapy.
43 versions - Latest release: almost 8 years ago - 1 dependent repositories - 795 downloads last month - 54 stars on GitHub - 1 maintainer
zagoload 0.5.1
Download files(http,ftp). Supports: cache, uniform access to remote and local files
2 versions - Latest release: over 8 years ago - 1 dependent repositories - 62 downloads last month - 2 stars on GitHub - 1 maintainer
mozia 1.0.0
Modules for mozia
1 version - Latest release: about 7 years ago - 1 dependent repositories - 30 downloads last month - 1 maintainer
proxy_pool 0.0.3
A proxy pool which you can get an avaiable proxy http server.
3 versions - Latest release: about 8 years ago - 2 dependent repositories - 105 downloads last month - 5 stars on GitHub - 1 maintainer
jike 0.5.0
Jike Metro 🚇 : Jike Python SDK
5 versions - Latest release: almost 7 years ago - 1 dependent repositories - 150 downloads last month - 209 stars on GitHub - 1 maintainer
tvstats 0.0.2
Scrape data of all the episodes of a Tv Series from IMDB
2 versions - Latest release: almost 10 years ago - 2 dependent repositories - 31 downloads last month - 7 stars on GitHub - 1 maintainer
salscraper 0.2.1
A scarping tool
3 versions - Latest release: about 5 years ago - 1 dependent repositories - 71 downloads last month - 1 maintainer
libgenapi 1.2.1
Library to search on Library genesis
10 versions - Latest release: about 7 years ago - 4 dependent repositories - 289 downloads last month - 118 stars on GitHub - 1 maintainer
scrapy-plus 1.0.5
scrapy 常用爬网必备工具包
6 versions - Latest release: over 4 years ago - 1 dependent repositories - 214 downloads last month - 24 stars on GitHub - 1 maintainer
steam-review-scraper 0.1.0
A package to scrape game reviews from Steam.
1 version - Latest release: almost 4 years ago - 1 dependent repositories - 58 downloads last month - 5 stars on GitHub - 1 maintainer
obsidian 1.0.1
Obsidian make web crawl easier
5 versions - Latest release: over 8 years ago - 1 dependent repositories - 303 downloads last month - 5 stars on GitHub - 1 maintainer
asyncpy 1.2.0
Use asyncio and aiohttp's concatenated web crawler framework
14 versions - Latest release: over 2 years ago - 1 dependent repositories - 738 downloads last month - 108 stars on GitHub - 1 maintainer
scrapy-multifeedexporter 0.1.1
Export scraped items of different types to multiple feeds.
2 versions - Latest release: over 10 years ago - 3 dependent repositories - 55 downloads last month - 7 stars on GitHub - 1 maintainer
xhs 0.2.13
xiaohongshu crawl sdk.
27 versions - Latest release: 12 months ago - 1 dependent repositories - 2.21 thousand downloads last month - 1,508 stars on GitHub - 1 maintainer
air-web 0.1.0
A lightweight package for crawling the web with the minimalist of code.
1 version - Latest release: 7 months ago - 2.25 thousand downloads last month - 0 stars on GitHub - 1 maintainer
smockrawl 0.3.0
Smockeo API crawler
6 versions - Latest release: about 3 years ago - 1 dependent repositories - 251 downloads last month - 0 stars on GitHub - 1 maintainer
winzig 0.3.0
A tiny search engine for personal use.
27 versions - Latest release: about 1 year ago - 678 downloads last month - 3 stars on GitHub - 1 maintainer
xextract 0.1.9
Extract structured data from HTML and XML documents like a boss.
18 versions - Latest release: 4 months ago - 1 dependent package - 4 dependent repositories - 615 downloads last month - 50 stars on GitHub - 1 maintainer
pithytools 0.0.1
Pithytools is a collection of command line utilities.
1 version - Latest release: almost 5 years ago - 1 dependent repositories - 25 downloads last month - 5 stars on GitHub - 1 maintainer
pithy 0.0.13
Pithy is a collection of utility libraries for Python 3.
11 versions - Latest release: almost 5 years ago - 8 dependent repositories - 247 downloads last month - 5 stars on GitHub - 1 maintainer
crawlmap 1.3
A python3 script to change your crawling logs to a mindmap
4 versions - Latest release: almost 3 years ago - 110 downloads last month - 7 stars on GitHub - 1 maintainer
seolint 0.2
SEO linting tool.
2 versions - Latest release: over 13 years ago - 2 dependent repositories - 50 downloads last month - 6 stars on GitHub - 1 maintainer
sitecrawl 1.0.5
Simple Python3 module to crawl a website and extract URLs
6 versions - Latest release: about 3 years ago - 1 dependent repositories - 167 downloads last month - 5 stars on GitHub - 1 maintainer
simspider 1.1.0
简单的定向爬虫框架
3 versions - Latest release: over 7 years ago - 1 dependent repositories - 51 downloads last month - 1 maintainer
py3spider 1.0.7
仿Scrapy实现,基于py3.4+的多线程异步网络爬虫,实例请访问https://github.com/ChenL1994/py3spider/tree/master/examples
6 versions - Latest release: over 7 years ago - 1 dependent repositories - 132 downloads last month - 1 maintainer
kneescrape 0.13
A simple script to crawl a website and scrape for emails and unique words to create relevant dict...
3 versions - Latest release: over 7 years ago - 1 dependent repositories - 75 downloads last month - 0 stars on GitHub - 1 maintainer
renfepy 2.0.0
Python library for crawl trains from renfe
10 versions - Latest release: about 2 years ago - 1 dependent repositories - 271 downloads last month - 1 stars on GitHub - 1 maintainer
easy-server-indexing 0.1.1
The following package can be integrated into a server indexing softwares which will skip the alre...
2 versions - Latest release: over 5 years ago - 1 dependent repositories - 96 downloads last month - 1 stars on GitHub - 2 maintainers
analyze_site 0.1.3
Utility to crawl web site looking for key words
2 versions - Latest release: about 8 years ago - 59 downloads last month - 0 stars on GitHub - 1 maintainer
pug-ann 0.0.22
# pug-ann
15 versions - Latest release: about 10 years ago - 4 dependent repositories - 248 downloads last month - 2 stars on GitHub - 1 maintainer
concurrentfloodscraper 1.0.1
A concurrent flood web scraper.
2 versions - Latest release: about 8 years ago - 1 dependent repositories - 54 downloads last month - 0 stars on GitHub - 1 maintainer
nudecrawler 0.3.28
Crawl telegra.ph searching for nudes!
48 versions - Latest release: 9 months ago - 1.14 thousand downloads last month - 303 stars on GitHub - 1 maintainer
universal-utils 0.0.1
python utils
1 version - Latest release: almost 6 years ago - 1 dependent repositories - 58 downloads last month - 1 stars on GitHub - 1 maintainer
cc-net 1.0.0
Tools to download and clean Common Crawl
2 versions - Latest release: over 4 years ago - 1 dependent repositories - 39.5 thousand downloads last month - 993 stars on GitHub - 1 maintainer
structure-spider 1.3.5
multi requests to combine a structure item.
49 versions - Latest release: over 5 years ago - 1 dependent repositories - 591 downloads last month - 29 stars on GitHub - 1 maintainer
ekrhizoc 0.1.2
A simple python web crawler
4 versions - Latest release: almost 5 years ago - 1 dependent repositories - 144 downloads last month - 0 stars on GitHub - 1 maintainer
stweet 2.1.1
Package to scrap tweets
20 versions - Latest release: about 2 years ago - 1 dependent repositories - 690 downloads last month - 602 stars on GitHub - 1 maintainer
icrawl 1.0.6
iCrawl
7 versions - Latest release: over 9 years ago - 1 dependent repositories - 139 downloads last month - 1 maintainer
fcrawler 1.0.1
Python application that can be used to copy files of a given file type from a folder directory.
4 versions - Latest release: almost 4 years ago - 1 dependent repositories - 159 downloads last month - 2 stars on GitHub - 1 maintainer
filelist 1.1.7
Easily list some files in a directory, and exclude others.
8 versions - Latest release: about 9 years ago - 23 dependent repositories - 210 downloads last month - 1 stars on GitHub - 1 maintainer
spotify2csv 0.4.2
Convert Spotify URLs to tracks info in CSV format
5 versions - Latest release: about 7 years ago - 1 dependent repositories - 157 downloads last month - 1 stars on GitHub - 1 maintainer
scrape-google 0.0.2
A package used to scrape top links from google
2 versions - Latest release: almost 5 years ago - 1 dependent repositories - 61 downloads last month - 0 stars on GitHub - 1 maintainer
scr 0.12.0
Command-line Utility for Web Scraping
13 versions - Latest release: over 2 years ago - 4 dependent repositories - 594 downloads last month - 2 stars on GitHub - 1 maintainer
scrapy-redirect 0.1.0
Restrict authorized Scrapy redirections to the website start_urls
1 version - Latest release: over 11 years ago - 2 dependent repositories - 27 downloads last month - 0 stars on GitHub - 1 maintainer
lurk 0.1.3
Extract html from one or multiple urls
4 versions - Latest release: over 9 years ago - 4 dependent repositories - 223 downloads last month - 0 stars on GitHub - 1 maintainer
facehugger 0.1.6
Extracts faces from an image
7 versions - Latest release: over 11 years ago - 2 dependent repositories - 161 downloads last month - 10 stars on GitHub - 1 maintainer
solidscraper 0.7.7
This package lets your script scrape web sites. JQuery-Like API.
4 versions - Latest release: almost 5 years ago - 1 dependent repositories - 187 downloads last month - 3 stars on GitHub - 1 maintainer
pixivhack 0.1.5
Pixiv Hack is a tool to automatically crawl illustrations filtered by ratings on www.pixiv.net
1 version - Latest release: over 9 years ago - 2 dependent repositories - 35 downloads last month - 15 stars on GitHub - 1 maintainer
koreanewscrawler 1.51
Crawl the korean news
9 versions - Latest release: about 3 years ago - 1 dependent repositories - 161 downloads last month - 222 stars on GitHub - 1 maintainer
fs-walker 0.0.1
Walk your file system to check duplicate or missing files
1 version - Latest release: over 5 years ago - 1 dependent repositories - 28 downloads last month - 0 stars on GitHub - 1 maintainer
redbookweb 1.0.1
redbook web crawl sdk.
2 versions - Latest release: 4 months ago - 25 downloads last month - 1 maintainer
preparser 2.0.8
a slight preparser to help parse webpage content or get request from urls,which supports win, mac...
6 versions - Latest release: 3 months ago - 164 downloads last month - 1 stars on GitHub - 1 maintainer
lxparse 1.0.8
A library for intelligently parsing list page links and details page contents
9 versions - Latest release: over 2 years ago - 240 downloads last month - 17 stars on GitHub - 1 maintainer
modules-for-mozia 1.1.0
Modules for mozia
1 version - Latest release: over 7 years ago - 2 dependent repositories - 16 downloads last month - 1 maintainer
alfeios 1.4
Enrich your command-line shell with Herculean cleaning capabilities
3 versions - Latest release: over 1 year ago - 123 downloads last month - 0 stars on GitHub - 1 maintainer
naverscrap 1.0.6
A Naver News Scraping tool
7 versions - Latest release: almost 4 years ago - 1 dependent repositories - 197 downloads last month - 0 stars on GitHub - 1 maintainer
facebook-scraper-vn 0.0.1
Scraping facebook page tool
1 version - Latest release: 8 months ago - 51 downloads last month - 2 stars on GitHub - 1 maintainer
contentfetch 0.0.5
Extracting the content from the webpage
5 versions - Latest release: over 2 years ago - 168 downloads last month - 1 maintainer
spideyx 1.0.0
SpideyX - A Web Reconnaissance Penetration Testing tool for Penetration Testers and Ethical Hackers
1 version - Latest release: 7 months ago - 53 downloads last month - 155 stars on GitHub - 1 maintainer
django-scraper 0.3.8
Django application for collecting online content following user-defined instructions
6 versions - Latest release: almost 10 years ago - 5 dependent repositories - 131 downloads last month - 19 stars on GitHub - 1 maintainer
mozia-modules 1.8.3
Modules for mozia
80 versions - Latest release: about 7 years ago - 2 dependent repositories - 742 downloads last month - 1 maintainer
crawlist 0.1.0
A universal solution for web crawling lists
10 versions - Latest release: 11 months ago - 363 downloads last month - 23 stars on GitHub - 1 maintainer
Top 7.9% on pypi.org
scrape 0.11.3
a command-line web scraping tool
112 versions - Latest release: about 3 years ago - 9 dependent repositories - 2.9 thousand downloads last month - 151 stars on GitHub - 1 maintainer
alltweets 0.2
A very simple Twitter crawler that can collect all friends, followers, and tweets of a specified ...
2 versions - Latest release: almost 9 years ago - 2 dependent repositories - 72 downloads last month - 0 stars on GitHub - 1 maintainer
xhs-client 1.0.0 removed
xiaohongshu crawl sdk.
1 version - Latest release: 6 months ago - 1 maintainer
crawl-requests 2.2.8 removed
crawl_requests(like requests) can update ua and proxy automatically.
17 versions - Latest release: about 7 years ago - 172 downloads last month - 0 stars on GitHub - 1 maintainer
ebook-crawler 2.1.8 removed 💰
This project was moved to https://pypi.org/project/lightnovel-crawler/
30 versions - Latest release: over 6 years ago - 1 dependent repositories - 48 downloads last month - 1,259 stars on GitHub - 1 maintainer