pypi.org "crawl" keyword
View the packages on the pypi.org package registry that are tagged with the "crawl" keyword.
font-obfuscator 0.1.2
字体反爬、字体混淆工具是一个用于混淆字体文件的工具,可以将字体文件中的字形进行混淆,从而防止字体文件被直接提取出来。Font Obfuscator is an open-source Pytho...3 versions - Latest release: 8 months ago - 139 downloads last month - 9 stars on GitHub - 1 maintainer
Top 8.9% on pypi.org
17 versions - Latest release: about 4 years ago - 1 dependent repositories - 1.63 thousand downloads last month - 1,121 stars on GitHub - 1 maintainer
scweet 0.3.3 💰
Tool for scraping Tweets, User infos, Followers and Following17 versions - Latest release: about 4 years ago - 1 dependent repositories - 1.63 thousand downloads last month - 1,121 stars on GitHub - 1 maintainer
weibo-scraper 1.0.6
Simple Weibo Scraper9 versions - Latest release: almost 7 years ago - 1 dependent repositories - 403 downloads last month - 91 stars on GitHub - 1 maintainer
emailcrawlerpy 2.0
Small tool to crawl email from given websites1 version - Latest release: about 4 years ago - 1 dependent repositories - 31 downloads last month - 1 maintainer
web-walker 3.1.5
your can crawl web pages with litte settings. based on scrapy.43 versions - Latest release: almost 8 years ago - 1 dependent repositories - 795 downloads last month - 54 stars on GitHub - 1 maintainer
zagoload 0.5.1
Download files(http,ftp). Supports: cache, uniform access to remote and local files2 versions - Latest release: over 8 years ago - 1 dependent repositories - 62 downloads last month - 2 stars on GitHub - 1 maintainer
mozia 1.0.0
Modules for mozia1 version - Latest release: about 7 years ago - 1 dependent repositories - 30 downloads last month - 1 maintainer
proxy_pool 0.0.3
A proxy pool which you can get an avaiable proxy http server.3 versions - Latest release: about 8 years ago - 2 dependent repositories - 105 downloads last month - 5 stars on GitHub - 1 maintainer
jike 0.5.0
Jike Metro 🚇 : Jike Python SDK5 versions - Latest release: almost 7 years ago - 1 dependent repositories - 150 downloads last month - 209 stars on GitHub - 1 maintainer
tvstats 0.0.2
Scrape data of all the episodes of a Tv Series from IMDB2 versions - Latest release: almost 10 years ago - 2 dependent repositories - 31 downloads last month - 7 stars on GitHub - 1 maintainer
salscraper 0.2.1
A scarping tool3 versions - Latest release: about 5 years ago - 1 dependent repositories - 71 downloads last month - 1 maintainer
libgenapi 1.2.1
Library to search on Library genesis10 versions - Latest release: about 7 years ago - 4 dependent repositories - 289 downloads last month - 118 stars on GitHub - 1 maintainer
scrapy-plus 1.0.5
scrapy 常用爬网必备工具包6 versions - Latest release: over 4 years ago - 1 dependent repositories - 214 downloads last month - 24 stars on GitHub - 1 maintainer
steam-review-scraper 0.1.0
A package to scrape game reviews from Steam.1 version - Latest release: almost 4 years ago - 1 dependent repositories - 58 downloads last month - 5 stars on GitHub - 1 maintainer
obsidian 1.0.1
Obsidian make web crawl easier5 versions - Latest release: over 8 years ago - 1 dependent repositories - 303 downloads last month - 5 stars on GitHub - 1 maintainer
asyncpy 1.2.0
Use asyncio and aiohttp's concatenated web crawler framework14 versions - Latest release: over 2 years ago - 1 dependent repositories - 738 downloads last month - 108 stars on GitHub - 1 maintainer
scrapy-multifeedexporter 0.1.1
Export scraped items of different types to multiple feeds.2 versions - Latest release: over 10 years ago - 3 dependent repositories - 55 downloads last month - 7 stars on GitHub - 1 maintainer
xhs 0.2.13
xiaohongshu crawl sdk.27 versions - Latest release: 12 months ago - 1 dependent repositories - 2.21 thousand downloads last month - 1,508 stars on GitHub - 1 maintainer
air-web 0.1.0
A lightweight package for crawling the web with the minimalist of code.1 version - Latest release: 7 months ago - 2.25 thousand downloads last month - 0 stars on GitHub - 1 maintainer
smockrawl 0.3.0
Smockeo API crawler6 versions - Latest release: about 3 years ago - 1 dependent repositories - 251 downloads last month - 0 stars on GitHub - 1 maintainer
winzig 0.3.0
A tiny search engine for personal use.27 versions - Latest release: about 1 year ago - 678 downloads last month - 3 stars on GitHub - 1 maintainer
xextract 0.1.9
Extract structured data from HTML and XML documents like a boss.18 versions - Latest release: 4 months ago - 1 dependent package - 4 dependent repositories - 615 downloads last month - 50 stars on GitHub - 1 maintainer
pithytools 0.0.1
Pithytools is a collection of command line utilities.1 version - Latest release: almost 5 years ago - 1 dependent repositories - 25 downloads last month - 5 stars on GitHub - 1 maintainer
pithy 0.0.13
Pithy is a collection of utility libraries for Python 3.11 versions - Latest release: almost 5 years ago - 8 dependent repositories - 247 downloads last month - 5 stars on GitHub - 1 maintainer
crawlmap 1.3
A python3 script to change your crawling logs to a mindmap4 versions - Latest release: almost 3 years ago - 110 downloads last month - 7 stars on GitHub - 1 maintainer
seolint 0.2
SEO linting tool.2 versions - Latest release: over 13 years ago - 2 dependent repositories - 50 downloads last month - 6 stars on GitHub - 1 maintainer
sitecrawl 1.0.5
Simple Python3 module to crawl a website and extract URLs6 versions - Latest release: about 3 years ago - 1 dependent repositories - 167 downloads last month - 5 stars on GitHub - 1 maintainer
simspider 1.1.0
简单的定向爬虫框架3 versions - Latest release: over 7 years ago - 1 dependent repositories - 51 downloads last month - 1 maintainer
py3spider 1.0.7
仿Scrapy实现,基于py3.4+的多线程异步网络爬虫,实例请访问https://github.com/ChenL1994/py3spider/tree/master/examples6 versions - Latest release: over 7 years ago - 1 dependent repositories - 132 downloads last month - 1 maintainer
kneescrape 0.13
A simple script to crawl a website and scrape for emails and unique words to create relevant dict...3 versions - Latest release: over 7 years ago - 1 dependent repositories - 75 downloads last month - 0 stars on GitHub - 1 maintainer
renfepy 2.0.0
Python library for crawl trains from renfe10 versions - Latest release: about 2 years ago - 1 dependent repositories - 271 downloads last month - 1 stars on GitHub - 1 maintainer
easy-server-indexing 0.1.1
The following package can be integrated into a server indexing softwares which will skip the alre...2 versions - Latest release: over 5 years ago - 1 dependent repositories - 96 downloads last month - 1 stars on GitHub - 2 maintainers
analyze_site 0.1.3
Utility to crawl web site looking for key words2 versions - Latest release: about 8 years ago - 59 downloads last month - 0 stars on GitHub - 1 maintainer
pug-ann 0.0.22
# pug-ann15 versions - Latest release: about 10 years ago - 4 dependent repositories - 248 downloads last month - 2 stars on GitHub - 1 maintainer
concurrentfloodscraper 1.0.1
A concurrent flood web scraper.2 versions - Latest release: about 8 years ago - 1 dependent repositories - 54 downloads last month - 0 stars on GitHub - 1 maintainer
nudecrawler 0.3.28
Crawl telegra.ph searching for nudes!48 versions - Latest release: 9 months ago - 1.14 thousand downloads last month - 303 stars on GitHub - 1 maintainer
universal-utils 0.0.1
python utils1 version - Latest release: almost 6 years ago - 1 dependent repositories - 58 downloads last month - 1 stars on GitHub - 1 maintainer
cc-net 1.0.0
Tools to download and clean Common Crawl2 versions - Latest release: over 4 years ago - 1 dependent repositories - 39.5 thousand downloads last month - 993 stars on GitHub - 1 maintainer
structure-spider 1.3.5
multi requests to combine a structure item.49 versions - Latest release: over 5 years ago - 1 dependent repositories - 591 downloads last month - 29 stars on GitHub - 1 maintainer
ekrhizoc 0.1.2
A simple python web crawler4 versions - Latest release: almost 5 years ago - 1 dependent repositories - 144 downloads last month - 0 stars on GitHub - 1 maintainer
stweet 2.1.1
Package to scrap tweets20 versions - Latest release: about 2 years ago - 1 dependent repositories - 690 downloads last month - 602 stars on GitHub - 1 maintainer
icrawl 1.0.6
iCrawl7 versions - Latest release: over 9 years ago - 1 dependent repositories - 139 downloads last month - 1 maintainer
fcrawler 1.0.1
Python application that can be used to copy files of a given file type from a folder directory.4 versions - Latest release: almost 4 years ago - 1 dependent repositories - 159 downloads last month - 2 stars on GitHub - 1 maintainer
filelist 1.1.7
Easily list some files in a directory, and exclude others.8 versions - Latest release: about 9 years ago - 23 dependent repositories - 210 downloads last month - 1 stars on GitHub - 1 maintainer
spotify2csv 0.4.2
Convert Spotify URLs to tracks info in CSV format5 versions - Latest release: about 7 years ago - 1 dependent repositories - 157 downloads last month - 1 stars on GitHub - 1 maintainer
scrape-google 0.0.2
A package used to scrape top links from google2 versions - Latest release: almost 5 years ago - 1 dependent repositories - 61 downloads last month - 0 stars on GitHub - 1 maintainer
scr 0.12.0
Command-line Utility for Web Scraping13 versions - Latest release: over 2 years ago - 4 dependent repositories - 594 downloads last month - 2 stars on GitHub - 1 maintainer
scrapy-redirect 0.1.0
Restrict authorized Scrapy redirections to the website start_urls1 version - Latest release: over 11 years ago - 2 dependent repositories - 27 downloads last month - 0 stars on GitHub - 1 maintainer
lurk 0.1.3
Extract html from one or multiple urls4 versions - Latest release: over 9 years ago - 4 dependent repositories - 223 downloads last month - 0 stars on GitHub - 1 maintainer
facehugger 0.1.6
Extracts faces from an image7 versions - Latest release: over 11 years ago - 2 dependent repositories - 161 downloads last month - 10 stars on GitHub - 1 maintainer
solidscraper 0.7.7
This package lets your script scrape web sites. JQuery-Like API.4 versions - Latest release: almost 5 years ago - 1 dependent repositories - 187 downloads last month - 3 stars on GitHub - 1 maintainer
pixivhack 0.1.5
Pixiv Hack is a tool to automatically crawl illustrations filtered by ratings on www.pixiv.net1 version - Latest release: over 9 years ago - 2 dependent repositories - 35 downloads last month - 15 stars on GitHub - 1 maintainer
koreanewscrawler 1.51
Crawl the korean news9 versions - Latest release: about 3 years ago - 1 dependent repositories - 161 downloads last month - 222 stars on GitHub - 1 maintainer
fs-walker 0.0.1
Walk your file system to check duplicate or missing files1 version - Latest release: over 5 years ago - 1 dependent repositories - 28 downloads last month - 0 stars on GitHub - 1 maintainer
redbookweb 1.0.1
redbook web crawl sdk.2 versions - Latest release: 4 months ago - 25 downloads last month - 1 maintainer
preparser 2.0.8
a slight preparser to help parse webpage content or get request from urls,which supports win, mac...6 versions - Latest release: 3 months ago - 164 downloads last month - 1 stars on GitHub - 1 maintainer
lxparse 1.0.8
A library for intelligently parsing list page links and details page contents9 versions - Latest release: over 2 years ago - 240 downloads last month - 17 stars on GitHub - 1 maintainer
modules-for-mozia 1.1.0
Modules for mozia1 version - Latest release: over 7 years ago - 2 dependent repositories - 16 downloads last month - 1 maintainer
alfeios 1.4
Enrich your command-line shell with Herculean cleaning capabilities3 versions - Latest release: over 1 year ago - 123 downloads last month - 0 stars on GitHub - 1 maintainer
naverscrap 1.0.6
A Naver News Scraping tool7 versions - Latest release: almost 4 years ago - 1 dependent repositories - 197 downloads last month - 0 stars on GitHub - 1 maintainer
facebook-scraper-vn 0.0.1
Scraping facebook page tool1 version - Latest release: 8 months ago - 51 downloads last month - 2 stars on GitHub - 1 maintainer
contentfetch 0.0.5
Extracting the content from the webpage5 versions - Latest release: over 2 years ago - 168 downloads last month - 1 maintainer
spideyx 1.0.0
SpideyX - A Web Reconnaissance Penetration Testing tool for Penetration Testers and Ethical Hackers1 version - Latest release: 7 months ago - 53 downloads last month - 155 stars on GitHub - 1 maintainer
django-scraper 0.3.8
Django application for collecting online content following user-defined instructions6 versions - Latest release: almost 10 years ago - 5 dependent repositories - 131 downloads last month - 19 stars on GitHub - 1 maintainer
mozia-modules 1.8.3
Modules for mozia80 versions - Latest release: about 7 years ago - 2 dependent repositories - 742 downloads last month - 1 maintainer
crawlist 0.1.0
A universal solution for web crawling lists10 versions - Latest release: 11 months ago - 363 downloads last month - 23 stars on GitHub - 1 maintainer
Top 7.9% on pypi.org
112 versions - Latest release: about 3 years ago - 9 dependent repositories - 2.9 thousand downloads last month - 151 stars on GitHub - 1 maintainer
scrape 0.11.3
a command-line web scraping tool112 versions - Latest release: about 3 years ago - 9 dependent repositories - 2.9 thousand downloads last month - 151 stars on GitHub - 1 maintainer
alltweets 0.2
A very simple Twitter crawler that can collect all friends, followers, and tweets of a specified ...2 versions - Latest release: almost 9 years ago - 2 dependent repositories - 72 downloads last month - 0 stars on GitHub - 1 maintainer
xhs-client 1.0.0 removed
xiaohongshu crawl sdk.1 version - Latest release: 6 months ago - 1 maintainer
crawl-requests 2.2.8 removed
crawl_requests(like requests) can update ua and proxy automatically.17 versions - Latest release: about 7 years ago - 172 downloads last month - 0 stars on GitHub - 1 maintainer
ebook-crawler 2.1.8 removed 💰
This project was moved to https://pypi.org/project/lightnovel-crawler/30 versions - Latest release: over 6 years ago - 1 dependent repositories - 48 downloads last month - 1,259 stars on GitHub - 1 maintainer
Related Keywords
crawler
18
scrape
18
python
17
web
14
scraper
11
spider
8
scraping
8
search
7
scrapy
6
crawling
5
parse
4
html
4
csv
4
python3
4
json
4
file
4
files
3
parsing
3
web-scraping
3
crawler-python
3
tweets
3
selenium
3
parser
3
walk
3
twitter
3
sqlite
3
requests
3
api
3
proxy
2
xhs
2
aiohttp
2
fs
2
定向爬虫
2
crawling-sites
2
convert
2
filesystem
2
system
2
tor
2
duplicate
2
missing
2
asyncio
2
zstandard
2
zst
2
xml
2
utilities
2
webscraping
2
unicode
2
terminal
2
term
2
data
2
syntax
2
svg
2
msgpack
2
graphviz
2
scrap
2
scrapper
2
graph
2
format
2
diff
2
download
2
console
2
extract
2
directory
2
webpage
2
facehugger
1
webscrape
1
lurker
1
simplecv
1
vision
1
face
1
crawling-python
1
jquery
1
web-crawler
1
pixiv
1
wbs
1
users
1
user
1
unofficial
1
twitter-api
1
twint
1
tweet
1
searchrunner
1
scrap-tweet
1
structure
1
dataset
1
common
1
utils
1
screenshot
1
tits
1
telegra-ph
1
onlyfans
1
nudity-detection
1
nudes
1
font
1
lurk
1
repl
1
shell
1
command
1
downloader
1
regex
1