Ecosyste.ms: Packages
An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.
pypi.org "warc" keyword
warc2zim 1.5.5 💰
Convert WARC to ZIM22 versions - Latest release: 5 months ago - 1 dependent repositories - 123 downloads last month - 36 stars on GitHub - 1 maintainer
metawarc 1.1.1
metawarc: a command-line tool for data extraction from WARC files (web archives)3 versions - Latest release: over 1 year ago - 1 dependent repositories - 22 downloads last month - 24 stars on GitHub - 1 maintainer
otmt 1.0.5
Tools for determining if web archive collecions are Off-Topic9 versions - Latest release: over 2 years ago - 1 dependent repositories - 31 downloads last month - 8 stars on GitHub - 1 maintainer
cdxsummary 0.1.1b5
Summarize web archive capture index (CDX) files5 versions - Latest release: over 2 years ago - 1 dependent repositories - 42 downloads last month - 47 stars on GitHub - 1 maintainer
basc-warc 0.0.1
Create and manage WARC files. Currently in planning / pre-alpha stage.1 version - Latest release: 10 months ago - 2 stars on GitHub - 1 maintainer
ipwb 0.1
InterPlanetary Wayback (ipwb): Web Archive integration with IPFS242 versions - Latest release: 10 months ago - 2 dependent repositories - 1.45 thousand downloads last month - 590 stars on GitHub - 2 maintainers
Top 2.2% on pypi.org
22 versions - Latest release: almost 4 years ago - 20 dependent packages - 150 dependent repositories - 193 thousand downloads last month - 326 stars on GitHub - 1 maintainer
warcio 1.7.4 💰
Streaming WARC (and ARC) IO library22 versions - Latest release: almost 4 years ago - 20 dependent packages - 150 dependent repositories - 193 thousand downloads last month - 326 stars on GitHub - 1 maintainer
cocrawler 0.1.14
A modern web crawler framework for Python8 versions - Latest release: over 3 years ago - 1 dependent repositories - 41 downloads last month - 176 stars on GitHub - 1 maintainer
Top 3.3% on pypi.org
72 versions - Latest release: about 1 month ago - 6 dependent packages - 5 dependent repositories - 209 thousand downloads last month - 42 stars on GitHub - 1 maintainer
fastwarc 0.14.7
A high-performance WARC parsing library for Python written in C++/Cython.72 versions - Latest release: about 1 month ago - 6 dependent packages - 5 dependent repositories - 209 thousand downloads last month - 42 stars on GitHub - 1 maintainer
archivebox-likn 0.6.3 💰
The decentralized hosted internet archive.4 versions - Latest release: 11 months ago - 38 downloads last month - 19,808 stars on GitHub - 1 maintainer
forum-dl 0.3.0 💰
Scrape posts and threads from forums, news aggregators, mail archives3 versions - Latest release: 12 months ago - 120 downloads last month - 60 stars on GitHub - 1 maintainer
warcdb 0.2.2
WarcDB: Web crawl data as SQLite databases4 versions - Latest release: 7 months ago - 37 downloads last month - 384 stars on GitHub - 1 maintainer
warc-extractor 0.1.1
A simple tool for extracting warc files.2 versions - Latest release: almost 2 years ago - 56 downloads last month - 65 stars on GitHub - 1 maintainer
scrapy-warcio 0.0.8
Scrapy WARC I/O8 versions - Latest release: over 4 years ago - 1 dependent repositories - 44 downloads last month - 13 stars on GitHub - 1 maintainer
Top 7.2% on pypi.org
30 versions - Latest release: 4 months ago - 4 dependent repositories - 598 downloads last month - 150 stars on GitHub - 1 maintainer
cdx-toolkit 0.9.35
A toolkit for working with CDX indices30 versions - Latest release: 4 months ago - 4 dependent repositories - 598 downloads last month - 150 stars on GitHub - 1 maintainer
Top 4.0% on pypi.org
25 versions - Latest release: 5 months ago - 4 dependent repositories - 1.94 thousand downloads last month - 19,808 stars on GitHub - 1 maintainer
archivebox 0.7.2 💰
Self-hosted internet archiving solution.25 versions - Latest release: 5 months ago - 4 dependent repositories - 1.94 thousand downloads last month - 19,808 stars on GitHub - 1 maintainer
Top 4.3% on pypi.org
66 versions - Latest release: about 1 month ago - 2 dependent packages - 4 dependent repositories - 20.2 thousand downloads last month - 42 stars on GitHub - 1 maintainer
resiliparse 0.14.7
A collection of robust and fast processing tools for parsing and analyzing (not only) web archive...66 versions - Latest release: about 1 month ago - 2 dependent packages - 4 dependent repositories - 20.2 thousand downloads last month - 42 stars on GitHub - 1 maintainer
warcreader 0.4.3
Library for reading HTTP responses from WARC (Web ARChieve) files9 versions - Latest release: over 7 years ago - 2 dependent repositories - 6 downloads last month - 1 maintainer
Top 8.7% on pypi.org
12 versions - Latest release: almost 2 years ago - 2 dependent packages - 6 dependent repositories - 1.6 thousand downloads last month - 21 stars on GitHub - 1 maintainer
cdxj-indexer 1.4.5 💰
CDXJ Indexer for WARC and ARC files12 versions - Latest release: almost 2 years ago - 2 dependent packages - 6 dependent repositories - 1.6 thousand downloads last month - 21 stars on GitHub - 1 maintainer
Related Keywords
python
10
web-archiving
8
web
4
archive
3
webarchive
3
internet-archiving
3
scraper
2
rss
2
pocket
2
pinboard
2
headless-browser
2
firefox
2
digipres
2
chromium
2
browser-bookmarks
2
bookmark-archiver
2
backups
2
archivebox
2
htmlparser
2
extraction
2
web-archives
2
cython
2
cpp
2
bigdata
2
memento
2
youtube-dl
2
cdx
2
wget
2
wayback-machine
2
self-hosted
2
singlefile
2
summary
1
python3
1
screenshot
1
commoncrawl
1
cdx-api
1
scrapy
1
web-data
1
sqlite
1
database
1
crawling
1
cli
1
simplemachines
1
phpbb
1
forum
1
discourse
1
data-fetching
1
statistics
1
report
1
nodejs
1
collection
1
topic
1
timemap
1
simhash
1
measure
1
cosine
1
offtopic
1
similarity
1
webarchives
1
webarchiving
1
warc-files
1
osint-python
1
osint
1
metadata
1
zim
1
pluggable-modules
1
crawler
1
concurrency
1
async-python
1
aiohttp-client
1
aiohttp
1
pywb
1
service-worker
1
memento-rfc
1
docker
1
wayback
1
odu
1
distributed
1
ipfs
1
archives
1
http
1
archiving
1
webcomponents
1
web-archive
1