pypi.org "web-archiving" keyword
View the packages on the pypi.org package registry that are tagged with the "web-archiving" keyword.
Top 3.0% on pypi.org
105 versions - Latest release: about 1 month ago - 3 dependent packages - 32 dependent repositories - 4.14 thousand downloads last month - 1,563 stars on GitHub - 1 maintainer
pywb 2.9.1 💰
Pywb Webrecorder web archive replay and capture tools105 versions - Latest release: about 1 month ago - 3 dependent packages - 32 dependent repositories - 4.14 thousand downloads last month - 1,563 stars on GitHub - 1 maintainer
ipwb 0.2024.10.24.1853
InterPlanetary Wayback (ipwb): Web Archive integration with IPFS244 versions - Latest release: about 1 year ago - 2 dependent repositories - 1.51 thousand downloads last month - 606 stars on GitHub - 2 maintainers
Top 8.7% on pypi.org
13 versions - Latest release: 11 months ago - 2 dependent packages - 6 dependent repositories - 5.22 thousand downloads last month - 21 stars on GitHub - 1 maintainer
cdxj-indexer 1.4.6 💰
CDXJ Indexer for WARC and ARC files13 versions - Latest release: 11 months ago - 2 dependent packages - 6 dependent repositories - 5.22 thousand downloads last month - 21 stars on GitHub - 1 maintainer
pwebarc-dumb-dump-server 1.6.1
pwebarc-dumb-dump-server is now hoardy-web-sas4 versions - Latest release: about 1 year ago - 22 downloads last month - 22 stars on GitHub - 1 maintainer
Top 3.0% on pypi.org
35 versions - Latest release: over 3 years ago - 7 dependent packages - 174 dependent repositories - 58.9 thousand downloads last month - 544 stars on GitHub - 1 maintainer
waybackpy 3.0.6
Python package that interfaces with the Internet Archive's Wayback Machine APIs. Archive pages an...35 versions - Latest release: over 3 years ago - 7 dependent packages - 174 dependent repositories - 58.9 thousand downloads last month - 544 stars on GitHub - 1 maintainer
Top 7.2% on pypi.org
33 versions - Latest release: 6 days ago - 4 dependent repositories - 6.33 thousand downloads last month - 157 stars on GitHub - 1 maintainer
cdx-toolkit 0.9.38
A toolkit for working with CDX indices33 versions - Latest release: 6 days ago - 4 dependent repositories - 6.33 thousand downloads last month - 157 stars on GitHub - 1 maintainer
warcdb 0.2.2 💰
WarcDB: Web crawl data as SQLite databases4 versions - Latest release: about 2 years ago - 40 downloads last month - 406 stars on GitHub - 1 maintainer
scrapy-warcio 0.0.8
Scrapy WARC I/O8 versions - Latest release: almost 6 years ago - 1 dependent repositories - 67 downloads last month - 22 stars on GitHub - 1 maintainer
hoardy-web 0.23.0
Inspect, search, organize, programmatically extract values and generate static website mirrors fr...16 versions - Latest release: 10 months ago - 63 downloads last month - 92 stars on GitHub - 1 maintainer
fatcat-openapi-client 0.5.0
API client library for fatcat.wiki (a bibliographic catalog)5 versions - Latest release: almost 4 years ago - 6 dependent repositories - 53 downloads last month - 120 stars on GitHub - 3 maintainers
auto-archiver 1.1.4
Automatically archive links to videos, images, and social media content from Google Sheets (and m...92 versions - Latest release: 21 days ago - 1 dependent repositories - 1.84 thousand downloads last month - 959 stars on GitHub - 1 maintainer
hoardy-web-sas 1.9.0
A simple archiving server for the `Hoardy-Web` Web Extension browser add-on.4 versions - Latest release: 10 months ago - 31 downloads last month - 92 stars on GitHub - 1 maintainer
Top 4.0% on pypi.org
76 versions - Latest release: almost 2 years ago - 4 dependent repositories - 3.32 thousand downloads last month - 24,925 stars on GitHub - 1 maintainer
archivebox 0.7.2 💰
Self-hosted internet archiving solution.76 versions - Latest release: almost 2 years ago - 4 dependent repositories - 3.32 thousand downloads last month - 24,925 stars on GitHub - 1 maintainer
archivebox-likn 0.6.3 💰
The decentralized hosted internet archive.4 versions - Latest release: over 2 years ago - 48 downloads last month - 19,808 stars on GitHub - 1 maintainer
pwebarc-wrrarms 0.14.1
pwebarc-wrrarms is now hoardy-web12 versions - Latest release: about 1 year ago - 10 downloads last month - 22 stars on GitHub - 1 maintainer
eprints2archives 1.3.5
Send EPrints URLs to the Internet Archive and other archives11 versions - Latest release: over 2 years ago - 1 dependent repositories - 64 downloads last month - 4 stars on GitHub - 1 maintainer
Top 2.2% on pypi.org
23 versions - Latest release: 11 months ago - 20 dependent packages - 150 dependent repositories - 2.23 million downloads last month - 326 stars on GitHub - 1 maintainer
warcio 1.7.5 💰
Streaming WARC (and ARC) IO library23 versions - Latest release: 11 months ago - 20 dependent packages - 150 dependent repositories - 2.23 million downloads last month - 326 stars on GitHub - 1 maintainer
pywayback 0.10.9.1 removed 💰
Python WayBack for web archive replay and live web proxy1 version - Latest release: about 10 years ago - 1 dependent repositories - 9 downloads last month - 1,543 stars on GitHub - 1 maintainer
Related Keywords
python
11
warc
8
wayback-machine
7
internet-archiving
7
self-hosted
6
backups
6
archive
5
wayback
5
web-archives
5
internet
4
cli
3
archiving
3
pywb
3
web
3
snapshot
2
offline-reading
2
browser-extension
2
auto-save
2
archiver
2
wayback machine
2
HTTP
2
HTTPS
2
mirror
2
download
2
website
2
site
2
browser
2
WWW
2
web-archive
2
web-browsing
2
website-archive
2
archivebox
2
bookmark-archiver
2
browser-bookmarks
2
chromium
2
digipres
2
firefox
2
headless-browser
2
pinboard
2
pocket
2
rss
2
singlefile
2
wget
2
youtube-dl
2
archives
2
osint
2
internet-archive
2
cdx-api
2
memento
2
docker
2
scholarly-communication
1
oosi
1
scraping
1
open-source-research
1
service
1
Wayback Machine CLI
1
Internet Archive
1
Wayback Machine
1
Archive Website
1
service-worker
1
memento-rfc
1
odu
1
distributed
1
ipfs
1
http
1
EPrints
1
web archives
1
preservation
1
eprints
1
terminal
1
web-data
1
scrapy
1
sqlite
1
database
1
crawling
1
commoncrawl
1
cdx
1
webarchiving
1
wayback-machine-python
1
wayback-machine-api
1
archive-webpages
1
archive-webpage
1
savepagenow
1
CDX API
1
Availability API
1
Internet Archiving
1
Wayback Machine Python
1
fatcat
1
OpenAPI
1
digital-library
1
open-access
1
postgresql
1
rust
1