Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "warc" keyword

warc2zim 1.5.5 💰
Convert WARC to ZIM
22 versions - Latest release: 5 months ago - 1 dependent repositories - 123 downloads last month - 36 stars on GitHub - 1 maintainer
metawarc 1.1.1
metawarc: a command-line tool for data extraction from WARC files (web archives)
3 versions - Latest release: over 1 year ago - 1 dependent repositories - 22 downloads last month - 24 stars on GitHub - 1 maintainer
otmt 1.0.5
Tools for determining if web archive collecions are Off-Topic
9 versions - Latest release: over 2 years ago - 1 dependent repositories - 31 downloads last month - 8 stars on GitHub - 1 maintainer
cdxsummary 0.1.1b5
Summarize web archive capture index (CDX) files
5 versions - Latest release: over 2 years ago - 1 dependent repositories - 42 downloads last month - 47 stars on GitHub - 1 maintainer
basc-warc 0.0.1
Create and manage WARC files. Currently in planning / pre-alpha stage.
1 version - Latest release: 10 months ago - 2 stars on GitHub - 1 maintainer
ipwb 0.1
InterPlanetary Wayback (ipwb): Web Archive integration with IPFS
242 versions - Latest release: 10 months ago - 2 dependent repositories - 1.45 thousand downloads last month - 590 stars on GitHub - 2 maintainers
Top 2.2% on pypi.org
warcio 1.7.4 💰
Streaming WARC (and ARC) IO library
22 versions - Latest release: almost 4 years ago - 20 dependent packages - 150 dependent repositories - 193 thousand downloads last month - 326 stars on GitHub - 1 maintainer
cocrawler 0.1.14
A modern web crawler framework for Python
8 versions - Latest release: over 3 years ago - 1 dependent repositories - 41 downloads last month - 176 stars on GitHub - 1 maintainer
Top 3.3% on pypi.org
fastwarc 0.14.7
A high-performance WARC parsing library for Python written in C++/Cython.
72 versions - Latest release: about 1 month ago - 6 dependent packages - 5 dependent repositories - 209 thousand downloads last month - 42 stars on GitHub - 1 maintainer
archivebox-likn 0.6.3 💰
The decentralized hosted internet archive.
4 versions - Latest release: 11 months ago - 38 downloads last month - 19,808 stars on GitHub - 1 maintainer
forum-dl 0.3.0 💰
Scrape posts and threads from forums, news aggregators, mail archives
3 versions - Latest release: 12 months ago - 120 downloads last month - 60 stars on GitHub - 1 maintainer
warcdb 0.2.2
WarcDB: Web crawl data as SQLite databases
4 versions - Latest release: 7 months ago - 37 downloads last month - 384 stars on GitHub - 1 maintainer
warc-extractor 0.1.1
A simple tool for extracting warc files.
2 versions - Latest release: almost 2 years ago - 56 downloads last month - 65 stars on GitHub - 1 maintainer
scrapy-warcio 0.0.8
Scrapy WARC I/O
8 versions - Latest release: over 4 years ago - 1 dependent repositories - 44 downloads last month - 13 stars on GitHub - 1 maintainer
Top 7.2% on pypi.org
cdx-toolkit 0.9.35
A toolkit for working with CDX indices
30 versions - Latest release: 4 months ago - 4 dependent repositories - 598 downloads last month - 150 stars on GitHub - 1 maintainer
Top 4.0% on pypi.org
archivebox 0.7.2 💰
Self-hosted internet archiving solution.
25 versions - Latest release: 5 months ago - 4 dependent repositories - 1.94 thousand downloads last month - 19,808 stars on GitHub - 1 maintainer
Top 4.3% on pypi.org
resiliparse 0.14.7
A collection of robust and fast processing tools for parsing and analyzing (not only) web archive...
66 versions - Latest release: about 1 month ago - 2 dependent packages - 4 dependent repositories - 20.2 thousand downloads last month - 42 stars on GitHub - 1 maintainer
warcreader 0.4.3
Library for reading HTTP responses from WARC (Web ARChieve) files
9 versions - Latest release: over 7 years ago - 2 dependent repositories - 6 downloads last month - 1 maintainer
Top 8.7% on pypi.org
cdxj-indexer 1.4.5 💰
CDXJ Indexer for WARC and ARC files
12 versions - Latest release: almost 2 years ago - 2 dependent packages - 6 dependent repositories - 1.6 thousand downloads last month - 21 stars on GitHub - 1 maintainer