Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "hdfs" keyword

hadoop-mapreduce 0.5
Implementation of Hadoop Mapreduce on text files
4 versions - Latest release: over 1 year ago - 23 downloads last month - 0 stars on GitHub - 2 maintainers
Top 5.8% on pypi.org
megfile 3.0.3
Megvii file operation library
77 versions - Latest release: about 6 hours ago - 2 dependent packages - 5 dependent repositories - 6.69 thousand downloads last month - 105 stars on GitHub - 2 maintainers
Top 1.4% on pypi.org
impyla 0.19.0
Python client for the Impala distributed query engine
52 versions - Latest release: 6 months ago - 22 dependent packages - 251 dependent repositories - 612 thousand downloads last month - 723 stars on GitHub - 13 maintainers
pydistcp 1.0.7
pydistcp: python WebHDFS inter/intra-cluster data copy tool.
6 versions - Latest release: over 3 years ago - 1 dependent repositories - 21 downloads last month - 10 stars on GitHub - 2 maintainers
cyhdfs 0.1.3
Cython wrapper around libhdfs
2 versions - Latest release: over 11 years ago - 2 dependent repositories - 13 downloads last month - 2 maintainers
srcd_smart_open 1.9.0 💰
Utils for streaming large files (S3, HDFS, gzip, bz2...) - temporary source{d} fork
1 version - Latest release: about 5 years ago - 54 downloads last month - 3,095 stars on GitHub - 2 maintainers
Top 1.1% on pypi.org
smart-open 7.0.4 💰
Utils for streaming large files (S3, HDFS, GCS, Azure Blob Storage, gzip, bz2...)
61 versions - Latest release: about 1 month ago - 216 dependent packages - 10,157 dependent repositories - 24.5 million downloads last month - 3,095 stars on GitHub - 2 maintainers
Top 5.6% on pypi.org
tethys-smart-open 6.2.0.dev2 removed
Utils for streaming large files (S3, HDFS, GCS, Azure Blob Storage, gzip, bz2...)
3 versions - Latest release: over 1 year ago
sqoopit 0.0.12
A simple package to let you Sqoop into HDFS/Hive/HBase with python
1 version - Latest release: about 4 years ago - 1 dependent repositories - 16 downloads last month - 0 stars on GitHub - 2 maintainers
aioseaweedfs 0.3.1 💰
async client for seaweedfs
4 versions - Latest release: 3 months ago - 41 downloads last month - 21,149 stars on GitHub - 2 maintainers
pymongo_hadoop 1.1.0
UNKNOWN
2 versions - Latest release: about 11 years ago - 3 dependent repositories - 25 downloads last month - 1,522 stars on GitHub - 2 maintainers
Top 2.3% on pypi.org
snakebite 2.11.0
Pure Python HDFS client
63 versions - Latest release: almost 8 years ago - 1 dependent package - 88 dependent repositories - 47.5 thousand downloads last month - 858 stars on GitHub - 1 maintainer
py4hdfs 0.0.1
Fast Queries to HDFS
1 version - Latest release: about 8 years ago - 2 dependent repositories - 9 downloads last month - 1 stars on GitHub - 2 maintainers
dvc-hdfs 3.0.0
hdfs plugin for dvc
3 versions - Latest release: 5 months ago - 4 dependent packages - 3 dependent repositories - 10.5 thousand downloads last month - 0 stars on GitHub - 4 maintainers
dfspy 0.1.0
Distributed File System written in Python
1 version - Latest release: almost 2 years ago - 19 downloads last month - 14 stars on GitHub - 2 maintainers
hdfs-native 0.9.1
Python bindings for hdfs-native Rust library
10 versions - Latest release: about 1 month ago - 313 downloads last month - 18 stars on GitHub - 2 maintainers
spark-hdfs-tools 0.1.1
spark_hdfs_tools
1 version - Latest release: 9 months ago - 9 downloads last month - 1 maintainer
alphareader 0.0.7
A reader for large files with custom delimiters and encodings
7 versions - Latest release: about 4 years ago - 1 dependent repositories - 27 downloads last month - 5 stars on GitHub - 1 maintainer
hadeploy 0.6.1
An Hadoop Application deployment tool
12 versions - Latest release: over 5 years ago - 1 dependent repositories - 112 downloads last month - 10 stars on GitHub - 2 maintainers
redis2hdfs 0.0.2
Export Redis data to HDFS
2 versions - Latest release: over 9 years ago - 2 dependent repositories - 12 downloads last month - 2 stars on GitHub - 2 maintainers
Top 0.4% on pypi.org
hdfs 2.7.3
HdfsCLI: API and command line interface for HDFS.
81 versions - Latest release: 7 months ago - 31 dependent packages - 952 dependent repositories - 2.96 million downloads last month - 267 stars on GitHub - 1 maintainer
pfio 2.8.0
PFN IO library
31 versions - Latest release: 23 days ago - 1 dependent repositories - 58 thousand downloads last month - 52 stars on GitHub - 4 maintainers
webhdfspy 0.3.5
A wrapper library to access Hadoop HTTP REST API
8 versions - Latest release: over 7 years ago - 2 dependent repositories - 95 downloads last month - 8 stars on GitHub - 2 maintainers
chainerio 0.1.2
Chainer IO library
6 versions - Latest release: about 4 years ago - 44 downloads last month - 52 stars on GitHub - 4 maintainers
Top 7.7% on pypi.org
cluster-pack 0.3.7
A library on top of either pex or conda-packto make your Python code easily available on a cluster
44 versions - Latest release: 10 days ago - 2 dependent packages - 5 dependent repositories - 683 downloads last month - 43 stars on GitHub - 14 maintainers
kraken-pyds 2.0.1
kraken: python distributed data transfer tool.
3 versions - Latest release: over 3 years ago - 1 dependent repositories - 20 downloads last month - 2 stars on GitHub - 1 maintainer
Top 6.1% on pypi.org
pyhdfs 0.3.1
Pure Python HDFS client
7 versions - Latest release: over 4 years ago - 11 dependent repositories - 7.36 thousand downloads last month - 88 stars on GitHub - 2 maintainers
tinyhdfs 1.1.4
Tiny client for HDFS, base on WebHDFS
1 version - Latest release: over 7 years ago - 1 dependent repositories - 11 downloads last month - 2 stars on GitHub - 2 maintainers
tanit 2.0.1
tanit: python distributed data transfer tool.
1 version - Latest release: over 3 years ago - 1 dependent repositories - 9 downloads last month - 2 stars on GitHub - 2 maintainers
schemaindex 0.2451
An indexing engine for different types of data sources, including HDFS, Mysql, etc.
4 versions - Latest release: about 6 years ago - 1 dependent repositories - 15 downloads last month - 3 stars on GitHub - 2 maintainers
braingeneers-smart-open 2023.10.6 💰
Utils for streaming large files (S3, HDFS, GCS, Azure Blob Storage, gzip, bz2...)
1 version - Latest release: 7 months ago - 1 dependent package - 258 downloads last month - 0 stars on GitHub - 3 maintainers
Top 4.4% on pypi.org
skein 0.8.2
A simple tool and library for deploying applications on Apache YARN
23 versions - Latest release: about 2 years ago - 2 dependent packages - 12 dependent repositories - 25.2 thousand downloads last month - 138 stars on GitHub - 2 maintainers
Top 3.2% on pypi.org
tiledb 0.28.0
Pythonic interface to the TileDB array storage manager
136 versions - Latest release: 24 days ago - 13 dependent packages - 136 dependent repositories - 36.3 thousand downloads last month - 178 stars on GitHub - 5 maintainers
Top 3.6% on pypi.org
hdfs3 0.3.1 💰
Python wrappers for libhdfs3, a native HDFS client
7 versions - Latest release: almost 5 years ago - 3 dependent packages - 30 dependent repositories - 11.2 thousand downloads last month - 136 stars on GitHub - 6 maintainers
Top 3.2% on pypi.org
wradlib 2.0.3
wradlib - An Open Source Library for Weather Radar Data Processing
71 versions - Latest release: 6 months ago - 3 dependent packages - 12 dependent repositories - 2.67 thousand downloads last month - 247 stars on GitHub - 2 maintainers
ym-impyla 0.14.0
Python client for the Impala distributed query engine
1 version - Latest release: over 7 years ago - 1 dependent repositories - 11 downloads last month - 1 stars on GitHub - 2 maintainers
edmunds_hdfs_load 4.0.1
Moves files to hdfs by creating hive tables
29 versions - Latest release: almost 10 years ago - 2 dependent repositories - 85 downloads last month - 2 maintainers
sparkdh 0.0.1
1 version - Latest release: about 2 years ago - 1 dependent repositories - 6 downloads last month - 0 stars on GitHub - 2 maintainers
smart-pathlib 0.0.1
Utils for making os standard path operationsinteroperable with Cloud blob storages
1 version - Latest release: over 3 years ago - 1 dependent repositories - 11 downloads last month - 0 stars on GitHub - 2 maintainers
impyla-jz 0.16.3
Python client for the Impala distributed query engine
1 version - Latest release: almost 3 years ago - 1 dependent repositories - 12 downloads last month - 0 stars on GitHub - 2 maintainers
spark-yarn-submit 1.0.0
library to handle spark job submit in a yarn cluster in different environment
1 version - Latest release: about 7 years ago - 1 dependent repositories - 3 downloads last month - 3 stars on GitHub - 2 maintainers
jupyter-omnicm 0.0.5
jupyter-omnicm is a flexible content manager system for Jupyter notebooks.
2 versions - Latest release: about 5 years ago - 1 dependent repositories - 9 downloads last month - 0 stars on GitHub - 1 maintainer
test-cephadm 0.2
cephadm
2 versions - Latest release: over 4 years ago - 1 dependent repositories - 3 downloads last month - 13,175 stars on GitHub - 1 maintainer
starlake-dagster 0.1.2
Starlake Python Distribution For Dagster
5 versions - Latest release: 3 months ago - 11 downloads last month - 31 stars on GitHub - 2 maintainers
starlake-airflow 0.1.2
Starlake Python Distribution For Airflow
23 versions - Latest release: 3 months ago - 74 downloads last month - 31 stars on GitHub - 1 maintainer
starlake-orchestration 0.1.2
Starlake Python Distribution For orchestration
6 versions - Latest release: 3 months ago - 23 downloads last month - 31 stars on GitHub - 2 maintainers
importable 0.2.2
Allows to import zip-compressed Python package by URL (http, hdfs).
4 versions - Latest release: over 6 years ago - 1 dependent repositories - 9 downloads last month - 2 stars on GitHub - 2 maintainers
trustedanalytics 0.7.3.post20161020785
Trusted Analytics Toolkit
161 versions - Latest release: over 7 years ago - 2 dependent repositories - 279 downloads last month - 43 stars on GitHub - 2 maintainers
streamsx.hdfs 1.5.9
HDFS integration for IBM Streams
18 versions - Latest release: over 3 years ago - 167 downloads last month - 9 stars on GitHub - 8 maintainers
elx 0.2.0 💰
A lightweight Python interface for extracting and loading using the Singer.io spec.
8 versions - Latest release: 6 months ago - 2 downloads last month - 3,082 stars on GitHub - 2 maintainers
hcompressor 1.0.0 removed
Hcompressor is a tool to compress files in HDFS
1 version - Latest release: over 1 year ago - 15 downloads last month - 1 stars on GitHub - 1 maintainer