Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "hdfs" keyword

hdfs-native 0.9.1
Python bindings for hdfs-native Rust library
13 versions - Latest release: 2 months ago - 561 downloads last month - 21 stars on GitHub - 1 maintainer
cyhdfs 0.1.3
Cython wrapper around libhdfs
2 versions - Latest release: over 11 years ago - 2 dependent repositories - 11 downloads last month - 1 maintainer
pydistcp 1.0.7
pydistcp: python WebHDFS inter/intra-cluster data copy tool.
6 versions - Latest release: almost 4 years ago - 1 dependent repositories - 11 downloads last month - 10 stars on GitHub - 1 maintainer
sqoopit 0.0.12
A simple package to let you Sqoop into HDFS/Hive/HBase with python
1 version - Latest release: about 4 years ago - 1 dependent repositories - 15 downloads last month - 0 stars on GitHub - 1 maintainer
Top 1.1% on pypi.org
smart-open 7.0.4 💰
Utils for streaming large files (S3, HDFS, GCS, Azure Blob Storage, gzip, bz2...)
61 versions - Latest release: 3 months ago - 248 dependent packages - 10,157 dependent repositories - 24.7 million downloads last month - 3,101 stars on GitHub - 2 maintainers
pymongo_hadoop 1.1.0
UNKNOWN
2 versions - Latest release: about 11 years ago - 3 dependent repositories - 21 downloads last month - 1,521 stars on GitHub - 1 maintainer
Top 2.3% on pypi.org
snakebite 2.11.0
Pure Python HDFS client
63 versions - Latest release: almost 8 years ago - 1 dependent package - 88 dependent repositories - 47.5 thousand downloads last month - 858 stars on GitHub - 1 maintainer
py4hdfs 0.0.1
Fast Queries to HDFS
1 version - Latest release: about 8 years ago - 2 dependent repositories - 5 downloads last month - 1 stars on GitHub - 1 maintainer
spark-hdfs-tools 0.1.1
spark_hdfs_tools
1 version - Latest release: 11 months ago - 10 downloads last month - 1 maintainer
dfspy 0.1.0
Distributed File System written in Python
1 version - Latest release: almost 2 years ago - 15 downloads last month - 14 stars on GitHub - 1 maintainer
Top 0.4% on pypi.org
hdfs 2.7.3
HdfsCLI: API and command line interface for HDFS.
81 versions - Latest release: 8 months ago - 36 dependent packages - 952 dependent repositories - 3.08 million downloads last month - 267 stars on GitHub - 1 maintainer
alphareader 0.0.7
A reader for large files with custom delimiters and encodings
7 versions - Latest release: about 4 years ago - 1 dependent repositories - 46 downloads last month - 5 stars on GitHub - 1 maintainer
Top 5.8% on pypi.org
megfile 3.0.3
Megvii file operation library
79 versions - Latest release: about 1 month ago - 2 dependent packages - 5 dependent repositories - 5.35 thousand downloads last month - 110 stars on GitHub - 2 maintainers
hadeploy 0.6.1
An Hadoop Application deployment tool
12 versions - Latest release: over 5 years ago - 1 dependent repositories - 25 downloads last month - 10 stars on GitHub - 1 maintainer
Top 1.4% on pypi.org
impyla 0.19.0
Python client for the Impala distributed query engine
52 versions - Latest release: 7 months ago - 29 dependent packages - 251 dependent repositories - 746 thousand downloads last month - 724 stars on GitHub - 13 maintainers
redis2hdfs 0.0.2
Export Redis data to HDFS
2 versions - Latest release: over 9 years ago - 2 dependent repositories - 12 downloads last month - 2 stars on GitHub - 1 maintainer
dvc-hdfs 3.0.0
hdfs plugin for dvc
3 versions - Latest release: 6 months ago - 5 dependent packages - 3 dependent repositories - 10.2 thousand downloads last month - 0 stars on GitHub - 2 maintainers
starlake-airflow 0.1.2
Starlake Python Distribution For Airflow
24 versions - Latest release: 4 months ago - 1 dependent package - 128 downloads last month - 31 stars on GitHub - 1 maintainer
starlake-dagster 0.1.2
Starlake Python Distribution For Dagster
6 versions - Latest release: 4 months ago - 1 dependent package - 85 downloads last month - 31 stars on GitHub - 1 maintainer
starlake-orchestration 0.1.2
Starlake Python Distribution For orchestration
7 versions - Latest release: 4 months ago - 2 dependent packages - 121 downloads last month - 31 stars on GitHub - 1 maintainer
pfio 2.8.0
PFN IO library
31 versions - Latest release: about 2 months ago - 1 dependent repositories - 66.8 thousand downloads last month - 52 stars on GitHub - 3 maintainers
webhdfspy 0.3.5
A wrapper library to access Hadoop HTTP REST API
8 versions - Latest release: almost 8 years ago - 2 dependent repositories - 95 downloads last month - 8 stars on GitHub - 1 maintainer
chainerio 0.1.2
Chainer IO library
6 versions - Latest release: about 4 years ago - 54 downloads last month - 52 stars on GitHub - 2 maintainers
Top 7.7% on pypi.org
cluster-pack 0.3.7
A library on top of either pex or conda-packto make your Python code easily available on a cluster
44 versions - Latest release: about 1 month ago - 2 dependent packages - 5 dependent repositories - 1.27 thousand downloads last month - 43 stars on GitHub - 7 maintainers
schemaindex 0.2451
An indexing engine for different types of data sources, including HDFS, Mysql, etc.
4 versions - Latest release: over 6 years ago - 1 dependent repositories - 12 downloads last month - 3 stars on GitHub - 1 maintainer
Top 6.1% on pypi.org
pyhdfs 0.3.1
Pure Python HDFS client
7 versions - Latest release: over 4 years ago - 2 dependent packages - 11 dependent repositories - 7.38 thousand downloads last month - 88 stars on GitHub - 1 maintainer
kraken-pyds 2.0.1
kraken: python distributed data transfer tool.
3 versions - Latest release: over 3 years ago - 1 dependent repositories - 16 downloads last month - 2 stars on GitHub - 1 maintainer
tanit 2.0.1
tanit: python distributed data transfer tool.
1 version - Latest release: over 3 years ago - 1 dependent repositories - 14 downloads last month - 2 stars on GitHub - 1 maintainer
braingeneers-smart-open 2023.10.6 💰
Utils for streaming large files (S3, HDFS, GCS, Azure Blob Storage, gzip, bz2...)
1 version - Latest release: 8 months ago - 1 dependent package - 1.46 thousand downloads last month - 0 stars on GitHub - 2 maintainers
aioseaweedfs 0.3.1 💰
async client for seaweedfs
4 versions - Latest release: 4 months ago - 25 downloads last month - 21,348 stars on GitHub - 1 maintainer
tinyhdfs 1.1.4
Tiny client for HDFS, base on WebHDFS
1 version - Latest release: over 7 years ago - 1 dependent repositories - 8 downloads last month - 2 stars on GitHub - 1 maintainer
Top 4.4% on pypi.org
skein 0.8.2
A simple tool and library for deploying applications on Apache YARN
23 versions - Latest release: about 2 years ago - 2 dependent packages - 12 dependent repositories - 39.5 thousand downloads last month - 140 stars on GitHub - 1 maintainer
Top 3.2% on pypi.org
wradlib 2.0.3
wradlib - An Open Source Library for Weather Radar Data Processing
71 versions - Latest release: 7 months ago - 5 dependent packages - 12 dependent repositories - 3.09 thousand downloads last month - 250 stars on GitHub - 2 maintainers
ym-impyla 0.14.0
Python client for the Impala distributed query engine
1 version - Latest release: over 7 years ago - 1 dependent repositories - 11 downloads last month - 1 stars on GitHub - 1 maintainer
edmunds_hdfs_load 4.0.1
Moves files to hdfs by creating hive tables
29 versions - Latest release: almost 10 years ago - 2 dependent repositories - 56 downloads last month - 1 maintainer
srcd_smart_open 1.9.0 💰
Utils for streaming large files (S3, HDFS, gzip, bz2...) - temporary source{d} fork
1 version - Latest release: over 5 years ago - 39 downloads last month - 3,100 stars on GitHub - 1 maintainer
Top 5.6% on pypi.org
tethys-smart-open 6.2.0.dev2 removed
Utils for streaming large files (S3, HDFS, GCS, Azure Blob Storage, gzip, bz2...)
3 versions - Latest release: over 1 year ago
sparkdh 0.0.1
1 version - Latest release: over 2 years ago - 1 dependent repositories - 14 downloads last month - 0 stars on GitHub - 1 maintainer
smart-pathlib 0.0.1
Utils for making os standard path operationsinteroperable with Cloud blob storages
1 version - Latest release: over 3 years ago - 1 dependent repositories - 23 downloads last month - 0 stars on GitHub - 1 maintainer
impyla-jz 0.16.3
Python client for the Impala distributed query engine
1 version - Latest release: almost 3 years ago - 1 dependent repositories - 20 downloads last month - 0 stars on GitHub - 1 maintainer
spark-yarn-submit 1.0.0
library to handle spark job submit in a yarn cluster in different environment
1 version - Latest release: over 7 years ago - 1 dependent repositories - 12 downloads last month - 3 stars on GitHub - 1 maintainer
jupyter-omnicm 0.0.5
jupyter-omnicm is a flexible content manager system for Jupyter notebooks.
2 versions - Latest release: over 5 years ago - 1 dependent repositories - 24 downloads last month - 0 stars on GitHub - 1 maintainer
test-cephadm 0.2
cephadm
2 versions - Latest release: over 4 years ago - 1 dependent repositories - 27 downloads last month - 13,175 stars on GitHub - 1 maintainer
Top 3.2% on pypi.org
tiledb 0.29.0
Pythonic interface to the TileDB array storage manager
137 versions - Latest release: about 1 month ago - 15 dependent packages - 136 dependent repositories - 36.3 thousand downloads last month - 178 stars on GitHub - 5 maintainers
elx 0.2.0 💰
A lightweight Python interface for extracting and loading using the Singer.io spec.
8 versions - Latest release: 7 months ago - 53 downloads last month - 3,095 stars on GitHub - 1 maintainer
hadoop-mapreduce 0.5
Implementation of Hadoop Mapreduce on text files
4 versions - Latest release: over 1 year ago - 23 downloads last month - 0 stars on GitHub - 1 maintainer
trustedanalytics 0.7.3.post20161020785
Trusted Analytics Toolkit
161 versions - Latest release: over 7 years ago - 2 dependent repositories - 791 downloads last month - 43 stars on GitHub - 1 maintainer
streamsx.hdfs 1.5.9
HDFS integration for IBM Streams
18 versions - Latest release: over 3 years ago - 167 downloads last month - 9 stars on GitHub - 4 maintainers
importable 0.2.2
Allows to import zip-compressed Python package by URL (http, hdfs).
4 versions - Latest release: over 6 years ago - 1 dependent repositories - 47 downloads last month - 2 stars on GitHub - 1 maintainer
Top 3.6% on pypi.org
hdfs3 0.3.1 💰
Python wrappers for libhdfs3, a native HDFS client
7 versions - Latest release: almost 5 years ago - 3 dependent packages - 30 dependent repositories - 10.5 thousand downloads last month - 136 stars on GitHub - 3 maintainers
hcompressor 1.0.0 removed
Hcompressor is a tool to compress files in HDFS
1 version - Latest release: over 1 year ago - 15 downloads last month - 1 stars on GitHub - 1 maintainer