Ecosyste.ms: Packages
An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.
pypi.org "hdfs" keyword
hdfs-native 0.9.1
Python bindings for hdfs-native Rust library
13 versions - Latest release: 2 months ago - 561 downloads last month - 21 stars on GitHub - 1 maintainer
cyhdfs 0.1.3
Cython wrapper around libhdfs
2 versions - Latest release: over 11 years ago - 2 dependent repositories - 11 downloads last month - 1 maintainer
pydistcp 1.0.7
pydistcp: python WebHDFS inter/intra-cluster data copy tool.
6 versions - Latest release: almost 4 years ago - 1 dependent repository - 11 downloads last month - 10 stars on GitHub - 1 maintainer
sqoopit 0.0.12
A simple package to let you Sqoop into HDFS/Hive/HBase with python
1 version - Latest release: about 4 years ago - 1 dependent repository - 15 downloads last month - 0 stars on GitHub - 1 maintainer
smart-open 7.0.4 💰
Utils for streaming large files (S3, HDFS, GCS, Azure Blob Storage, gzip, bz2...)
61 versions - Latest release: 3 months ago - 248 dependent packages - 10,157 dependent repositories - 24.7 million downloads last month - 3,101 stars on GitHub - 2 maintainers - Top 1.1% on pypi.org
pymongo_hadoop 1.1.0
UNKNOWN
2 versions - Latest release: about 11 years ago - 3 dependent repositories - 21 downloads last month - 1,521 stars on GitHub - 1 maintainer
snakebite 2.11.0
Pure Python HDFS client
63 versions - Latest release: almost 8 years ago - 1 dependent package - 88 dependent repositories - 47.5 thousand downloads last month - 858 stars on GitHub - 1 maintainer - Top 2.3% on pypi.org
py4hdfs 0.0.1
Fast Queries to HDFS
1 version - Latest release: about 8 years ago - 2 dependent repositories - 5 downloads last month - 1 star on GitHub - 1 maintainer
spark-hdfs-tools 0.1.1
spark_hdfs_tools
1 version - Latest release: 11 months ago - 10 downloads last month - 1 maintainer
dfspy 0.1.0
Distributed File System written in Python
1 version - Latest release: almost 2 years ago - 15 downloads last month - 14 stars on GitHub - 1 maintainer
hdfs 2.7.3
HdfsCLI: API and command line interface for HDFS.
81 versions - Latest release: 8 months ago - 36 dependent packages - 952 dependent repositories - 3.08 million downloads last month - 267 stars on GitHub - 1 maintainer - Top 0.4% on pypi.org
alphareader 0.0.7
A reader for large files with custom delimiters and encodings
7 versions - Latest release: about 4 years ago - 1 dependent repository - 46 downloads last month - 5 stars on GitHub - 1 maintainer
megfile 3.0.3
Megvii file operation library
79 versions - Latest release: about 1 month ago - 2 dependent packages - 5 dependent repositories - 5.35 thousand downloads last month - 110 stars on GitHub - 2 maintainers - Top 5.8% on pypi.org
hadeploy 0.6.1
An Hadoop Application deployment tool
12 versions - Latest release: over 5 years ago - 1 dependent repository - 25 downloads last month - 10 stars on GitHub - 1 maintainer
impyla 0.19.0
Python client for the Impala distributed query engine
52 versions - Latest release: 7 months ago - 29 dependent packages - 251 dependent repositories - 746 thousand downloads last month - 724 stars on GitHub - 13 maintainers - Top 1.4% on pypi.org
redis2hdfs 0.0.2
Export Redis data to HDFS
2 versions - Latest release: over 9 years ago - 2 dependent repositories - 12 downloads last month - 2 stars on GitHub - 1 maintainer
dvc-hdfs 3.0.0
hdfs plugin for dvc
3 versions - Latest release: 6 months ago - 5 dependent packages - 3 dependent repositories - 10.2 thousand downloads last month - 0 stars on GitHub - 2 maintainers
starlake-airflow 0.1.2
Starlake Python Distribution For Airflow
24 versions - Latest release: 4 months ago - 1 dependent package - 128 downloads last month - 31 stars on GitHub - 1 maintainer
starlake-dagster 0.1.2
Starlake Python Distribution For Dagster
6 versions - Latest release: 4 months ago - 1 dependent package - 85 downloads last month - 31 stars on GitHub - 1 maintainer
starlake-orchestration 0.1.2
Starlake Python Distribution For orchestration
7 versions - Latest release: 4 months ago - 2 dependent packages - 121 downloads last month - 31 stars on GitHub - 1 maintainer
pfio 2.8.0
PFN IO library
31 versions - Latest release: about 2 months ago - 1 dependent repository - 66.8 thousand downloads last month - 52 stars on GitHub - 3 maintainers
webhdfspy 0.3.5
A wrapper library to access Hadoop HTTP REST API
8 versions - Latest release: almost 8 years ago - 2 dependent repositories - 95 downloads last month - 8 stars on GitHub - 1 maintainer
chainerio 0.1.2
Chainer IO library
6 versions - Latest release: about 4 years ago - 54 downloads last month - 52 stars on GitHub - 2 maintainers
cluster-pack 0.3.7
A library on top of either pex or conda-pack to make your Python code easily available on a cluster
44 versions - Latest release: about 1 month ago - 2 dependent packages - 5 dependent repositories - 1.27 thousand downloads last month - 43 stars on GitHub - 7 maintainers - Top 7.7% on pypi.org
schemaindex 0.2451
An indexing engine for different types of data sources, including HDFS, Mysql, etc.
4 versions - Latest release: over 6 years ago - 1 dependent repository - 12 downloads last month - 3 stars on GitHub - 1 maintainer
pyhdfs 0.3.1
Pure Python HDFS client
7 versions - Latest release: over 4 years ago - 2 dependent packages - 11 dependent repositories - 7.38 thousand downloads last month - 88 stars on GitHub - 1 maintainer - Top 6.1% on pypi.org
kraken-pyds 2.0.1
kraken: python distributed data transfer tool.
3 versions - Latest release: over 3 years ago - 1 dependent repository - 16 downloads last month - 2 stars on GitHub - 1 maintainer
tanit 2.0.1
tanit: python distributed data transfer tool.
1 version - Latest release: over 3 years ago - 1 dependent repository - 14 downloads last month - 2 stars on GitHub - 1 maintainer
braingeneers-smart-open 2023.10.6 💰
Utils for streaming large files (S3, HDFS, GCS, Azure Blob Storage, gzip, bz2...)
1 version - Latest release: 8 months ago - 1 dependent package - 1.46 thousand downloads last month - 0 stars on GitHub - 2 maintainers
aioseaweedfs 0.3.1 💰
async client for seaweedfs
4 versions - Latest release: 4 months ago - 25 downloads last month - 21,348 stars on GitHub - 1 maintainer
tinyhdfs 1.1.4
Tiny client for HDFS, base on WebHDFS
1 version - Latest release: over 7 years ago - 1 dependent repository - 8 downloads last month - 2 stars on GitHub - 1 maintainer
skein 0.8.2
A simple tool and library for deploying applications on Apache YARN
23 versions - Latest release: about 2 years ago - 2 dependent packages - 12 dependent repositories - 39.5 thousand downloads last month - 140 stars on GitHub - 1 maintainer - Top 4.4% on pypi.org
wradlib 2.0.3
wradlib - An Open Source Library for Weather Radar Data Processing
71 versions - Latest release: 7 months ago - 5 dependent packages - 12 dependent repositories - 3.09 thousand downloads last month - 250 stars on GitHub - 2 maintainers - Top 3.2% on pypi.org
ym-impyla 0.14.0
Python client for the Impala distributed query engine
1 version - Latest release: over 7 years ago - 1 dependent repository - 11 downloads last month - 1 star on GitHub - 1 maintainer
edmunds_hdfs_load 4.0.1
Moves files to hdfs by creating hive tables
29 versions - Latest release: almost 10 years ago - 2 dependent repositories - 56 downloads last month - 1 maintainer
srcd_smart_open 1.9.0 💰
Utils for streaming large files (S3, HDFS, gzip, bz2...) - temporary source{d} fork
1 version - Latest release: over 5 years ago - 39 downloads last month - 3,100 stars on GitHub - 1 maintainer
tethys-smart-open 6.2.0.dev2 (removed)
Utils for streaming large files (S3, HDFS, GCS, Azure Blob Storage, gzip, bz2...)
3 versions - Latest release: over 1 year ago - Top 5.6% on pypi.org
sparkdh 0.0.1
1 version - Latest release: over 2 years ago - 1 dependent repository - 14 downloads last month - 0 stars on GitHub - 1 maintainer
smart-pathlib 0.0.1
Utils for making os standard path operations interoperable with Cloud blob storages
1 version - Latest release: over 3 years ago - 1 dependent repository - 23 downloads last month - 0 stars on GitHub - 1 maintainer
impyla-jz 0.16.3
Python client for the Impala distributed query engine
1 version - Latest release: almost 3 years ago - 1 dependent repository - 20 downloads last month - 0 stars on GitHub - 1 maintainer
spark-yarn-submit 1.0.0
library to handle spark job submit in a yarn cluster in different environment
1 version - Latest release: over 7 years ago - 1 dependent repository - 12 downloads last month - 3 stars on GitHub - 1 maintainer
jupyter-omnicm 0.0.5
jupyter-omnicm is a flexible content manager system for Jupyter notebooks.
2 versions - Latest release: over 5 years ago - 1 dependent repository - 24 downloads last month - 0 stars on GitHub - 1 maintainer
test-cephadm 0.2
cephadm
2 versions - Latest release: over 4 years ago - 1 dependent repository - 27 downloads last month - 13,175 stars on GitHub - 1 maintainer
tiledb 0.29.0
Pythonic interface to the TileDB array storage manager
137 versions - Latest release: about 1 month ago - 15 dependent packages - 136 dependent repositories - 36.3 thousand downloads last month - 178 stars on GitHub - 5 maintainers - Top 3.2% on pypi.org
elx 0.2.0 💰
A lightweight Python interface for extracting and loading using the Singer.io spec.
8 versions - Latest release: 7 months ago - 53 downloads last month - 3,095 stars on GitHub - 1 maintainer
hadoop-mapreduce 0.5
Implementation of Hadoop Mapreduce on text files
4 versions - Latest release: over 1 year ago - 23 downloads last month - 0 stars on GitHub - 1 maintainer
trustedanalytics 0.7.3.post20161020785
Trusted Analytics Toolkit
161 versions - Latest release: over 7 years ago - 2 dependent repositories - 791 downloads last month - 43 stars on GitHub - 1 maintainer
streamsx.hdfs 1.5.9
HDFS integration for IBM Streams
18 versions - Latest release: over 3 years ago - 167 downloads last month - 9 stars on GitHub - 4 maintainers
importable 0.2.2
Allows to import zip-compressed Python package by URL (http, hdfs).
4 versions - Latest release: over 6 years ago - 1 dependent repository - 47 downloads last month - 2 stars on GitHub - 1 maintainer
hdfs3 0.3.1 💰
Python wrappers for libhdfs3, a native HDFS client
7 versions - Latest release: almost 5 years ago - 3 dependent packages - 30 dependent repositories - 10.5 thousand downloads last month - 136 stars on GitHub - 3 maintainers - Top 3.6% on pypi.org
hcompressor 1.0.0 (removed)
Hcompressor is a tool to compress files in HDFS
1 version - Latest release: over 1 year ago - 15 downloads last month - 1 star on GitHub - 1 maintainer
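The per-package statistics in the listing above follow a regular pattern ("N versions", "X thousand downloads last month", "N stars on GitHub", and so on), so they can be pulled into numbers for sorting or filtering. The following is a minimal sketch in Python, not part of the Ecosyste.ms service; the names `parse_stats` and `_count` are hypothetical, and the sketch assumes the only magnitude words in use are "thousand" and "million", as seen in this listing.

```python
import re

# Multipliers for the spelled-out magnitudes seen in the listing (assumption:
# no other magnitude words occur).
_SCALE = {None: 1, "thousand": 1_000, "million": 1_000_000}


def _count(line, label):
    """Return the number that directly precedes `label` in `line`, or None."""
    match = re.search(r"(\d[\d,.]*)\s*(thousand|million)?\s+" + label, line)
    if match is None:
        return None
    value = float(match.group(1).replace(",", ""))
    return int(round(value * _SCALE[match.group(2)]))


def parse_stats(line):
    """Extract the numeric fields from one statistics line of the listing.

    Fields that the line does not mention come back as None.
    """
    return {
        "versions": _count(line, r"versions?\b"),
        "downloads_last_month": _count(line, r"downloads last month"),
        "stars": _count(line, r"stars? on GitHub"),
        "maintainers": _count(line, r"maintainers?\b"),
    }


if __name__ == "__main__":
    example = (
        "81 versions - Latest release: 8 months ago - 36 dependent packages - "
        "952 dependent repositories - 3.08 million downloads last month - "
        "267 stars on GitHub - 1 maintainer"
    )
    print(parse_stats(example))
```

Regular expressions are used here rather than splitting on " - " because some descriptions in the listing (for example the wradlib one) themselves contain " - ".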
Related Keywords
hadoop (24)
python (19)
s3 (11)
spark (10)
webhdfs (8)
streaming (6)
gcs (6)
hive (6)
file (5)
distributed (5)
filesystem (4)
cloudera (4)
hacktoberfest (4)
bigdata (3)
synapse (3)
impala (3)
sql (3)
mpp (3)
pydata (3)
snowflake (3)
pandas (3)
bigquery (3)
etl (3)
db (3)
api (3)
scala (3)
redshift (3)
pep (3)
249 (3)
hiveserver2 (3)
hs2 (3)
boto (3)
bz2 (3)
gzip-stream (3)
streaming-data (3)
azure blob storage (3)
file streaming (2)
storage (2)
disaster-recovery (2)
data-transfer (2)
data-migration (2)
data-life-cycle (2)
data-backup (2)
aws-s3 (2)
hbase (2)
thrift (2)
chainer (2)
development (2)
hadoop-filesystem (2)
yarn (2)
cluster (2)
HDFS (2)
pyspark (2)
data (2)
distributed-systems (2)
mapreduce (2)
distributed-file-system (2)
distributed-storage (2)
deployment (2)
erasure-coding (2)
fuse (2)
replication (2)
kubernetes (2)
posix (2)
rainbow (1)
radar (1)
netcdf4 (1)
submit (1)
apache-yarn (1)
cdh (1)
hdp (1)
sigmet (1)
azure-storage (1)
data-science (1)
dataframe (1)
weather (1)
wradlib (1)
blob (1)
xarray (1)
azure (1)
python-library (1)
toolkit (1)
stream-processing (1)
java (1)
ibm-streams (1)
nfs (1)
nvme-over-fabrics (1)
object-store (1)
smb (1)
software-defined-storage (1)
array (1)
numpy (1)
storage-manager (1)
tiledb (1)
analytics (1)
streams (1)
big (1)
spark-clusters (1)
spark-job (1)
jupyter (1)