Ecosyste.ms: Packages
An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.
pypi.org "hdfs" keyword
hadoop-mapreduce 0.5
Implementation of Hadoop Mapreduce on text files4 versions - Latest release: over 1 year ago - 23 downloads last month - 0 stars on GitHub - 2 maintainers
Top 5.8% on pypi.org
77 versions - Latest release: about 6 hours ago - 2 dependent packages - 5 dependent repositories - 6.69 thousand downloads last month - 105 stars on GitHub - 2 maintainers
megfile 3.0.3
Megvii file operation library77 versions - Latest release: about 6 hours ago - 2 dependent packages - 5 dependent repositories - 6.69 thousand downloads last month - 105 stars on GitHub - 2 maintainers
Top 1.4% on pypi.org
52 versions - Latest release: 6 months ago - 22 dependent packages - 251 dependent repositories - 612 thousand downloads last month - 723 stars on GitHub - 13 maintainers
impyla 0.19.0
Python client for the Impala distributed query engine52 versions - Latest release: 6 months ago - 22 dependent packages - 251 dependent repositories - 612 thousand downloads last month - 723 stars on GitHub - 13 maintainers
pydistcp 1.0.7
pydistcp: python WebHDFS inter/intra-cluster data copy tool.6 versions - Latest release: over 3 years ago - 1 dependent repositories - 21 downloads last month - 10 stars on GitHub - 2 maintainers
cyhdfs 0.1.3
Cython wrapper around libhdfs2 versions - Latest release: over 11 years ago - 2 dependent repositories - 13 downloads last month - 2 maintainers
srcd_smart_open 1.9.0 💰
Utils for streaming large files (S3, HDFS, gzip, bz2...) - temporary source{d} fork1 version - Latest release: about 5 years ago - 54 downloads last month - 3,095 stars on GitHub - 2 maintainers
Top 1.1% on pypi.org
61 versions - Latest release: about 1 month ago - 216 dependent packages - 10,157 dependent repositories - 24.5 million downloads last month - 3,095 stars on GitHub - 2 maintainers
smart-open 7.0.4 💰
Utils for streaming large files (S3, HDFS, GCS, Azure Blob Storage, gzip, bz2...)61 versions - Latest release: about 1 month ago - 216 dependent packages - 10,157 dependent repositories - 24.5 million downloads last month - 3,095 stars on GitHub - 2 maintainers
Top 5.6% on pypi.org
3 versions - Latest release: over 1 year ago
tethys-smart-open 6.2.0.dev2 removed
Utils for streaming large files (S3, HDFS, GCS, Azure Blob Storage, gzip, bz2...)3 versions - Latest release: over 1 year ago
sqoopit 0.0.12
A simple package to let you Sqoop into HDFS/Hive/HBase with python1 version - Latest release: about 4 years ago - 1 dependent repositories - 16 downloads last month - 0 stars on GitHub - 2 maintainers
aioseaweedfs 0.3.1 💰
async client for seaweedfs4 versions - Latest release: 3 months ago - 41 downloads last month - 21,149 stars on GitHub - 2 maintainers
pymongo_hadoop 1.1.0
UNKNOWN2 versions - Latest release: about 11 years ago - 3 dependent repositories - 25 downloads last month - 1,522 stars on GitHub - 2 maintainers
Top 2.3% on pypi.org
63 versions - Latest release: almost 8 years ago - 1 dependent package - 88 dependent repositories - 47.5 thousand downloads last month - 858 stars on GitHub - 1 maintainer
snakebite 2.11.0
Pure Python HDFS client63 versions - Latest release: almost 8 years ago - 1 dependent package - 88 dependent repositories - 47.5 thousand downloads last month - 858 stars on GitHub - 1 maintainer
py4hdfs 0.0.1
Fast Queries to HDFS1 version - Latest release: about 8 years ago - 2 dependent repositories - 9 downloads last month - 1 stars on GitHub - 2 maintainers
dvc-hdfs 3.0.0
hdfs plugin for dvc3 versions - Latest release: 5 months ago - 4 dependent packages - 3 dependent repositories - 10.5 thousand downloads last month - 0 stars on GitHub - 4 maintainers
dfspy 0.1.0
Distributed File System written in Python1 version - Latest release: almost 2 years ago - 19 downloads last month - 14 stars on GitHub - 2 maintainers
hdfs-native 0.9.1
Python bindings for hdfs-native Rust library10 versions - Latest release: about 1 month ago - 313 downloads last month - 18 stars on GitHub - 2 maintainers
spark-hdfs-tools 0.1.1
spark_hdfs_tools1 version - Latest release: 9 months ago - 9 downloads last month - 1 maintainer
alphareader 0.0.7
A reader for large files with custom delimiters and encodings7 versions - Latest release: about 4 years ago - 1 dependent repositories - 27 downloads last month - 5 stars on GitHub - 1 maintainer
hadeploy 0.6.1
An Hadoop Application deployment tool12 versions - Latest release: over 5 years ago - 1 dependent repositories - 112 downloads last month - 10 stars on GitHub - 2 maintainers
redis2hdfs 0.0.2
Export Redis data to HDFS2 versions - Latest release: over 9 years ago - 2 dependent repositories - 12 downloads last month - 2 stars on GitHub - 2 maintainers
Top 0.4% on pypi.org
81 versions - Latest release: 7 months ago - 31 dependent packages - 952 dependent repositories - 2.96 million downloads last month - 267 stars on GitHub - 1 maintainer
hdfs 2.7.3
HdfsCLI: API and command line interface for HDFS.81 versions - Latest release: 7 months ago - 31 dependent packages - 952 dependent repositories - 2.96 million downloads last month - 267 stars on GitHub - 1 maintainer
pfio 2.8.0
PFN IO library31 versions - Latest release: 23 days ago - 1 dependent repositories - 58 thousand downloads last month - 52 stars on GitHub - 4 maintainers
webhdfspy 0.3.5
A wrapper library to access Hadoop HTTP REST API8 versions - Latest release: over 7 years ago - 2 dependent repositories - 95 downloads last month - 8 stars on GitHub - 2 maintainers
chainerio 0.1.2
Chainer IO library6 versions - Latest release: about 4 years ago - 44 downloads last month - 52 stars on GitHub - 4 maintainers
Top 7.7% on pypi.org
44 versions - Latest release: 10 days ago - 2 dependent packages - 5 dependent repositories - 683 downloads last month - 43 stars on GitHub - 14 maintainers
cluster-pack 0.3.7
A library on top of either pex or conda-packto make your Python code easily available on a cluster44 versions - Latest release: 10 days ago - 2 dependent packages - 5 dependent repositories - 683 downloads last month - 43 stars on GitHub - 14 maintainers
kraken-pyds 2.0.1
kraken: python distributed data transfer tool.3 versions - Latest release: over 3 years ago - 1 dependent repositories - 20 downloads last month - 2 stars on GitHub - 1 maintainer
Top 6.1% on pypi.org
7 versions - Latest release: over 4 years ago - 11 dependent repositories - 7.36 thousand downloads last month - 88 stars on GitHub - 2 maintainers
pyhdfs 0.3.1
Pure Python HDFS client7 versions - Latest release: over 4 years ago - 11 dependent repositories - 7.36 thousand downloads last month - 88 stars on GitHub - 2 maintainers
tinyhdfs 1.1.4
Tiny client for HDFS, base on WebHDFS1 version - Latest release: over 7 years ago - 1 dependent repositories - 11 downloads last month - 2 stars on GitHub - 2 maintainers
tanit 2.0.1
tanit: python distributed data transfer tool.1 version - Latest release: over 3 years ago - 1 dependent repositories - 9 downloads last month - 2 stars on GitHub - 2 maintainers
schemaindex 0.2451
An indexing engine for different types of data sources, including HDFS, Mysql, etc.4 versions - Latest release: about 6 years ago - 1 dependent repositories - 15 downloads last month - 3 stars on GitHub - 2 maintainers
braingeneers-smart-open 2023.10.6 💰
Utils for streaming large files (S3, HDFS, GCS, Azure Blob Storage, gzip, bz2...)1 version - Latest release: 7 months ago - 1 dependent package - 258 downloads last month - 0 stars on GitHub - 3 maintainers
Top 4.4% on pypi.org
23 versions - Latest release: about 2 years ago - 2 dependent packages - 12 dependent repositories - 25.2 thousand downloads last month - 138 stars on GitHub - 2 maintainers
skein 0.8.2
A simple tool and library for deploying applications on Apache YARN23 versions - Latest release: about 2 years ago - 2 dependent packages - 12 dependent repositories - 25.2 thousand downloads last month - 138 stars on GitHub - 2 maintainers
Top 3.2% on pypi.org
136 versions - Latest release: 24 days ago - 13 dependent packages - 136 dependent repositories - 36.3 thousand downloads last month - 178 stars on GitHub - 5 maintainers
tiledb 0.28.0
Pythonic interface to the TileDB array storage manager136 versions - Latest release: 24 days ago - 13 dependent packages - 136 dependent repositories - 36.3 thousand downloads last month - 178 stars on GitHub - 5 maintainers
Top 3.6% on pypi.org
7 versions - Latest release: almost 5 years ago - 3 dependent packages - 30 dependent repositories - 11.2 thousand downloads last month - 136 stars on GitHub - 6 maintainers
hdfs3 0.3.1 💰
Python wrappers for libhdfs3, a native HDFS client7 versions - Latest release: almost 5 years ago - 3 dependent packages - 30 dependent repositories - 11.2 thousand downloads last month - 136 stars on GitHub - 6 maintainers
Top 3.2% on pypi.org
71 versions - Latest release: 6 months ago - 3 dependent packages - 12 dependent repositories - 2.67 thousand downloads last month - 247 stars on GitHub - 2 maintainers
wradlib 2.0.3
wradlib - An Open Source Library for Weather Radar Data Processing71 versions - Latest release: 6 months ago - 3 dependent packages - 12 dependent repositories - 2.67 thousand downloads last month - 247 stars on GitHub - 2 maintainers
ym-impyla 0.14.0
Python client for the Impala distributed query engine1 version - Latest release: over 7 years ago - 1 dependent repositories - 11 downloads last month - 1 stars on GitHub - 2 maintainers
edmunds_hdfs_load 4.0.1
Moves files to hdfs by creating hive tables29 versions - Latest release: almost 10 years ago - 2 dependent repositories - 85 downloads last month - 2 maintainers
sparkdh 0.0.1
1 version - Latest release: about 2 years ago - 1 dependent repositories - 6 downloads last month - 0 stars on GitHub - 2 maintainerssmart-pathlib 0.0.1
Utils for making os standard path operationsinteroperable with Cloud blob storages1 version - Latest release: over 3 years ago - 1 dependent repositories - 11 downloads last month - 0 stars on GitHub - 2 maintainers
impyla-jz 0.16.3
Python client for the Impala distributed query engine1 version - Latest release: almost 3 years ago - 1 dependent repositories - 12 downloads last month - 0 stars on GitHub - 2 maintainers
spark-yarn-submit 1.0.0
library to handle spark job submit in a yarn cluster in different environment1 version - Latest release: about 7 years ago - 1 dependent repositories - 3 downloads last month - 3 stars on GitHub - 2 maintainers
jupyter-omnicm 0.0.5
jupyter-omnicm is a flexible content manager system for Jupyter notebooks.2 versions - Latest release: about 5 years ago - 1 dependent repositories - 9 downloads last month - 0 stars on GitHub - 1 maintainer
test-cephadm 0.2
cephadm2 versions - Latest release: over 4 years ago - 1 dependent repositories - 3 downloads last month - 13,175 stars on GitHub - 1 maintainer
starlake-dagster 0.1.2
Starlake Python Distribution For Dagster5 versions - Latest release: 3 months ago - 11 downloads last month - 31 stars on GitHub - 2 maintainers
starlake-airflow 0.1.2
Starlake Python Distribution For Airflow23 versions - Latest release: 3 months ago - 74 downloads last month - 31 stars on GitHub - 1 maintainer
starlake-orchestration 0.1.2
Starlake Python Distribution For orchestration6 versions - Latest release: 3 months ago - 23 downloads last month - 31 stars on GitHub - 2 maintainers
importable 0.2.2
Allows to import zip-compressed Python package by URL (http, hdfs).4 versions - Latest release: over 6 years ago - 1 dependent repositories - 9 downloads last month - 2 stars on GitHub - 2 maintainers
trustedanalytics 0.7.3.post20161020785
Trusted Analytics Toolkit161 versions - Latest release: over 7 years ago - 2 dependent repositories - 279 downloads last month - 43 stars on GitHub - 2 maintainers
streamsx.hdfs 1.5.9
HDFS integration for IBM Streams18 versions - Latest release: over 3 years ago - 167 downloads last month - 9 stars on GitHub - 8 maintainers
elx 0.2.0 💰
A lightweight Python interface for extracting and loading using the Singer.io spec.8 versions - Latest release: 6 months ago - 2 downloads last month - 3,082 stars on GitHub - 2 maintainers
hcompressor 1.0.0 removed
Hcompressor is a tool to compress files in HDFS1 version - Latest release: over 1 year ago - 15 downloads last month - 1 stars on GitHub - 1 maintainer
Related Keywords
hadoop
24
python
19
s3
11
spark
10
webhdfs
8
hive
6
streaming
6
gcs
6
file
5
distributed
5
hacktoberfest
4
cloudera
4
filesystem
4
streaming-data
3
gzip-stream
3
bz2
3
azure blob storage
3
boto
3
bigquery
3
etl
3
bigdata
3
redshift
3
synapse
3
snowflake
3
scala
3
impala
3
sql
3
mpp
3
pydata
3
pandas
3
db
3
api
3
pep
3
249
3
hiveserver2
3
hs2
3
HDFS
2
cluster
2
hadoop-filesystem
2
development
2
chainer
2
deployment
2
replication
2
posix
2
pyspark
2
thrift
2
aws-s3
2
data-backup
2
data-life-cycle
2
data-migration
2
data-transfer
2
disaster-recovery
2
data
2
yarn
2
mapreduce
2
storage
2
hbase
2
erasure-coding
2
fuse
2
file streaming
2
distributed-systems
2
distributed-storage
2
distributed-file-system
2
kubernetes
2
jupyter
1
spark-job
1
spark-clusters
1
python-library
1
block-storage
1
hdp
1
cloud-storage
1
high-performance
1
highly-available
1
cdh
1
submit
1
libhdfs
1
azure-storage
1
data-science
1
dataframe
1
xarray
1
wradlib
1
weather
1
sigmet
1
rainbow
1
radar
1
netcdf4
1
tiledb
1
decompression
1
compression
1
toolkit
1
stream-processing
1
java
1
ibm-streams
1
streams
1
big
1
analytics
1
python_path
1
importable
1
zip
1
http
1