pypi.org "cloudera" keyword
View the packages on the pypi.org package registry that are tagged with the "cloudera" keyword.
analytics-command-center 3.0.14
Command Center for Data Ingestion, Advanced Analytics and Artificial Intelligence process1 version - Latest release: over 3 years ago - 55 downloads last month - 11 stars on GitHub - 1 maintainer
Top 1.4% on pypi.org
59 versions - Latest release: about 1 month ago - 29 dependent packages - 251 dependent repositories - 6.52 million downloads last month - 738 stars on GitHub - 13 maintainers
impyla 0.21.0
Python client for the Impala distributed query engine59 versions - Latest release: about 1 month ago - 29 dependent packages - 251 dependent repositories - 6.52 million downloads last month - 738 stars on GitHub - 13 maintainers
nifi-rest 0.0.3
Scripts for the NIFI API4 versions - Latest release: about 4 years ago - 1 dependent repositories - 141 downloads last month - 1 maintainer
wunderkafka 0.18.0
librdkafka-powered client for Kafka for python with (hopefully) more handful API31 versions - Latest release: 6 months ago - 1 dependent package - 1 dependent repositories - 1.16 thousand downloads last month - 3 stars on GitHub - 1 maintainer
nifi-api 0.0.6
Scripts for the NIFI API3 versions - Latest release: almost 4 years ago - 1 dependent repositories - 214 downloads last month - 1 maintainer
trustedanalytics 0.7.3.post20161020785
Trusted Analytics Toolkit161 versions - Latest release: over 8 years ago - 2 dependent repositories - 720 downloads last month - 43 stars on GitHub - 1 maintainer
risk-command-center 1.0.37
Risk Command Center, manage your risk easly.2 versions - Latest release: almost 3 years ago - 1 dependent repositories - 95 downloads last month - 11 stars on GitHub - 1 maintainer
impyla-jz 0.16.3
Python client for the Impala distributed query engine1 version - Latest release: almost 4 years ago - 1 dependent repositories - 46 downloads last month - 0 stars on GitHub - 1 maintainer
ym-impyla 0.14.0
Python client for the Impala distributed query engine1 version - Latest release: over 8 years ago - 1 dependent repositories - 38 downloads last month - 1 stars on GitHub - 1 maintainer
spooq 3.4.2
Spooq is a PySpark based helper library for ETL data ingestion pipeline in Data Lakes.13 versions - Latest release: 9 months ago - 1 dependent repositories - 26.2 thousand downloads last month - 8 stars on GitHub - 1 maintainer
cloudera-assist 0.1.0
A CLI tool for managing credentials and executing playbooks for Cloudera demonstrations, workshop...2 versions - Latest release: 3 months ago - 97 downloads last month - 1 maintainer
Related Keywords
spark
7
hadoop
7
python
6
hive
6
hdfs
4
impala
3
sql
3
mpp
3
pydata
3
pandas
3
distributed
3
data
3
hs2
3
db
3
api
3
pep
3
249
3
hiveserver2
3
data-engineering
3
nifi
2
requests
2
bigdata
2
chile
2
data-engineer
2
data-governance
2
data-warehouse
2
datamart
2
dataquality
2
gdpr
2
hortonworks
2
huemul
2
huemul-bigdatagovernance
2
parquet
2
spark-sql
2
trabaja-sobre-spark
2
databricks
1
big data
1
batch
1
streaming
1
data engineering
1
big-data
1
etl-pipeline
1
extract
1
load
1
transform
1
ansible-navigator
1
cdp
1
data wrangling
1
data ingestion
1
etl
1
spooq
1
yarn
1
big
1
analytics
1
librdkafka
1
kafka-protocol
1
kafka-client
1
confluent
1
kafka
1