Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "databricks" keyword

Top 0.9% on pypi.org
apache-airflow-providers-databricks 6.4.0
Provider package apache-airflow-providers-databricks for Apache Airflow
90 versions - Latest release: 15 days ago - 6 dependent packages - 23 dependent repositories - 7.13 million downloads last month - 34,343 stars on GitHub - 4 maintainers
mlflow-saagie 2.9.2
MLflow: A Platform for ML Development and Productionization - forked for Saagie
8 versions - Latest release: 4 months ago - 1 dependent repository - 40 downloads last month - 17,434 stars on GitHub - 1 maintainer
Top 0.2% on pypi.org
mlflow 2.12.2
MLflow is an open source platform for the complete machine learning lifecycle
98 versions - Latest release: 12 days ago - 360 dependent packages - 5,089 dependent repositories - 14.3 million downloads last month - 17,434 stars on GitHub - 13 maintainers
mlflow-devlibx 1.22.8
MLflow: A Platform for ML Development and Productionization
9 versions - Latest release: over 2 years ago - 1 dependent repository - 78 downloads last month - 17,434 stars on GitHub - 1 maintainer
Top 0.6% on pypi.org
mlflow-skinny 2.12.2
MLflow is an open source platform for the complete machine learning lifecycle
59 versions - Latest release: 12 days ago - 49 dependent packages - 70 dependent repositories - 4.61 million downloads last month - 17,434 stars on GitHub - 8 maintainers
mlflow-stonewise 1.30.1
MLflow: A Platform for ML Development and Productionization
1 version - Latest release: over 1 year ago - 20 downloads last month - 17,434 stars on GitHub - 1 maintainer
mlflow-tmp 2.2.26
MLflow: A Platform for ML Development and Productionization
25 versions - Latest release: 12 months ago - 84 downloads last month - 17,420 stars on GitHub - 1 maintainer
mlflow-by-johnsnowlabs-v2 2.44.0
MLflow: A Platform for ML Development and Productionization
9 versions - Latest release: 8 months ago - 34 downloads last month - 17,420 stars on GitHub - 1 maintainer
lmcmlflow 1.17.1
MLflow: A Platform for ML Development and Productionization
3 versions - Latest release: almost 3 years ago - 1 dependent repository - 16 downloads last month - 17,420 stars on GitHub - 1 maintainer
mlflow-by-ckl 2.67.0
MLflow: A Platform for ML Development and Productionization
32 versions - Latest release: about 1 month ago - 123 downloads last month - 16,113 stars on GitHub - 1 maintainer
mlflow-by-johnsnowlabs 2.40.0
MLflow: A Platform for ML Development and Productionization
35 versions - Latest release: 7 months ago - 205 downloads last month - 16,113 stars on GitHub - 1 maintainer
mlflow-ste 1.10.1.dev0
MLflow: An ML Workflow Tool
1 version - Latest release: over 3 years ago - 1 dependent repository - 19 downloads last month - 16,107 stars on GitHub - 1 maintainer
Top 1.0% on pypi.org
sqlfluff 3.0.6 💰
The SQL Linter for Humans
129 versions - Latest release: 15 days ago - 30 dependent packages - 122 dependent repositories - 1.91 million downloads last month - 7,226 stars on GitHub - 2 maintainers
Top 1.9% on pypi.org
sqlfluff-templater-dbt 3.0.6 💰
Lint your dbt project SQL
77 versions - Latest release: 15 days ago - 2 dependent packages - 51 dependent repositories - 859 thousand downloads last month - 7,226 stars on GitHub - 1 maintainer
Top 1.2% on pypi.org
sqlglot 23.13.1
An easily customizable SQL parser and transpiler
521 versions - Latest release: 17 days ago - 104 dependent packages - 272 dependent repositories - 2.86 million downloads last month - 5,389 stars on GitHub - 1 maintainer
Top 3.3% on pypi.org
synapseml 1.0.4
Synapse Machine Learning
16 versions - Latest release: about 1 month ago - 2 dependent packages - 3 dependent repositories - 230 thousand downloads last month - 4,985 stars on GitHub - 1 maintainer
dcborow-mmlspark 0.14.dev1
Microsoft ML for Spark
1 version - Latest release: about 4 years ago - 1 dependent repository - 58 downloads last month - 4,972 stars on GitHub - 1 maintainer
nozberkman-mmlspark 1.0.0
Microsoft ML for Spark
1 version - Latest release: over 2 years ago - 1 dependent repository - 14 downloads last month - 4,472 stars on GitHub - 1 maintainer
Top 3.5% on pypi.org
acryl-sqlglot 23.11.2.dev2
An easily customizable SQL parser and transpiler
19 versions - Latest release: about 1 month ago - 2 dependent packages - 2 dependent repositories - 238 thousand downloads last month - 4,253 stars on GitHub - 2 maintainers
cz-sqlglot 0.0.1
An easily customizable SQL parser and transpiler
1 version - Latest release: 8 months ago - 26 downloads last month - 4,253 stars on GitHub - 1 maintainer
sqlglot-doris 1.1.9
An easily customizable SQL parser and transpiler
42 versions - Latest release: 11 days ago - 513 downloads last month - 4,253 stars on GitHub - 1 maintainer
Top 1.2% on pypi.org
deltalake 0.17.4
Native Delta Lake Python binding based on delta-rs with Pandas integration
56 versions - Latest release: 10 days ago - 47 dependent packages - 235 dependent repositories - 2.89 million downloads last month - 1,844 stars on GitHub - 4 maintainers
Top 2.2% on pypi.org
dbx 0.8.18
DataBricks CLI eXtensions aka dbx
62 versions - Latest release: 10 months ago - 3 dependent packages - 55 dependent repositories - 358 thousand downloads last month - 420 stars on GitHub - 2 maintainers
databricks-cli-uc 0.16.3.13
A command line interface for Databricks with Unity Catalog extensions
4 versions - Latest release: about 2 years ago - 1 dependent repository - 76 downloads last month - 375 stars on GitHub - 1 maintainer
Top 1.6% on pypi.org
databricks-cli 0.18.0
A command line interface for Databricks
53 versions - Latest release: 8 months ago - 56 dependent packages - 1,640 dependent repositories - 15.2 million downloads last month - 373 stars on GitHub - 13 maintainers
Top 5.7% on pypi.org
dbldatagen 0.3.6
Databricks Labs - PySpark Synthetic Data Generator
13 versions - Latest release: 3 months ago - 1 dependent package - 2 dependent repositories - 173 thousand downloads last month - 268 stars on GitHub - 1 maintainer
sparkdantic 0.20.5
A pydantic -> spark schema library
30 versions - Latest release: 2 months ago - 32.9 thousand downloads last month - 268 stars on GitHub - 1 maintainer
Top 5.2% on pypi.org
nutter 0.1.35
A databricks notebook testing library
4 versions - Latest release: over 1 year ago - 11 dependent repositories - 420 thousand downloads last month - 260 stars on GitHub - 1 maintainer
databricks-sdk-secure 0.19.0
Databricks SDK for Python (Beta)
1 version - Latest release: 3 months ago - 34 downloads last month - 258 stars on GitHub - 1 maintainer
Top 4.4% on pypi.org
databricks-sdk 0.27.0
Databricks SDK for Python (Beta)
46 versions - Latest release: 18 days ago - 49 dependent packages - 93 dependent repositories - 5.44 million downloads last month - 258 stars on GitHub - 3 maintainers
atoti-directquery-databricks 0.8.12
Plugin to use DirectQuery on Databricks
11 versions - Latest release: 18 days ago - 1 dependent package - 90 downloads last month - 214 stars on GitHub - 2 maintainers
mlflow-sagemaker 1.5.18
MLflow: An ML Workflow Tool (Forked for Sagemaker)
25 versions - Latest release: about 2 years ago - 1 dependent repository - 57 downloads last month - 210 stars on GitHub - 3 maintainers
Top 3.3% on pypi.org
dbt-databricks 1.7.15
The Databricks adapter plugin for dbt
110 versions - Latest release: 5 days ago - 10 dependent packages - 40 dependent repositories - 934 thousand downloads last month - 179 stars on GitHub - 4 maintainers
databricks-labs-ucx 0.23.1
UCX - Unity Catalog Migration Toolkit
8 versions - Latest release: 11 days ago - 238 downloads last month - 152 stars on GitHub - 2 maintainers
variant-spark 0.5.2
VariantSpark Python API
36 versions - Latest release: over 1 year ago - 1 dependent repository - 263 downloads last month - 138 stars on GitHub - 3 maintainers
Top 2.7% on pypi.org
databricks-sql-connector 3.1.2
Databricks SQL Connector for Python
62 versions - Latest release: about 1 month ago - 67 dependent packages - 86 dependent repositories - 10.7 million downloads last month - 126 stars on GitHub - 4 maintainers
databricks-sqlalchemy 0.0.1b1
SQLAlchemy dialect for Databricks
1 version - Latest release: 4 months ago - 31 downloads last month - 126 stars on GitHub - 1 maintainer
databrickslabs-jupyterlab-status 2.2.1
A JupyterLab extension to show starting status of Databricks clusters.
2 versions - Latest release: almost 3 years ago - 1 dependent repository - 20 downloads last month - 71 stars on GitHub - 1 maintainer
databrickslabs-jupyterlab 2.2.1
Remote JupyterLab kernel for Databricks
37 versions - Latest release: almost 3 years ago - 1 dependent repository - 513 downloads last month - 71 stars on GitHub - 1 maintainer
Top 4.3% on pypi.org
databricks-api 0.9.0
Databricks API client auto-generated from the official databricks-cli package
10 versions - Latest release: 12 months ago - 18 dependent packages - 37 dependent repositories - 2.43 million downloads last month - 60 stars on GitHub - 1 maintainer
Top 5.4% on pypi.org
johnsnowlabs 5.3.5
The John Snow Labs Library gives you access to all of John Snow Labs Enterprise And Open Source p...
141 versions - Latest release: 9 days ago - 1 dependent package - 3 dependent repositories - 28.1 thousand downloads last month - 45 stars on GitHub - 2 maintainers
blackbricks 2.1.3
Black for Databricks notebooks
37 versions - Latest release: 10 months ago - 1 dependent repository - 5.6 thousand downloads last month - 43 stars on GitHub - 1 maintainer
pyjaws 0.1.7
PyJaws: A Pythonic Way to Define Databricks Jobs and Workflows
10 versions - Latest release: 8 months ago - 1 dependent repository - 60 downloads last month - 37 stars on GitHub - 1 maintainer
Top 7.4% on pypi.org
diviner 0.1.1
Diviner: A Grouped Forecasting API
2 versions - Latest release: over 1 year ago - 91 dependent repositories - 11.7 thousand downloads last month - 33 stars on GitHub - 1 maintainer
dlt-with-debug 2.2
Utility for running workflows leveraging delta live tables from interactive notebooks
4 versions - Latest release: over 1 year ago - 27.3 thousand downloads last month - 29 stars on GitHub - 1 maintainer
Top 5.3% on pypi.org
databricks-dbapi 0.6.0
A DBAPI 2.0 interface and SQLAlchemy dialect for Databricks interactive clusters.
7 versions - Latest release: over 2 years ago - 2 dependent packages - 64 dependent repositories - 54 thousand downloads last month - 22 stars on GitHub - 1 maintainer
Top 5.1% on pypi.org
sqlalchemy-databricks 0.2.0
SQLAlchemy Dialect for Databricks
2 versions - Latest release: about 2 years ago - 11 dependent packages - 81 dependent repositories - 237 thousand downloads last month - 21 stars on GitHub - 1 maintainer
astro-provider-databricks 0.2.2
Affordable Databricks Workflows in Apache Airflow
12 versions - Latest release: about 1 month ago - 1 dependent repository - 53.2 thousand downloads last month - 20 stars on GitHub - 6 maintainers
spetlr 5.1.6
A python ETL libRary (SPETLR) for Databricks powered by Apache SPark.
52 versions - Latest release: 6 days ago - 1 dependent package - 25.1 thousand downloads last month - 18 stars on GitHub - 1 maintainer
freeza-offset 1.0.10
Spark stream consumption commit in kafka consumer group
10 versions - Latest release: almost 4 years ago - 1 dependent repository - 947 downloads last month - 14 stars on GitHub - 1 maintainer
catalog-builder 0.4
Data Catalogs Made Easy
4 versions - Latest release: 18 days ago - 34 downloads last month - 13 stars on GitHub - 1 maintainer
databricks-rocket 2.1.0
Keep your local python scripts installed and in sync with a databricks notebook. Shortens the fee...
48 versions - Latest release: 4 months ago - 1 dependent repository - 1.46 thousand downloads last month - 13 stars on GitHub - 4 maintainers
azure-databricks-sdk-python 0.0.2
A Python SDK for the Azure Databricks REST API 2.0.
4 versions - Latest release: over 3 years ago - 1 dependent repository - 1.61 thousand downloads last month - 13 stars on GitHub - 1 maintainer
pulumi-databricks 1.40.0
A Pulumi package for creating and managing databricks cloud resources.
250 versions - Latest release: 6 days ago - 1 dependent package - 2 dependent repositories - 17.2 thousand downloads last month - 12 stars on GitHub - 2 maintainers
databricks-labs-remorph 0.1.7
SQL code converter and data reconciliation tool for accelerating data onboarding to Databricks fro...
9 versions - Latest release: 13 days ago - 184 downloads last month - 11 stars on GitHub - 2 maintainers
great-assertions 0.0.75
Lightweight assertions inspired by the great-expectations library
72 versions - Latest release: over 2 years ago - 1 dependent repository - 469 downloads last month - 10 stars on GitHub - 2 maintainers
databricks-labs-blueprint 0.6.0
Common libraries for Databricks Labs
20 versions - Latest release: 9 days ago - 3 dependent packages - 26.3 thousand downloads last month - 10 stars on GitHub - 1 maintainer
dbl-waterbear 0.1.1
Automated provisioning of an industry Lakehouse with enterprise data model
2 versions - Latest release: about 2 years ago - 1 dependent repository - 12 downloads last month - 9 stars on GitHub - 1 maintainer
spooq 3.4.0
Spooq is a PySpark based helper library for ETL data ingestion pipeline in Data Lakes.
11 versions - Latest release: 2 months ago - 1 dependent repository - 19.2 thousand downloads last month - 8 stars on GitHub - 1 maintainer
atc-dataplatform 1.1.69
A common set of python libraries for DataBricks
90 versions - Latest release: about 1 year ago - 2 dependent packages - 1 dependent repository - 31.7 thousand downloads last month - 8 stars on GitHub - 1 maintainer
dbq 0.9.0
Run a query on Databricks
1 version - Latest release: over 4 years ago - 1 dependent repository - 20 downloads last month - 7 stars on GitHub - 1 maintainer
databricks-cdk 0.3.5
Deploying databricks resources from cdk
2 versions - Latest release: about 2 years ago - 1 dependent repository - 32 downloads last month - 7 stars on GitHub - 1 maintainer
azure-databricks-api 0.6.2
A wrapper for the Azure Databricks REST API
5 versions - Latest release: almost 4 years ago - 100 thousand downloads last month - 7 stars on GitHub - 1 maintainer
eleflow-spark-integrations 0.0.1a2 (removed)
The easy and quick way to connect and integrate the Spark project with many other data sources.
2 versions - Latest release: about 2 years ago - 6 stars on GitHub
clarifai-pyspark 0.0.4
Clarifai PySpark Python SDK
4 versions - Latest release: 4 months ago - 25 downloads last month - 6 stars on GitHub - 1 maintainer
lakehouse-engine 1.19.0
A Spark framework serving as the engine for several lakehouse algorithms and data flows.
9 versions - Latest release: 2 months ago - 254 downloads last month - 6 stars on GitHub - 1 maintainer
pyspark-connectors 0.2.0
The easy and quick way to connect and integrate the Spark project with many other data sources.
8 versions - Latest release: almost 2 years ago - 113 downloads last month - 5 stars on GitHub - 1 maintainer
exelog 0.0.1
Enabling meticulous logging for Spark Applications
1 version - Latest release: over 2 years ago - 1 dependent repository - 18 downloads last month - 5 stars on GitHub - 1 maintainer
dbutils-typehint 0.1.9
Provides type hints for dbutils in Data Bricks: https://docs.databricks.com/dev-tools/databricks-...
8 versions - Latest release: almost 4 years ago - 1 dependent repository - 73.1 thousand downloads last month - 5 stars on GitHub - 1 maintainer
databricks-labs-pylint 0.4.0
Plugin for PyLint to support Databricks specific code patterns and best practices.
6 versions - Latest release: 25 days ago - 1.15 thousand downloads last month - 5 stars on GitHub - 2 maintainers
dtflw 0.6.7
dtflw is a Python framework for building modular data pipelines based on Databricks dbutils.noteb...
7 versions - Latest release: 7 months ago - 1.97 thousand downloads last month - 4 stars on GitHub - 2 maintainers
hermione-databricks 1.0.7
Tool to create ML project structure inside the databricks framework
18 versions - Latest release: over 3 years ago - 1 dependent repository - 167 downloads last month - 4 stars on GitHub - 1 maintainer
fastdbfs 0.5
Interactive command line client for Databricks DBFS
4 versions - Latest release: about 3 years ago - 1 dependent repository - 27 downloads last month - 4 stars on GitHub - 1 maintainer
tuberia 0.0.1
Tuberia... when data engineering meets software engineering
2 versions - Latest release: over 1 year ago - 1 dependent repository - 23 downloads last month - 3 stars on GitHub - 1 maintainer
fish-databricks-jobs 0.7.18
cli and sdk to manage Jobs in Databricks
34 versions - Latest release: 3 months ago - 200 downloads last month - 3 stars on GitHub - 1 maintainer
databricks-aws-utils 1.5.1
Databricks AWS Utils
16 versions - Latest release: about 1 month ago - 1 dependent repository - 284 downloads last month - 3 stars on GitHub - 1 maintainer
databricks-sdk-python 0.0.3 (removed)
Object-based Databricks SDK
4 versions - Latest release: about 1 year ago - 46 downloads last month - 3 stars on GitHub - 1 maintainer
databricks-labs-lsql 0.4.3
Lightweight stateless SQL execution for Databricks with minimal dependencies
14 versions - Latest release: 13 days ago - 2 dependent packages - 25.8 thousand downloads last month - 2 stars on GitHub - 2 maintainers
astro-providers-databricks 0.1.0a1 (removed)
Affordable Databricks Workflows in Apache Airflow
1 version - Latest release: about 1 year ago - 74 downloads last month - 2 stars on GitHub - 3 maintainers
cdktf-cdktf-provider-databricks 13.10.0
Prebuilt databricks Provider for Terraform CDK (cdktf)
224 versions - Latest release: 5 days ago - 1 dependent repository - 4.44 thousand downloads last month - 2 stars on GitHub - 1 maintainer
dbloy 0.3.0
Continuous Delivery tool for PySpark Notebooks based jobs on Databricks.
1 version - Latest release: over 4 years ago - 1 dependent repository - 13 downloads last month - 1 star on GitHub - 1 maintainer
axerflow 0.0.4
Axerflow: An ML Workflow Tool
1 version - Latest release: almost 4 years ago - 16 downloads last month - 1 star on GitHub - 1 maintainer
fugue-cloudprovider 0.2.0
A collection of utils for Fugue to run on cloud providers
12 versions - Latest release: over 1 year ago - 74 downloads last month - 1 star on GitHub - 1 maintainer
atc-dataplatform-tools 0.1.26
A common set of python libraries for DataBricks, supplement to atc-dataplatform
26 versions - Latest release: about 1 year ago - 1 dependent repository - 221 downloads last month - 1 star on GitHub - 1 maintainer
databricks-utils 0.0.7
Ease-of-use utility tools for databricks notebooks.
6 versions - Latest release: almost 6 years ago - 1 dependent repository - 296 thousand downloads last month - 1 star on GitHub - 1 maintainer
databricks-cicd 0.1.16
CICD tool for testing and deploying to Databricks
16 versions - Latest release: about 1 year ago - 1 dependent repository - 154 downloads last month - 1 star on GitHub - 1 maintainer
mlflow-cratedb 2.12.2
MLflow adapter for CrateDB
11 versions - Latest release: 3 days ago - 181 downloads last month - 1 star on GitHub - 3 maintainers
databrickstools 0.3.2
A simple commandline application to manage databricks resources.
8 versions - Latest release: about 4 years ago - 1 dependent repository - 37.3 thousand downloads last month - 1 star on GitHub - 1 maintainer
pfore-cloud-utilities 0.0.0.dev6
Provides utility functions for cloud-based workflows.
6 versions - Latest release: about 2 months ago - 111 downloads last month - 0 stars on GitHub - 1 maintainer
runeatest 0.27.33
nunit test report generator to run in DataBricks
30 versions - Latest release: about 3 years ago - 1 dependent repository - 126 downloads last month - 0 stars on GitHub - 1 maintainer
databricks-filestore-uploader 0.1.0
A quick filetree uploader for the databricks filestore, local to cloud.
1 version - Latest release: over 1 year ago - 20 downloads last month - 0 stars on GitHub - 1 maintainer
tactivos-databricks-cicd 0.1.16
CICD tool for testing and deploying to Databricks
1 version - Latest release: 10 months ago - 19 downloads last month - 0 stars on GitHub - 1 maintainer
spetlr-tools 0.1.65
Supplements to the python SPark ETL libRary (SPETLR) for Databricks.
31 versions - Latest release: 3 months ago - 1 dependent repository - 8.07 thousand downloads last month - 0 stars on GitHub - 1 maintainer
pipeline-deploy 0.3.5
A deployment tool for ETL data pipelines.
2 versions - Latest release: about 2 years ago - 1 dependent repository - 19 downloads last month - 0 stars on GitHub - 2 maintainers
azutils 0.1.2
Utilities for Azure
3 versions - Latest release: over 3 years ago - 62 downloads last month - 0 stars on GitHub - 1 maintainer
dltctl 0.2
A command line interface for Databricks Delta Live Tables
2 versions - Latest release: about 1 year ago - 23 downloads last month - 0 stars on GitHub - 1 maintainer
readdatabrickstables 0.1.1
Databricks connectors to read tables
11 versions - Latest release: 2 months ago - 86 downloads last month - 1 maintainer
etlassist 0.0.3
etlassist is a python package supporting ETL operations
3 versions - Latest release: 5 months ago - 24 downloads last month - 1 maintainer
databricks_test_helper 1.0.0
A testing helper for Apache Spark Databricks notebooks
1 version - Latest release: almost 8 years ago - 2 dependent repositories - 114 downloads last month - 1 maintainer
datacorecommon 0.5.0
Wrapper functions for PySpark
27 versions - Latest release: 3 months ago - 1 dependent repository - 85.7 thousand downloads last month - 3 maintainers
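
Each entry above ends with a summary line of " - "-separated statistics, with optional fields simply left out. For readers who want to work with these figures, here is a minimal Python sketch that turns one such line into a dictionary. The function name `parse_stats` and the dictionary keys are assumptions made for this illustration; they are not part of the Ecosyste.ms API, and the sketch only covers the wording that appears in this listing.

```python
import re
from decimal import Decimal

# Spelled-out magnitudes used in the listing ("7.13 million", "859 thousand").
_SCALE = {"thousand": 1_000, "million": 1_000_000}

# Labels seen in the listing, mapped to hypothetical dictionary keys.
_FIELDS = {
    "version": "versions",
    "versions": "versions",
    "dependent package": "dependent_packages",
    "dependent packages": "dependent_packages",
    "dependent repository": "dependent_repositories",
    "dependent repositories": "dependent_repositories",
    "downloads last month": "downloads_last_month",
    "star on GitHub": "stars",
    "stars on GitHub": "stars",
    "maintainer": "maintainers",
    "maintainers": "maintainers",
}

# A number, an optional magnitude word, then the label.
_PART = re.compile(
    r"(?P<num>\d[\d,.]*)(?: (?P<scale>thousand|million))? (?P<label>.+)"
)


def parse_stats(line: str) -> dict:
    """Parse one summary line into a dict; fields absent from the line are omitted."""
    stats = {}
    for part in line.strip().split(" - "):
        if part.startswith("Latest release: "):
            # Kept as free text ("15 days ago"); no date arithmetic is attempted.
            stats["latest_release"] = part[len("Latest release: "):]
            continue
        match = _PART.fullmatch(part)
        if match and match["label"] in _FIELDS:
            scale = _SCALE.get(match["scale"], 1)
            number = Decimal(match["num"].replace(",", ""))
            stats[_FIELDS[match["label"]]] = int(number * scale)
    return stats


example = (
    "90 versions - Latest release: 15 days ago - 6 dependent packages - "
    "23 dependent repositories - 7.13 million downloads last month - "
    "34,343 stars on GitHub - 4 maintainers"
)
print(parse_stats(example))
```

Download counts are rounded on the page ("7.13 million"), so the parsed integers are approximations, and a missing key (for example no "stars" entry) means the page did not show that figure rather than that it is zero.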