Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

Top 1.2% on pypi.org
Top 0.4% downloads on pypi.org
Top 0.4% dependent packages on pypi.org
Top 1.0% dependent repos on pypi.org
Top 3.0% forks on pypi.org
Top 0.7% docker downloads on pypi.org

pypi.org : deltalake

Native Delta Lake Python binding based on delta-rs with Pandas integration

Registry - Source - Documentation - JSON
purl: pkg:pypi/deltalake
Keywords: deltalake, delta, datalake, pandas, arrow, databricks, delta-lake, pandas-dataframe, python, rust
License: Apache-2.0
Latest release: 12 days ago
First release: over 3 years ago
Dependent packages: 47
Dependent repositories: 235
Downloads: 2,889,243 last month
Stars: 1,844 on GitHub
Forks: 357 on GitHub
Docker dependents: 36
Docker downloads: 6,787,353
Total Commits: 734
Committers: 84
Average commits per author: 8.738
Development Distribution Score (DDS): 0.866
More commit stats: commits.ecosyste.ms
See more repository details: repos.ecosyste.ms
Last synced: 3 days ago

Top 1.5% on pypi.org
unstructured 0.14.0
A library that prepares raw documents for downstream ML tasks.
135 versions - Latest release: 6 days ago - 113 dependent packages - 3,374 dependent repositories - 1.16 million downloads last month - 4,064 stars on GitHub - 1 maintainer
superlinked 3.46.0
The Superlinked vector computing library
65 versions - Latest release: 6 days ago - 3.39 thousand downloads last month - 148 stars on GitHub - 1 maintainer
dagster-polars 0.23.6
Dagster integration library for Polars
56 versions - Latest release: 7 days ago - 7.01 thousand downloads last month - 10,410 stars on GitHub - 2 maintainers
dagster-deltalake 0.23.6
Package for Deltalake-specific Dagster framework op and resource components.
33 versions - Latest release: 7 days ago - 2 dependent packages - 2.54 thousand downloads last month - 9,191 stars on GitHub - 1 maintainer
odbc2deltalake 0.11.8
41 versions - Latest release: 8 days ago - 1.63 thousand downloads last month - 1 maintainer
adapta 2.10.3
Logging, data connectors, monitoring, secret handling and general lifehacks to make data people l...
74 versions - Latest release: 8 days ago - 1.08 thousand downloads last month - 8 stars on GitHub - 1 maintainer
Top 7.2% on pypi.org
asyncdb 2.7.6 💰
Library for Asynchronous data source connections Collection of asyncio drivers.
177 versions - Latest release: 8 days ago - 5 dependent packages - 8 dependent repositories - 3.44 thousand downloads last month - 34 stars on GitHub - 2 maintainers
Top 7.5% on pypi.org
polars-u64-idx 0.20.26 💰
Blazingly fast DataFrame library
198 versions - Latest release: 9 days ago - 1 dependent repositories - 8.39 thousand downloads last month - 26,301 stars on GitHub - 2 maintainers
Top 5.9% on pypi.org
polars-lts-cpu 0.20.26 💰
Blazingly fast DataFrame library
151 versions - Latest release: 9 days ago - 10 dependent packages - 1 dependent repositories - 115 thousand downloads last month - 22,355 stars on GitHub - 2 maintainers
Top 0.5% on pypi.org
polars 0.20.26 💰
Blazingly fast DataFrame library
353 versions - Latest release: 9 days ago - 571 dependent packages - 947 dependent repositories - 6.99 million downloads last month - 26,301 stars on GitHub - 2 maintainers
Top 9.7% on pypi.org
uptrain 0.7.1
UpTrain - tool to evaluate LLM applications on aspects like factual accuracy, response quality, r...
49 versions - Latest release: 9 days ago - 2 dependent packages - 1 dependent repositories - 5.18 thousand downloads last month - 2,017 stars on GitHub - 2 maintainers
turntable-spoonbill 10.0.0
Productivity-centric Python Big Data Framework
5 versions - Latest release: 10 days ago - 239 downloads last month - 4,327 stars on GitHub - 1 maintainer
Top 3.8% on pypi.org
tecton 0.9.3
Tecton Python SDK
512 versions - Latest release: 13 days ago - 5 dependent packages - 3 dependent repositories - 825 thousand downloads last month - 6 maintainers
Top 0.8% on pypi.org
acryl-datahub 0.13.2
A CLI to work with DataHub metadata
676 versions - Latest release: 15 days ago - 10 dependent packages - 65 dependent repositories - 1.06 million downloads last month - 9,169 stars on GitHub - 3 maintainers
deltalake2db 0.3.3
Native polars deltalake reader
18 versions - Latest release: 15 days ago - 3 dependent packages - 658 downloads last month - 7 stars on GitHub - 1 maintainer
Top 4.2% on pypi.org
getdaft 0.2.24
Distributed Dataframes for Multimodal Data
71 versions - Latest release: 15 days ago - 5 dependent packages - 3 dependent repositories - 14.2 thousand downloads last month - 1,774 stars on GitHub - 1 maintainer
Top 0.2% on pypi.org
apache-airflow 2.9.1
Programmatically author, schedule and monitor data pipelines
193 versions - Latest release: 17 days ago - 314 dependent packages - 1,554 dependent repositories - 22.3 million downloads last month - 34,751 stars on GitHub - 1 maintainer
bmsdna-sql-utils 0.12.7
10 versions - Latest release: 20 days ago - 821 downloads last month - 1 maintainer
Top 1.4% on pypi.org
ibis-framework 9.0.0
The portable Python dataframe library
92 versions - Latest release: 23 days ago - 25 dependent packages - 130 dependent repositories - 189 thousand downloads last month - 3,333 stars on GitHub - 6 maintainers
Top 1.2% on pypi.org
awswrangler 3.7.3
Pandas on AWS.
151 versions - Latest release: about 1 month ago - 44 dependent packages - 185 dependent repositories - 45 million downloads last month - 3,704 stars on GitHub - 5 maintainers
bmsdna-lakeapi 0.19.12
API for distributing Data Lake Data
98 versions - Latest release: about 1 month ago - 473 downloads last month - 6 stars on GitHub - 2 maintainers
kukur 0.1.20
Kukur makes time series data and metadata available to the Apache Arrow ecosystem.
49 versions - Latest release: about 1 month ago - 1 dependent package - 1 dependent repositories - 891 downloads last month - 2 maintainers
Top 3.7% on pypi.org
kedro-datasets 3.0.0
Kedro-Datasets is where you can find all of Kedro's data connectors.
28 versions - Latest release: about 1 month ago - 13 dependent packages - 46 dependent repositories - 136 thousand downloads last month - 81 stars on GitHub - 1 maintainer
cloud-arrow 0.8.1
Python library to provide an Unified cloud storage API for reading and writing parquet and delta...
7 versions - Latest release: about 2 months ago - 59 downloads last month - 0 stars on GitHub - 2 maintainers
lakescum 0.1.3
A Python pacakge to help Databricks Unity Catalog users to read and query Delta Lake tables with ...
3 versions - Latest release: about 2 months ago - 114 downloads last month - 13 stars on GitHub - 1 maintainer
db2ixf 0.16.1
Parsing and processing of IBM eXchange format (IXF)
36 versions - Latest release: 2 months ago - 238 downloads last month - 14 stars on GitHub - 1 maintainer
polars-cli 0.7.0 💰
CLI interface for running SQL queries with Polars as backend
6 versions - Latest release: 3 months ago - 1 dependent repositories - 202 downloads last month - 112 stars on GitHub - 2 maintainers
parq-inspector 0.2.1
Parquet viewer for your terminal.
3 versions - Latest release: 3 months ago - 23 downloads last month - 0 stars on GitHub - 1 maintainer
openark 0.0.17
OpenARK Python Client
17 versions - Latest release: 3 months ago - 41 downloads last month - 1 stars on GitHub - 1 maintainer
faker-cli 0.5.0
Command-line fake data generator
9 versions - Latest release: 4 months ago - 67 downloads last month - 1 maintainer
datasaurus 0.0.2.dev4
Data Engineering framework based on Polars.rs
5 versions - Latest release: 5 months ago - 48 downloads last month - 14 stars on GitHub - 1 maintainer
deltalake-redis-lock 0.0.1a11
deltalake-redis-lock
10 versions - Latest release: 6 months ago - 121 downloads last month - 2 stars on GitHub - 1 maintainer
nba-dataloader 1.0.5
A python client to query the various stats.nba.com resources
6 versions - Latest release: 7 months ago - 59 downloads last month - 0 stars on GitHub - 1 maintainer
deltatorch 0.0.3
DeltaTorch allows loading training data from DeltaLake tables for training Deep Learning models ...
3 versions - Latest release: 7 months ago - 79 downloads last month - 44 stars on GitHub - 3 maintainers
vcsc-data-common 0.2.5
My package description
50 versions - Latest release: 8 months ago - 90 downloads last month - 1 maintainer
cdpdev-datahub 0.10.5a0
A CLI to work with DataHub metadata
1 version - Latest release: 8 months ago - 40 downloads last month - 9,169 stars on GitHub - 1 maintainer
dask-deltatable 0.3.1
Dask + Delta Table
4 versions - Latest release: 10 months ago - 1 dependent repositories - 665 downloads last month - 5 maintainers
target-s3-delta 0.0.25
`target-s3-delta` is a Singer target for s3-delta, built with the Meltano Singer SDK.
25 versions - Latest release: 10 months ago - 176 downloads last month - 2 maintainers
levi 0.3.0
Delta Lake helper methods
3 versions - Latest release: 11 months ago - 1 dependent repositories - 30 downloads last month - 1 maintainer
etl-jobs 0.1.13
14 versions - Latest release: 12 months ago - 75 downloads last month - 1 maintainer
deltadask 0.2.0
Delta Lake powered by Dask
2 versions - Latest release: about 1 year ago - 23 downloads last month - 5 stars on GitHub - 1 maintainer
ez-transform 0.1.6
Analytics engineering for data lakes.
7 versions - Latest release: about 1 year ago - 36 downloads last month - 6 stars on GitHub - 1 maintainer
h10awswrnglr 2.20.0
Pandas on AWS.
1 version - Latest release: about 1 year ago - 15 downloads last month - 0 stars on GitHub - 1 maintainer
h10-awswrangler 2.20.3 removed
Pandas on AWS.
4 versions - Latest release: about 1 year ago - 1 maintainer
Top 7.5% on pypi.org
feathr 1.0.0
An Enterprise-Grade, High Performance Feature Store
22 versions - Latest release: about 1 year ago - 1 dependent repositories - 1.33 thousand downloads last month - 1,928 stars on GitHub - 1 maintainer
deltaray 0.2.0
Delta reader for the Ray open-source toolkit for building ML applications
2 versions - Latest release: about 1 year ago - 127 downloads last month - 34 stars on GitHub - 2 maintainers
dask-deltalake 0.0.1
Dask + Deltalake
1 version - Latest release: over 1 year ago - 3 downloads last month - 1 maintainer