Ecosyste.ms: Packages
An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.
pypi.org "parquet" keyword
parquet2lance 0.4.1
The Python wrapper for the Rust parquet2lance19 versions - Latest release: 2 months ago - 113 downloads last month - 3,305 stars on GitHub - 2 maintainers
roapi-http 0.6.0
Create full-fledged APIs for slowly moving datasets without writing a single line of code.17 versions - Latest release: about 2 years ago - 1 dependent repositories - 427 downloads last month - 3,089 stars on GitHub - 1 maintainer
columnq-cli 0.5.2
Create full-fledged APIs for slowly moving datasets without writing a single line of code.9 versions - Latest release: 12 days ago - 1 dependent repositories - 533 downloads last month - 3,089 stars on GitHub - 1 maintainer
Top 9.1% on pypi.org
9 versions - Latest release: 12 days ago - 1 dependent repositories - 1.71 thousand downloads last month - 3,089 stars on GitHub - 2 maintainers
roapi 0.11.3
Create full-fledged APIs for slowly moving datasets without writing a single line of code.9 versions - Latest release: 12 days ago - 1 dependent repositories - 1.71 thousand downloads last month - 3,089 stars on GitHub - 2 maintainers
hops-petastorm 0.9.4
Petastorm is a library enabling the use of Parquet storage from Tensorflow, Pytorch, and other Py...6 versions - Latest release: over 3 years ago - 1 dependent repositories - 93 downloads last month - 1,752 stars on GitHub - 3 maintainers
firespark 0.0.32
FireSpark data processing utility library16 versions - Latest release: almost 4 years ago - 1 dependent repositories - 140 downloads last month - 1,752 stars on GitHub - 2 maintainers
Top 2.3% on pypi.org
86 versions - Latest release: over 1 year ago - 4 dependent packages - 26 dependent repositories - 36.9 thousand downloads last month - 1,752 stars on GitHub - 2 maintainers
petastorm 0.12.1
Petastorm is a library enabling the use of Parquet storage from Tensorflow, Pytorch, and other Py...86 versions - Latest release: over 1 year ago - 4 dependent packages - 26 dependent repositories - 36.9 thousand downloads last month - 1,752 stars on GitHub - 2 maintainers
Top 3.9% on pypi.org
50 versions - Latest release: over 5 years ago - 6 dependent packages - 24 dependent repositories - 340 downloads last month - 1,311 stars on GitHub - 1 maintainer
quilt 2.9.15
Quilt is a data package manager50 versions - Latest release: over 5 years ago - 6 dependent packages - 24 dependent repositories - 340 downloads last month - 1,311 stars on GitHub - 1 maintainer
Top 2.2% on pypi.org
61 versions - Latest release: 5 months ago - 27 dependent packages - 44 dependent repositories - 47.8 thousand downloads last month - 1,311 stars on GitHub - 1 maintainer
quilt3 5.4.0
Quilt: where data comes together61 versions - Latest release: 5 months ago - 27 dependent packages - 44 dependent repositories - 47.8 thousand downloads last month - 1,311 stars on GitHub - 1 maintainer
quilt-installer 0.0.0a5
Quilt Data installation tool4 versions - Latest release: over 4 years ago - 1 dependent repositories - 28 downloads last month - 1,311 stars on GitHub - 1 maintainer
quilt-stack-installer 1.0.0
Quilt Data installation tool2 versions - Latest release: over 4 years ago - 1 dependent repositories - 8 downloads last month - 1,311 stars on GitHub - 1 maintainer
cryo-python 0.3.0
cryo is the easiest way to extract blockchain data to parquet, csv, json, or a python dataframe.5 versions - Latest release: 4 months ago - 145 downloads last month - 975 stars on GitHub - 1 maintainer
cryo 0.3.2
cryo is the easiest way to extract blockchain data to parquet, csv, json, or a python dataframe.3 versions - Latest release: 4 months ago - 2 dependent repositories - 919 downloads last month - 975 stars on GitHub - 1 maintainer
Top 6.5% on pypi.org
26 versions - Latest release: over 1 year ago - 1 dependent package - 3 dependent repositories - 441 downloads last month - 561 stars on GitHub - 1 maintainer
kglab 0.6.6 💰
A simple abstraction layer in Python for building knowledge graphs26 versions - Latest release: over 1 year ago - 1 dependent package - 3 dependent repositories - 441 downloads last month - 561 stars on GitHub - 1 maintainer
Top 8.2% on pypi.org
35 versions - Latest release: about 2 years ago - 3 dependent repositories - 842 downloads last month - 536 stars on GitHub - 2 maintainers
pystore 0.1.23 💰
Fast data store for Pandas timeseries data35 versions - Latest release: about 2 years ago - 3 dependent repositories - 842 downloads last month - 536 stars on GitHub - 2 maintainers
lonboard 0.9.1
Python library for fast, interactive geospatial vector data visualization in Jupyter.28 versions - Latest release: 4 days ago - 1 dependent package - 1 dependent repositories - 2.83 thousand downloads last month - 432 stars on GitHub - 4 maintainers
esl-s3 0.6.0
elasticsearch_loader plugin for AWS s310 versions - Latest release: over 4 years ago - 1 dependent repositories - 8 downloads last month - 395 stars on GitHub - 1 maintainer
esl-redis 0.6.0
elasticsearch_loader plugin for redis12 versions - Latest release: over 4 years ago - 1 dependent repositories - 23 downloads last month - 395 stars on GitHub - 1 maintainer
Top 7.4% on pypi.org
35 versions - Latest release: over 4 years ago - 6 dependent repositories - 207 downloads last month - 395 stars on GitHub - 2 maintainers
elasticsearch-loader 0.6.0
A pythonic tool for batch loading data files (json, parquet, csv, tsv) into ElasticSearch35 versions - Latest release: over 4 years ago - 6 dependent repositories - 207 downloads last month - 395 stars on GitHub - 2 maintainers
Top 9.6% on pypi.org
grai-source-postgres 0.2.4
30 versions - Latest release: 6 months ago - 1 dependent package - 1 dependent repositories - 434 downloads last month - 270 stars on GitHub - 4 maintainersgrai_source_dbt 0.3.5
42 versions - Latest release: 3 months ago - 514 downloads last month - 270 stars on GitHub - 2 maintainersgrai-cli 0.2.6
19 versions - Latest release: 7 months ago - 166 downloads last month - 270 stars on GitHub - 4 maintainersgrai_source_mssql 0.1.3
21 versions - Latest release: 3 months ago - 288 downloads last month - 270 stars on GitHub - 4 maintainersgrai_source_redshift 0.1.1
18 versions - Latest release: 8 months ago - 289 downloads last month - 270 stars on GitHub - 4 maintainersgrai_source_cube 0.0.2
4 versions - Latest release: 2 months ago - 198 downloads last month - 270 stars on GitHub - 2 maintainersgrai-source-openlineage 0.1.0a1
1 version - Latest release: 7 months ago - 173 downloads last month - 270 stars on GitHub - 4 maintainersthe-guide 0.1.36
1 version - Latest release: 9 months ago - 15 downloads last month - 270 stars on GitHub - 4 maintainersgrai_source_bigquery 0.2.4
29 versions - Latest release: 8 months ago - 395 downloads last month - 270 stars on GitHub - 4 maintainersgrai_schemas 0.2.11
61 versions - Latest release: 6 months ago - 1.29 thousand downloads last month - 270 stars on GitHub - 4 maintainersgrai_source_snowflake 0.1.2
29 versions - Latest release: 8 months ago - 409 downloads last month - 270 stars on GitHub - 4 maintainersgrai_source_flat_file 0.2.2
18 versions - Latest release: 5 months ago - 280 downloads last month - 270 stars on GitHub - 4 maintainersgrai_source_mysql 0.1.1
19 versions - Latest release: 8 months ago - 290 downloads last month - 270 stars on GitHub - 4 maintainers
Top 6.1% on pypi.org
grai-client 0.3.5
61 versions - Latest release: 6 months ago - 18 dependent packages - 3 dependent repositories - 775 downloads last month - 270 stars on GitHub - 4 maintainersgrai_source_fivetran 0.1.2
18 versions - Latest release: 8 months ago - 283 downloads last month - 270 stars on GitHub - 4 maintainers
Top 9.9% on pypi.org
grai-graph 0.2.5
24 versions - Latest release: 8 months ago - 2 dependent packages - 1 dependent repositories - 319 downloads last month - 270 stars on GitHub - 4 maintainersgrai_source_looker 0.0.3
11 versions - Latest release: 8 months ago - 244 downloads last month - 270 stars on GitHub - 4 maintainersgrai_source_dbt_cloud 0.1.5
18 versions - Latest release: 3 months ago - 300 downloads last month - 270 stars on GitHub - 4 maintainers
Top 3.2% on pypi.org
6 versions - Latest release: over 3 years ago - 5 dependent packages - 15 dependent repositories - 14.9 thousand downloads last month - 217 stars on GitHub - 1 maintainer
awkward0 0.15.5
Manipulate arrays of complex data structures as easily as Numpy.6 versions - Latest release: over 3 years ago - 5 dependent packages - 15 dependent repositories - 14.9 thousand downloads last month - 217 stars on GitHub - 1 maintainer
awkward-numba 0.14.0
Allows awkward arrays to be used in Numba-compiled code and optimizes awkward methods with JIT co...68 versions - Latest release: over 3 years ago - 1 dependent repositories - 427 downloads last month - 216 stars on GitHub - 1 maintainer
atoti-azure 0.8.12
Plugin to load CSV and Parquet files from Azure Blob Storage into Atoti tables31 versions - Latest release: 9 days ago - 1 dependent package - 1 dependent repositories - 304 downloads last month - 214 stars on GitHub - 2 maintainers
atoti-gcp 0.8.12
Plugin to load CSV and Parquet files from Google Cloud Storage into Atoti tables31 versions - Latest release: 9 days ago - 1 dependent package - 1 dependent repositories - 281 downloads last month - 214 stars on GitHub - 2 maintainers
atoti-aws 0.8.12
Plugin to load CSV and Parquet files from AWS S3 into Atoti tables31 versions - Latest release: 9 days ago - 1 dependent package - 2 dependent repositories - 365 downloads last month - 214 stars on GitHub - 2 maintainers
kartothek 5.3.0
A consistent table management library in python41 versions - Latest release: over 2 years ago - 1 dependent repositories - 440 downloads last month - 161 stars on GitHub - 6 maintainers
regallager 0.0.1
A consistent table management library in python1 version - Latest release: almost 3 years ago - 1 dependent repositories - 7 downloads last month - 161 stars on GitHub - 2 maintainers
hybridbackend-tf115-cpu 1.0.0
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster10 versions - Latest release: 10 months ago - 1 dependent repositories - 42 downloads last month - 144 stars on GitHub - 3 maintainers
hybridbackend-tf115-cu121 1.0.0
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster2 versions - Latest release: 10 months ago - 6 downloads last month - 144 stars on GitHub - 1 maintainer
hybridbackend-deeprec2212-cu114 0.8.0.dev1679289143
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster1 version - Latest release: about 1 year ago - 8 downloads last month - 144 stars on GitHub - 2 maintainers
hybridbackend-tf115-cu118 0.8.0.dev1678154818
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster2 versions - Latest release: about 1 year ago - 13 downloads last month - 144 stars on GitHub - 1 maintainer
hybridbackend-cpu-legacy 0.5.4
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster5 versions - Latest release: almost 2 years ago - 1 dependent repositories - 25 downloads last month - 144 stars on GitHub - 2 maintainers
hybridbackend-deeprec2208-cu114 0.7.0.dev1672985131
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster1 version - Latest release: over 1 year ago - 8 downloads last month - 144 stars on GitHub - 1 maintainer
hybridbackend-tf115-cu114 0.8.0.dev1679539959
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster24 versions - Latest release: about 1 year ago - 80 downloads last month - 144 stars on GitHub - 1 maintainer
hybridbackend-tf115-cu100 0.7.0.dev1666332077
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster4 versions - Latest release: over 1 year ago - 3 downloads last month - 144 stars on GitHub - 2 maintainers
hybridbackend-tf115-cu116 0.7.0.dev1672506489
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster5 versions - Latest release: over 1 year ago - 17 downloads last month - 144 stars on GitHub - 2 maintainers
Top 5.3% on pypi.org
21 versions - Latest release: about 2 months ago - 1 dependent package - 3 dependent repositories - 60.1 thousand downloads last month - 139 stars on GitHub - 2 maintainers
parquet-tools 0.2.16
Easy install parquet-tools21 versions - Latest release: about 2 months ago - 1 dependent package - 3 dependent repositories - 60.1 thousand downloads last month - 139 stars on GitHub - 2 maintainers
vdf-io 0.1.246 💰
This library uses a universal format for vector datasets to easily export and import data from al...96 versions - Latest release: 6 days ago - 1.63 thousand downloads last month - 118 stars on GitHub - 2 maintainers
hybridbackend-nightly 0.6.0a0.dev2182810798 removed
A High-Performance Framework for GPU-centric Training of Wide-and-deep Recommender Systems4 versions - Latest release: about 2 years ago - 105 stars on GitHub
hybridbackend 0.5.2.post1 removed
Efficient training of deep recommenders on cloud.1 version - Latest release: over 2 years ago - 105 stars on GitHub
hybridbackend-cpu-nightly 0.5.3.post1.dev2180131484 removed
Efficient training of deep recommenders on cloud.6 versions - Latest release: about 2 years ago - 105 stars on GitHub
hybridbackend-cu114-tf115 0.6.1a0 removed
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster1 version - Latest release: almost 2 years ago - 105 stars on GitHub
hybridbackend-cu114 0.6.0a2 removed
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster2 versions - Latest release: about 2 years ago - 105 stars on GitHub
hybridbackend-cpu-legacy-nightly 0.5.2.post1.dev2160826157 removed
Efficient training of deep recommenders on cloud.6 versions - Latest release: about 2 years ago - 105 stars on GitHub
hybridbackend-cpu 0.5.4 removed
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster8 versions - Latest release: almost 2 years ago - 1 dependent repositories - 105 stars on GitHub
overturemapsdownloader 0.1.9
Overture Maps Downloader simplifies geospatial data manipulation6 versions - Latest release: 15 days ago - 108 downloads last month - 87 stars on GitHub - 2 maintainers
arrowdantic 0.2.3
Arrow, pydantic style8 versions - Latest release: over 1 year ago - 1 dependent repositories - 393 downloads last month - 74 stars on GitHub - 1 maintainer
graphique 1.6
GraphQL service for arrow tables and parquet data sets.17 versions - Latest release: 11 days ago - 1 dependent repositories - 422 downloads last month - 70 stars on GitHub - 1 maintainer
atlas-db 0.2.11
turn apple health export.xml into parquet4 versions - Latest release: 6 days ago - 646 downloads last month - 70 stars on GitHub - 2 maintainers
expanse 0.2.4
turn apple health export.xml into parquet4 versions - Latest release: 15 days ago - 626 downloads last month - 67 stars on GitHub - 2 maintainers
csv2parquet 0.0.9
A tool to convert CSVs to Parquet files10 versions - Latest release: over 4 years ago - 1 dependent repositories - 781 downloads last month - 61 stars on GitHub - 2 maintainers
dbd 0.8.9
dbd is a data loading and transformation tool that enables data analysts and engineers to load an...31 versions - Latest release: about 2 years ago - 1 dependent repositories - 177 downloads last month - 51 stars on GitHub - 2 maintainers
pyinflux3 0.9.2
Community Python client for InfluxDB IOx24 versions - Latest release: 12 months ago - 127 downloads last month - 44 stars on GitHub - 2 maintainers
pyinflux3-cli 0.9.2
Community Python client for InfluxDB IOx (CLI)9 versions - Latest release: 12 months ago - 1 dependent package - 59 downloads last month - 44 stars on GitHub - 2 maintainers
datasette-parquet 0.6.1
Read Parquet files in Datasette7 versions - Latest release: 10 months ago - 230 downloads last month - 38 stars on GitHub - 2 maintainers
perspective-parquet 0.1.1 💰
Parquet viewer for perspective in JupyterLab2 versions - Latest release: about 1 year ago - 55 downloads last month - 30 stars on GitHub - 2 maintainers
imctermite 2.0.16
Enables extraction of measurement data from binary files with extension 'raw' used by proprietary...19 versions - Latest release: 9 months ago - 1 dependent repositories - 820 downloads last month - 27 stars on GitHub - 1 maintainer
pyarrowfs-adlgen2 0.2.4
Use pyarrow with Azure Data Lake gen210 versions - Latest release: about 1 year ago - 2 dependent repositories - 136 thousand downloads last month - 23 stars on GitHub - 2 maintainers
Top 9.6% on pypi.org
24 versions - Latest release: 24 days ago - 4 dependent packages - 1 dependent repositories - 10.8 thousand downloads last month - 18 stars on GitHub - 2 maintainers
influxdb3-python 0.4.0
Community Python client for InfluxDB 3.024 versions - Latest release: 24 days ago - 4 dependent packages - 1 dependent repositories - 10.8 thousand downloads last month - 18 stars on GitHub - 2 maintainers
pdf2dataset 0.5.3
Easily convert a subdirectory with big volume of PDF documents into a dataset, supports extractin...15 versions - Latest release: over 3 years ago - 1 dependent repositories - 158 downloads last month - 17 stars on GitHub - 2 maintainers
pynock 1.2.1
A proposed standard `NOCK` for a Parquet format that supports efficient distributed serialization...5 versions - Latest release: over 1 year ago - 56 downloads last month - 15 stars on GitHub - 1 maintainer
db2ixf 0.16.1
Parsing and processing of IBM eXchange format (IXF)36 versions - Latest release: 2 months ago - 139 downloads last month - 14 stars on GitHub - 2 maintainers
iterabledata 1.0.2
Iterable data processing Python library1 version - Latest release: over 1 year ago - 7 downloads last month - 13 stars on GitHub - 2 maintainers
analytics-command-center 3.0.14
Command Center for Data Ingestion, Advanced Analytics and Artificial Intelligence process1 version - Latest release: over 2 years ago - 26 downloads last month - 11 stars on GitHub - 1 maintainer
risk-command-center 1.0.37
Risk Command Center, manage your risk easly.2 versions - Latest release: almost 2 years ago - 1 dependent repositories - 10 downloads last month - 11 stars on GitHub - 1 maintainer
bids2table 0.1.0
Efficiently index large-scale BIDS datasets and derivatives3 versions - Latest release: 10 days ago - 1 dependent repositories - 410 downloads last month - 11 stars on GitHub - 2 maintainers
parquet-metadata 0.0.1
A tool to show metadata about a Parquet file1 version - Latest release: over 5 years ago - 1 dependent repositories - 91.8 thousand downloads last month - 11 stars on GitHub - 1 maintainer
ddump 0.2.0
A data dump tool8 versions - Latest release: 10 days ago - 1 dependent repositories - 55 downloads last month - 11 stars on GitHub - 1 maintainer
catalystcoop.pudl-catalog 2022.11.30 💰
A catalog of open data related to the US energy system.4 versions - Latest release: over 1 year ago - 51 downloads last month - 9 stars on GitHub - 1 maintainer
typeddfs 0.16.5
Pandas DataFrame subclasses that enforce structure and can self-organize.34 versions - Latest release: about 2 years ago - 1 dependent repositories - 283 downloads last month - 8 stars on GitHub - 2 maintainers
microdrill 0.0.3
Simple Apache Drill alternative using PySpark3 versions - Latest release: about 8 years ago - 2 dependent repositories - 13 downloads last month - 7 stars on GitHub - 3 maintainers
aporia-importer 1.0.6
Import data from cloud storage to Aporia1 version - Latest release: almost 3 years ago - 1 dependent repositories - 5 downloads last month - 7 stars on GitHub - 1 maintainer
airflow-provider-xlsx 1.0.1
Airflow operators for reading and writing XLSX files9 versions - Latest release: about 2 years ago - 1 dependent repositories - 15.1 thousand downloads last month - 4 stars on GitHub - 1 maintainer
csvcli 1.0.2
A light-weight command-line tool to browse and query CSV, Excel and Apache Parquet files, regardl...2 versions - Latest release: over 3 years ago - 1 dependent repositories - 46 downloads last month - 3 stars on GitHub - 2 maintainers
joinem 0.1.5
CLI for fast, flexbile concatenation of tabular data using polars.2 versions - Latest release: 3 months ago - 1.44 thousand downloads last month - 3 stars on GitHub - 2 maintainers
parquet-dataset 0.0.1.dev4
Dataset for parquet group3 versions - Latest release: about 2 years ago - 1 dependent repositories - 18 downloads last month - 3 stars on GitHub - 2 maintainers
parquet-loader 0.0.4
Parquet file Load and Read from minio & S34 versions - Latest release: about 2 years ago - 1 dependent repositories - 15 downloads last month - 3 stars on GitHub - 2 maintainers
easy-s3 1.0.7
This package helps you use S3 easily.9 versions - Latest release: over 3 years ago - 1 dependent repositories - 59 downloads last month - 3 stars on GitHub - 2 maintainers
aws-parquet 0.5.0
An object-oriented interface for defining parquet datasets for AWS built on top of awswrangler an...5 versions - Latest release: 11 months ago - 28 downloads last month - 3 stars on GitHub - 2 maintainers
shapeshifter-cli 1.0.0
A command-line tool for transforming large data sets4 versions - Latest release: about 5 years ago - 1 dependent repositories - 33 downloads last month - 2 stars on GitHub - 2 maintainers
shapeshifter 1.1.1
A tool for managing large datasets5 versions - Latest release: about 5 years ago - 1 dependent repositories - 61 downloads last month - 2 stars on GitHub - 2 maintainers
expressionable 1.2
A tool for managing large datasets3 versions - Latest release: about 5 years ago - 2 dependent repositories - 37 downloads last month - 2 stars on GitHub - 4 maintainers
parquet-csv 0.2.0
Parquet from and to CSV format converter5 versions - Latest release: almost 3 years ago - 1 dependent repositories - 59 downloads last month - 2 stars on GitHub - 2 maintainers
Related Keywords
python
44
data
25
data-science
22
csv
20
deep-learning
20
mysql
19
postgresql
19
hacktoberfest
19
redshift
19
snowflake
19
open-source
18
mssql
18
fivetran
18
django
18
dbt
18
datalineage
18
dataengineering
18
data-lineage
18
gpu
17
arrow
17
hybrid-parallelism
16
recommender-system
16
json
11
pandas
11
pyarrow
11
deep learning
10
recommendation system
9
python3
8
serialization
8
sql
8
hdf5
7
deep
7
apache-arrow
7
learning
7
recommendation
7
system
7
data-visualization
7
data-engineering
7
s3
6
rust
6
columnar
5
database
5
etl
5
cloud
5
dataframe
5
msgpack
4
apache
4
apache-parquet
4
pickle
4
excel
4
machine-learning
4
salmon
4
stata
4
tabular-data
4
transcription
4
polars
4
pytorch
4
pyspark
4
parquet-files
4
dask
4
transforming-files
4
tsv
4
pivot-tables
4
arff
4
data-version-control
4
analytics
4
merge
4
kallisto
4
gene-expression
4
data-versioning
4
gct
4
graphql
4
filter
4
elastic
3
blob-storage
3
IBM
3
charts
3
cube
3
multidimensional-analysis
3
olap
3
elasticsearch
3
elasticsearch-loader
3
atoti
3
duckdb
3
load
3
logstash
3
table
3
source
3
rest-api
3
static-datasets
3
query-frontends
3
sysml
3
tensorflow
3
dataset
3
query
3
in-memory-database
3
hive
3
delta-lake
3
datasets
3
influxdb
3