Ecosyste.ms: Packages

An open API service providing package, version, and dependency metadata for many open source software ecosystems and registries.

pypi.org "parquet" keyword

vdf-io 0.1.246 💰
This library uses a universal format for vector datasets to easily export and import data from al...
96 versions - Latest release: 6 days ago - 1.63 thousand downloads last month - 118 stars on GitHub - 2 maintainers
Top 2.3% on pypi.org
petastorm 0.12.1
Petastorm is a library enabling the use of Parquet storage from Tensorflow, Pytorch, and other Py...
86 versions - Latest release: over 1 year ago - 4 dependent packages - 26 dependent repositories - 36.9 thousand downloads last month - 1,752 stars on GitHub - 2 maintainers
awkward-numba 0.14.0
Allows awkward arrays to be used in Numba-compiled code and optimizes awkward methods with JIT co...
68 versions - Latest release: over 3 years ago - 1 dependent repository - 427 downloads last month - 216 stars on GitHub - 1 maintainer
Top 2.2% on pypi.org
quilt3 5.4.0
Quilt: where data comes together
61 versions - Latest release: 5 months ago - 27 dependent packages - 44 dependent repositories - 47.8 thousand downloads last month - 1,311 stars on GitHub - 1 maintainer
Top 6.1% on pypi.org
grai-client 0.3.5
61 versions - Latest release: 6 months ago - 18 dependent packages - 3 dependent repositories - 775 downloads last month - 270 stars on GitHub - 4 maintainers
grai_schemas 0.2.11
61 versions - Latest release: 6 months ago - 1.29 thousand downloads last month - 270 stars on GitHub - 4 maintainers
Top 3.9% on pypi.org
quilt 2.9.15
Quilt is a data package manager
50 versions - Latest release: over 5 years ago - 6 dependent packages - 24 dependent repositories - 340 downloads last month - 1,311 stars on GitHub - 1 maintainer
grai_source_dbt 0.3.5
42 versions - Latest release: 3 months ago - 514 downloads last month - 270 stars on GitHub - 2 maintainers
kartothek 5.3.0
A consistent table management library in python
41 versions - Latest release: over 2 years ago - 1 dependent repository - 440 downloads last month - 161 stars on GitHub - 6 maintainers
db2ixf 0.16.1
Parsing and processing of IBM eXchange format (IXF)
36 versions - Latest release: 2 months ago - 139 downloads last month - 14 stars on GitHub - 2 maintainers
Top 8.2% on pypi.org
pystore 0.1.23 💰
Fast data store for Pandas timeseries data
35 versions - Latest release: about 2 years ago - 3 dependent repositories - 842 downloads last month - 536 stars on GitHub - 2 maintainers
Top 7.4% on pypi.org
elasticsearch-loader 0.6.0
A pythonic tool for batch loading data files (json, parquet, csv, tsv) into ElasticSearch
35 versions - Latest release: over 4 years ago - 6 dependent repositories - 207 downloads last month - 395 stars on GitHub - 2 maintainers
typeddfs 0.16.5
Pandas DataFrame subclasses that enforce structure and can self-organize.
34 versions - Latest release: about 2 years ago - 1 dependent repository - 283 downloads last month - 8 stars on GitHub - 2 maintainers
atoti-aws 0.8.12
Plugin to load CSV and Parquet files from AWS S3 into Atoti tables
31 versions - Latest release: 9 days ago - 1 dependent package - 2 dependent repositories - 365 downloads last month - 214 stars on GitHub - 2 maintainers
atoti-azure 0.8.12
Plugin to load CSV and Parquet files from Azure Blob Storage into Atoti tables
31 versions - Latest release: 9 days ago - 1 dependent package - 1 dependent repository - 304 downloads last month - 214 stars on GitHub - 2 maintainers
atoti-gcp 0.8.12
Plugin to load CSV and Parquet files from Google Cloud Storage into Atoti tables
31 versions - Latest release: 9 days ago - 1 dependent package - 1 dependent repository - 281 downloads last month - 214 stars on GitHub - 2 maintainers
dbd 0.8.9
dbd is a data loading and transformation tool that enables data analysts and engineers to load an...
31 versions - Latest release: about 2 years ago - 1 dependent repository - 177 downloads last month - 51 stars on GitHub - 2 maintainers
Top 9.6% on pypi.org
grai-source-postgres 0.2.4
30 versions - Latest release: 6 months ago - 1 dependent package - 1 dependent repository - 434 downloads last month - 270 stars on GitHub - 4 maintainers
grai_source_snowflake 0.1.2
29 versions - Latest release: 8 months ago - 409 downloads last month - 270 stars on GitHub - 4 maintainers
grai_source_bigquery 0.2.4
29 versions - Latest release: 8 months ago - 395 downloads last month - 270 stars on GitHub - 4 maintainers
lonboard 0.9.1
Python library for fast, interactive geospatial vector data visualization in Jupyter.
28 versions - Latest release: 4 days ago - 1 dependent package - 1 dependent repository - 2.83 thousand downloads last month - 432 stars on GitHub - 4 maintainers
Top 6.5% on pypi.org
kglab 0.6.6 💰
A simple abstraction layer in Python for building knowledge graphs
26 versions - Latest release: over 1 year ago - 1 dependent package - 3 dependent repositories - 441 downloads last month - 561 stars on GitHub - 1 maintainer
pyinflux3 0.9.2
Community Python client for InfluxDB IOx
24 versions - Latest release: 12 months ago - 127 downloads last month - 44 stars on GitHub - 2 maintainers
hybridbackend-tf115-cu114 0.8.0.dev1679539959
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster
24 versions - Latest release: about 1 year ago - 80 downloads last month - 144 stars on GitHub - 1 maintainer
Top 9.9% on pypi.org
grai-graph 0.2.5
24 versions - Latest release: 8 months ago - 2 dependent packages - 1 dependent repository - 319 downloads last month - 270 stars on GitHub - 4 maintainers
Top 9.6% on pypi.org
influxdb3-python 0.4.0
Community Python client for InfluxDB 3.0
24 versions - Latest release: 24 days ago - 4 dependent packages - 1 dependent repository - 10.8 thousand downloads last month - 18 stars on GitHub - 2 maintainers
Top 5.3% on pypi.org
parquet-tools 0.2.16
Easy install parquet-tools
21 versions - Latest release: about 2 months ago - 1 dependent package - 3 dependent repositories - 60.1 thousand downloads last month - 139 stars on GitHub - 2 maintainers
grai_source_mssql 0.1.3
21 versions - Latest release: 3 months ago - 288 downloads last month - 270 stars on GitHub - 4 maintainers
parquet2lance 0.4.1
The Python wrapper for the Rust parquet2lance
19 versions - Latest release: 2 months ago - 113 downloads last month - 3,305 stars on GitHub - 2 maintainers
imctermite 2.0.16
Enables extraction of measurement data from binary files with extension 'raw' used by proprietary...
19 versions - Latest release: 9 months ago - 1 dependent repository - 820 downloads last month - 27 stars on GitHub - 1 maintainer
grai_source_mysql 0.1.1
19 versions - Latest release: 8 months ago - 290 downloads last month - 270 stars on GitHub - 4 maintainers
grai-cli 0.2.6
19 versions - Latest release: 7 months ago - 166 downloads last month - 270 stars on GitHub - 4 maintainers
grai_source_redshift 0.1.1
18 versions - Latest release: 8 months ago - 289 downloads last month - 270 stars on GitHub - 4 maintainers
grai_source_dbt_cloud 0.1.5
18 versions - Latest release: 3 months ago - 300 downloads last month - 270 stars on GitHub - 4 maintainers
grai_source_flat_file 0.2.2
18 versions - Latest release: 5 months ago - 280 downloads last month - 270 stars on GitHub - 4 maintainers
grai_source_fivetran 0.1.2
18 versions - Latest release: 8 months ago - 283 downloads last month - 270 stars on GitHub - 4 maintainers
graphique 1.6
GraphQL service for arrow tables and parquet data sets.
17 versions - Latest release: 10 days ago - 1 dependent repository - 422 downloads last month - 70 stars on GitHub - 1 maintainer
roapi-http 0.6.0
Create full-fledged APIs for slowly moving datasets without writing a single line of code.
17 versions - Latest release: about 2 years ago - 1 dependent repository - 427 downloads last month - 3,089 stars on GitHub - 1 maintainer
firespark 0.0.32
FireSpark data processing utility library
16 versions - Latest release: almost 4 years ago - 1 dependent repository - 140 downloads last month - 1,752 stars on GitHub - 2 maintainers
pdf2dataset 0.5.3
Easily convert a subdirectory with big volume of PDF documents into a dataset, supports extractin...
15 versions - Latest release: over 3 years ago - 1 dependent repository - 158 downloads last month - 17 stars on GitHub - 2 maintainers
esl-redis 0.6.0
elasticsearch_loader plugin for redis
12 versions - Latest release: over 4 years ago - 1 dependent repository - 23 downloads last month - 395 stars on GitHub - 1 maintainer
polario 0.3.1
Polars IO
11 versions - Latest release: 9 months ago - 125 downloads last month - 0 stars on GitHub - 2 maintainers
grai_source_looker 0.0.3
11 versions - Latest release: 8 months ago - 244 downloads last month - 270 stars on GitHub - 4 maintainers
csv2parquet 0.0.9
A tool to convert CSVs to Parquet files
10 versions - Latest release: over 4 years ago - 1 dependent repository - 781 downloads last month - 61 stars on GitHub - 2 maintainers
hybridbackend-tf115-cpu 1.0.0
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster
10 versions - Latest release: 10 months ago - 1 dependent repository - 42 downloads last month - 144 stars on GitHub - 3 maintainers
esl-s3 0.6.0
elasticsearch_loader plugin for AWS s3
10 versions - Latest release: over 4 years ago - 1 dependent repository - 8 downloads last month - 395 stars on GitHub - 1 maintainer
pyarrowfs-adlgen2 0.2.4
Use pyarrow with Azure Data Lake gen2
10 versions - Latest release: about 1 year ago - 2 dependent repositories - 136 thousand downloads last month - 23 stars on GitHub - 2 maintainers
columnq-cli 0.5.2
Create full-fledged APIs for slowly moving datasets without writing a single line of code.
9 versions - Latest release: 12 days ago - 1 dependent repository - 533 downloads last month - 3,089 stars on GitHub - 1 maintainer
airflow-provider-xlsx 1.0.1
Airflow operators for reading and writing XLSX files
9 versions - Latest release: about 2 years ago - 1 dependent repository - 15.1 thousand downloads last month - 4 stars on GitHub - 1 maintainer
easy-s3 1.0.7
This package helps you use S3 easily.
9 versions - Latest release: over 3 years ago - 1 dependent repository - 59 downloads last month - 3 stars on GitHub - 2 maintainers
pyinflux3-cli 0.9.2
Community Python client for InfluxDB IOx (CLI)
9 versions - Latest release: 12 months ago - 1 dependent package - 59 downloads last month - 44 stars on GitHub - 2 maintainers
Top 9.1% on pypi.org
roapi 0.11.3
Create full-fledged APIs for slowly moving datasets without writing a single line of code.
9 versions - Latest release: 12 days ago - 1 dependent repository - 1.71 thousand downloads last month - 3,089 stars on GitHub - 2 maintainers
lazydf 0.220726.3
Hopefully safe and deterministic serializer to binary format, including Pandas data
8 versions - Latest release: over 1 year ago - 2 dependent repositories - 14 downloads last month - 1 star on GitHub - 2 maintainers
ddump 0.2.0
A data dump tool
8 versions - Latest release: 10 days ago - 1 dependent repository - 55 downloads last month - 11 stars on GitHub - 1 maintainer
hybridbackend-cpu 0.5.4 removed
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster
8 versions - Latest release: almost 2 years ago - 1 dependent repository - 105 stars on GitHub
arrowdantic 0.2.3
Arrow, pydantic style
8 versions - Latest release: over 1 year ago - 1 dependent repository - 393 downloads last month - 74 stars on GitHub - 1 maintainer
datasette-parquet 0.6.1
Read Parquet files in Datasette
7 versions - Latest release: 10 months ago - 230 downloads last month - 38 stars on GitHub - 2 maintainers
data-toolset 0.1.7
Upgrade from avro-tools and parquet-tools jars to a more user-friendly Python package.
7 versions - Latest release: 7 months ago - 63 downloads last month - 1 star on GitHub - 2 maintainers
overturemapsdownloader 0.1.9
Overture Maps Downloader simplifies geospatial data manipulation
6 versions - Latest release: 15 days ago - 108 downloads last month - 87 stars on GitHub - 2 maintainers
Top 3.2% on pypi.org
awkward0 0.15.5
Manipulate arrays of complex data structures as easily as Numpy.
6 versions - Latest release: over 3 years ago - 5 dependent packages - 15 dependent repositories - 14.9 thousand downloads last month - 217 stars on GitHub - 1 maintainer
hops-petastorm 0.9.4
Petastorm is a library enabling the use of Parquet storage from Tensorflow, Pytorch, and other Py...
6 versions - Latest release: over 3 years ago - 1 dependent repository - 93 downloads last month - 1,752 stars on GitHub - 3 maintainers
hybridbackend-cpu-legacy-nightly 0.5.2.post1.dev2160826157 removed
Efficient training of deep recommenders on cloud.
6 versions - Latest release: about 2 years ago - 105 stars on GitHub
hybridbackend-cpu-nightly 0.5.3.post1.dev2180131484 removed
Efficient training of deep recommenders on cloud.
6 versions - Latest release: about 2 years ago - 105 stars on GitHub
parquet-to-hyper 1.1.4
Create and publish tableau hyper files from parquet files.
6 versions - Latest release: 6 months ago - 91 downloads last month - 0 stars on GitHub - 2 maintainers
featherplot 0.0.6
featherplot
6 versions - Latest release: 9 months ago - 4 downloads last month - 0 stars on GitHub - 1 maintainer
lakeshack 0.2.3
Query parquet files using pyarrow or S3 Select by first gathering file metadata into a database
5 versions - Latest release: 6 months ago - 26 downloads last month - 1 star on GitHub - 2 maintainers
pynock 1.2.1
A proposed standard `NOCK` for a Parquet format that supports efficient distributed serialization...
5 versions - Latest release: over 1 year ago - 56 downloads last month - 15 stars on GitHub - 1 maintainer
shapeshifter 1.1.1
A tool for managing large datasets
5 versions - Latest release: about 5 years ago - 1 dependent repository - 61 downloads last month - 2 stars on GitHub - 2 maintainers
hybridbackend-cpu-legacy 0.5.4
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster
5 versions - Latest release: almost 2 years ago - 1 dependent repository - 25 downloads last month - 144 stars on GitHub - 2 maintainers
cryo-python 0.3.0
cryo is the easiest way to extract blockchain data to parquet, csv, json, or a python dataframe.
5 versions - Latest release: 4 months ago - 145 downloads last month - 975 stars on GitHub - 1 maintainer
hybridbackend-tf115-cu116 0.7.0.dev1672506489
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster
5 versions - Latest release: over 1 year ago - 17 downloads last month - 144 stars on GitHub - 2 maintainers
elbow 0.1.1
Lift special-purpose data into common tabular formats for analytics 💪
5 versions - Latest release: 3 months ago - 1 dependent package - 1 dependent repository - 207 downloads last month - 0 stars on GitHub - 2 maintainers
parquet-csv 0.2.0
Parquet from and to CSV format converter
5 versions - Latest release: almost 3 years ago - 1 dependent repository - 59 downloads last month - 2 stars on GitHub - 2 maintainers
aws-parquet 0.5.0
An object-oriented interface for defining parquet datasets for AWS built on top of awswrangler an...
5 versions - Latest release: 11 months ago - 28 downloads last month - 3 stars on GitHub - 2 maintainers
sep005-io-parquet 0.0.6
Parquet file read functions compliant with SDyPy SEP005
4 versions - Latest release: 3 months ago - 118 downloads last month - 2 maintainers
catalystcoop.pudl-catalog 2022.11.30 💰
A catalog of open data related to the US energy system.
4 versions - Latest release: over 1 year ago - 51 downloads last month - 9 stars on GitHub - 1 maintainer
atlas-db 0.2.11
turn apple health export.xml into parquet
4 versions - Latest release: 6 days ago - 408 downloads last month - 67 stars on GitHub - 2 maintainers
hybridbackend-nightly 0.6.0a0.dev2182810798 removed
A High-Performance Framework for GPU-centric Training of Wide-and-deep Recommender Systems
4 versions - Latest release: about 2 years ago - 105 stars on GitHub
parquet-loader 0.0.4
Parquet file Load and Read from minio & S3
4 versions - Latest release: about 2 years ago - 1 dependent repository - 15 downloads last month - 3 stars on GitHub - 2 maintainers
hybridbackend-tf115-cu100 0.7.0.dev1666332077
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster
4 versions - Latest release: over 1 year ago - 3 downloads last month - 144 stars on GitHub - 2 maintainers
grai_source_cube 0.0.2
4 versions - Latest release: 2 months ago - 198 downloads last month - 270 stars on GitHub - 2 maintainers
quilt-installer 0.0.0a5
Quilt Data installation tool
4 versions - Latest release: over 4 years ago - 1 dependent repository - 28 downloads last month - 1,311 stars on GitHub - 1 maintainer
expanse 0.2.4
turn apple health export.xml into parquet
4 versions - Latest release: 15 days ago - 626 downloads last month - 67 stars on GitHub - 2 maintainers
shapeshifter-cli 1.0.0
A command-line tool for transforming large data sets
4 versions - Latest release: about 5 years ago - 1 dependent repository - 33 downloads last month - 2 stars on GitHub - 2 maintainers
csvtoparquet 0.1.5
4 versions - Latest release: over 5 years ago - 60 downloads last month - 2 maintainers
expressionable-cli 1.0.0
A command-line tool for transforming large data sets
3 versions - Latest release: about 5 years ago - 1 dependent repository - 28 downloads last month - 2 stars on GitHub - 4 maintainers
parquet-dataset 0.0.1.dev4
Dataset for parquet group
3 versions - Latest release: about 2 years ago - 1 dependent repository - 18 downloads last month - 3 stars on GitHub - 2 maintainers
microdrill 0.0.3
Simple Apache Drill alternative using PySpark
3 versions - Latest release: about 8 years ago - 2 dependent repositories - 13 downloads last month - 7 stars on GitHub - 3 maintainers
pycobol2parquet 0.0.3
A Python library to convert COBOL ebcdic file to parquet format based on copybook
3 versions - Latest release: about 2 months ago - 29 downloads last month - 2 maintainers
expressionable 1.2
A tool for managing large datasets
3 versions - Latest release: about 5 years ago - 2 dependent repositories - 37 downloads last month - 2 stars on GitHub - 4 maintainers
csv-parakeet 1.1.1
Parquet to CSV command line tool
3 versions - Latest release: 7 months ago - 19 downloads last month - 0 stars on GitHub - 2 maintainers
polars-partitions 0.1.2
Simplified work with partitions based on Polars library
3 versions - Latest release: 3 months ago - 22 downloads last month - 0 stars on GitHub - 2 maintainers
bids2table 0.1.0
Efficiently index large-scale BIDS datasets and derivatives
3 versions - Latest release: 10 days ago - 1 dependent repository - 410 downloads last month - 11 stars on GitHub - 2 maintainers
cryo 0.3.2
cryo is the easiest way to extract blockchain data to parquet, csv, json, or a python dataframe.
3 versions - Latest release: 4 months ago - 2 dependent repositories - 919 downloads last month - 975 stars on GitHub - 1 maintainer
hybridbackend-tf115-cu121 1.0.0
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster
2 versions - Latest release: 10 months ago - 6 downloads last month - 144 stars on GitHub - 1 maintainer
hybridbackend-cu114 0.6.0a2 removed
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster
2 versions - Latest release: about 2 years ago - 105 stars on GitHub
quilt-stack-installer 1.0.0
Quilt Data installation tool
2 versions - Latest release: over 4 years ago - 1 dependent repository - 8 downloads last month - 1,311 stars on GitHub - 1 maintainer
risk-command-center 1.0.37
Risk Command Center, manage your risk easily.
2 versions - Latest release: almost 2 years ago - 1 dependent repository - 10 downloads last month - 11 stars on GitHub - 1 maintainer
csvcli 1.0.2
A light-weight command-line tool to browse and query CSV, Excel and Apache Parquet files, regardl...
2 versions - Latest release: over 3 years ago - 1 dependent repository - 46 downloads last month - 3 stars on GitHub - 2 maintainers
hybridbackend-tf115-cu118 0.8.0.dev1678154818
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster
2 versions - Latest release: about 1 year ago - 13 downloads last month - 144 stars on GitHub - 1 maintainer