Ecosyste.ms: Packages

An open API service providing package, version, and dependency metadata for many open source software ecosystems and registries.

pypi.org "parquet" keyword

kartothek 5.3.0
A consistent table management library in python
41 versions - Latest release: over 2 years ago - 1 dependent repository - 440 downloads last month - 161 stars on GitHub - 6 maintainers
grai_source_mssql 0.1.3
21 versions - Latest release: 3 months ago - 288 downloads last month - 270 stars on GitHub - 4 maintainers
expressionable 1.2
A tool for managing large datasets
3 versions - Latest release: about 5 years ago - 2 dependent repositories - 37 downloads last month - 2 stars on GitHub - 4 maintainers
grai_schemas 0.2.11
61 versions - Latest release: 6 months ago - 1.29 thousand downloads last month - 270 stars on GitHub - 4 maintainers
grai_source_mysql 0.1.1
19 versions - Latest release: 8 months ago - 290 downloads last month - 270 stars on GitHub - 4 maintainers
grai_source_fivetran 0.1.2
18 versions - Latest release: 8 months ago - 283 downloads last month - 270 stars on GitHub - 4 maintainers
lonboard 0.9.1
Python library for fast, interactive geospatial vector data visualization in Jupyter.
28 versions - Latest release: 3 days ago - 1 dependent package - 1 dependent repository - 2.83 thousand downloads last month - 432 stars on GitHub - 4 maintainers
expressionable-cli 1.0.0
A command-line tool for transforming large data sets
3 versions - Latest release: about 5 years ago - 1 dependent repository - 28 downloads last month - 2 stars on GitHub - 4 maintainers
grai_source_dbt_cloud 0.1.5
18 versions - Latest release: 3 months ago - 300 downloads last month - 270 stars on GitHub - 4 maintainers
grai_source_looker 0.0.3
11 versions - Latest release: 8 months ago - 244 downloads last month - 270 stars on GitHub - 4 maintainers
Top 9.9% on pypi.org
grai-graph 0.2.5
24 versions - Latest release: 8 months ago - 2 dependent packages - 1 dependent repository - 319 downloads last month - 270 stars on GitHub - 4 maintainers
grai_source_redshift 0.1.1
18 versions - Latest release: 8 months ago - 289 downloads last month - 270 stars on GitHub - 4 maintainers
grai-source-openlineage 0.1.0a1
1 version - Latest release: 7 months ago - 173 downloads last month - 270 stars on GitHub - 4 maintainers
Top 9.6% on pypi.org
grai-source-postgres 0.2.4
30 versions - Latest release: 6 months ago - 1 dependent package - 1 dependent repository - 434 downloads last month - 270 stars on GitHub - 4 maintainers
the-guide 0.1.36
1 version - Latest release: 9 months ago - 15 downloads last month - 270 stars on GitHub - 4 maintainers
grai_source_bigquery 0.2.4
29 versions - Latest release: 8 months ago - 395 downloads last month - 270 stars on GitHub - 4 maintainers
grai_source_snowflake 0.1.2
29 versions - Latest release: 8 months ago - 409 downloads last month - 270 stars on GitHub - 4 maintainers
Top 6.1% on pypi.org
grai-client 0.3.5
61 versions - Latest release: 6 months ago - 18 dependent packages - 3 dependent repositories - 775 downloads last month - 270 stars on GitHub - 4 maintainers
grai_source_flat_file 0.2.2
18 versions - Latest release: 5 months ago - 280 downloads last month - 270 stars on GitHub - 4 maintainers
grai-cli 0.2.6
19 versions - Latest release: 7 months ago - 166 downloads last month - 270 stars on GitHub - 4 maintainers
hybridbackend-tf115-cpu 1.0.0
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster
10 versions - Latest release: 10 months ago - 1 dependent repository - 42 downloads last month - 144 stars on GitHub - 3 maintainers
hops-petastorm 0.9.4
Petastorm is a library enabling the use of Parquet storage from Tensorflow, Pytorch, and other Py...
6 versions - Latest release: over 3 years ago - 1 dependent repository - 93 downloads last month - 1,752 stars on GitHub - 3 maintainers
microdrill 0.0.3
Simple Apache Drill alternative using PySpark
3 versions - Latest release: about 8 years ago - 2 dependent repositories - 13 downloads last month - 7 stars on GitHub - 3 maintainers
pyinflux3 0.9.2
Community Python client for InfluxDB IOx
24 versions - Latest release: 12 months ago - 127 downloads last month - 44 stars on GitHub - 2 maintainers
pyarrowfs-adlgen2 0.2.4
Use pyarrow with Azure Data Lake gen2
10 versions - Latest release: about 1 year ago - 2 dependent repositories - 136 thousand downloads last month - 23 stars on GitHub - 2 maintainers
Top 5.3% on pypi.org
parquet-tools 0.2.16
Easy install parquet-tools
21 versions - Latest release: about 2 months ago - 1 dependent package - 3 dependent repositories - 60.1 thousand downloads last month - 139 stars on GitHub - 2 maintainers
bids2table 0.1.0
Efficiently index large-scale BIDS datasets and derivatives
3 versions - Latest release: 9 days ago - 1 dependent repository - 410 downloads last month - 11 stars on GitHub - 2 maintainers
pyinflux3-cli 0.9.2
Community Python client for InfluxDB IOx (CLI)
9 versions - Latest release: 12 months ago - 1 dependent package - 59 downloads last month - 44 stars on GitHub - 2 maintainers
polario 0.3.1
Polars IO
11 versions - Latest release: 9 months ago - 125 downloads last month - 0 stars on GitHub - 2 maintainers
Top 7.4% on pypi.org
elasticsearch-loader 0.6.0
A pythonic tool for batch loading data files (json, parquet, csv, tsv) into ElasticSearch
35 versions - Latest release: over 4 years ago - 6 dependent repositories - 207 downloads last month - 395 stars on GitHub - 2 maintainers
csv2parquet 0.0.9
A tool to convert CSVs to Parquet files
10 versions - Latest release: over 4 years ago - 1 dependent repository - 781 downloads last month - 61 stars on GitHub - 2 maintainers
csvcli 1.0.2
A light-weight command-line tool to browse and query CSV, Excel and Apache Parquet files, regardl...
2 versions - Latest release: over 3 years ago - 1 dependent repository - 46 downloads last month - 3 stars on GitHub - 2 maintainers
perspective-parquet 0.1.1 💰
Parquet viewer for perspective in JupyterLab
2 versions - Latest release: about 1 year ago - 55 downloads last month - 30 stars on GitHub - 2 maintainers
parquet-dataset 0.0.1.dev4
Dataset for parquet group
3 versions - Latest release: about 2 years ago - 1 dependent repository - 18 downloads last month - 3 stars on GitHub - 2 maintainers
parquet-performance 0.0.2
Performance of parquet files using pandas
2 versions - Latest release: about 4 years ago - 1 dependent repository - 12 downloads last month - 0 stars on GitHub - 2 maintainers
shapeshifter 1.1.1
A tool for managing large datasets
5 versions - Latest release: about 5 years ago - 1 dependent repository - 61 downloads last month - 2 stars on GitHub - 2 maintainers
Top 8.2% on pypi.org
pystore 0.1.23 💰
Fast data store for Pandas timeseries data
35 versions - Latest release: about 2 years ago - 3 dependent repositories - 842 downloads last month - 536 stars on GitHub - 2 maintainers
parquet2lance 0.4.1
The Python wrapper for the Rust parquet2lance
19 versions - Latest release: 2 months ago - 111 downloads last month - 3,269 stars on GitHub - 2 maintainers
lazydf 0.220726.3
Hopefully safe and deterministic serializer to binary format, including Pandas data
8 versions - Latest release: over 1 year ago - 2 dependent repositories - 14 downloads last month - 1 star on GitHub - 2 maintainers
swemaps 0.1.0
Sweden in GeoParquet for easy usage.
1 version - Latest release: 4 days ago - 127 downloads last month - 0 stars on GitHub - 2 maintainers
ts-store 0.0.1
Flexible storage for time series.
1 version - Latest release: about 1 month ago - 76 downloads last month - 0 stars on GitHub - 2 maintainers
datasette-parquet 0.6.1
Read Parquet files in Datasette
7 versions - Latest release: 10 months ago - 230 downloads last month - 38 stars on GitHub - 2 maintainers
sep005-io-parquet 0.0.6
Parquet file read functions compliant with SDyPy SEP005
4 versions - Latest release: 3 months ago - 118 downloads last month - 2 maintainers
dbd 0.8.9
dbd is a data loading and transformation tool that enables data analysts and engineers to load an...
31 versions - Latest release: about 2 years ago - 1 dependent repository - 177 downloads last month - 51 stars on GitHub - 2 maintainers
csv-parakeet 1.1.1
Parquet to CSV command line tool
3 versions - Latest release: 7 months ago - 19 downloads last month - 0 stars on GitHub - 2 maintainers
elbow 0.1.1
Lift special-purpose data into common tabular formats for analytics 💪
5 versions - Latest release: 3 months ago - 1 dependent package - 1 dependent repository - 207 downloads last month - 0 stars on GitHub - 2 maintainers
atlas-db 0.2.11
turn apple health export.xml into parquet
4 versions - Latest release: 5 days ago - 408 downloads last month - 67 stars on GitHub - 2 maintainers
parquet-csv 0.2.0
Parquet from and to CSV format converter
5 versions - Latest release: almost 3 years ago - 1 dependent repository - 59 downloads last month - 2 stars on GitHub - 2 maintainers
atoti-aws 0.8.12
Plugin to load CSV and Parquet files from AWS S3 into Atoti tables
31 versions - Latest release: 8 days ago - 1 dependent package - 2 dependent repositories - 365 downloads last month - 214 stars on GitHub - 2 maintainers
procmondf 0.10
provides a convenient and efficient solution for capturing and analyzing system activity logs usi...
1 version - Latest release: 11 months ago - 21 downloads last month - 0 stars on GitHub - 2 maintainers
Top 9.1% on pypi.org
roapi 0.11.3
Create full-fledged APIs for slowly moving datasets without writing a single line of code.
9 versions - Latest release: 12 days ago - 1 dependent repository - 1.71 thousand downloads last month - 3,089 stars on GitHub - 2 maintainers
easy-s3 1.0.7
This package helps you use S3 easily.
9 versions - Latest release: over 3 years ago - 1 dependent repository - 59 downloads last month - 3 stars on GitHub - 2 maintainers
hybridbackend-tf115-cu116 0.7.0.dev1672506489
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster
5 versions - Latest release: over 1 year ago - 17 downloads last month - 144 stars on GitHub - 2 maintainers
parquet-to-hyper 1.1.4
Create and publish tableau hyper files from parquet files.
6 versions - Latest release: 6 months ago - 20 downloads last month - 0 stars on GitHub - 2 maintainers
typeddfs 0.16.5
Pandas DataFrame subclasses that enforce structure and can self-organize.
34 versions - Latest release: about 2 years ago - 1 dependent repository - 283 downloads last month - 8 stars on GitHub - 2 maintainers
pycobol2parquet 0.0.3
A Python library to convert COBOL ebcdic file to parquet format based on copybook
3 versions - Latest release: about 2 months ago - 29 downloads last month - 2 maintainers
vdf-io 0.1.246 💰
This library uses a universal format for vector datasets to easily export and import data from al...
96 versions - Latest release: 6 days ago - 1.63 thousand downloads last month - 118 stars on GitHub - 2 maintainers
lakeshack 0.2.3
Query parquet files using pyarrow or S3 Select by first gathering file metadata into a database
5 versions - Latest release: 6 months ago - 26 downloads last month - 1 star on GitHub - 2 maintainers
shapeshifter-cli 1.0.0
A command-line tool for transforming large data sets
4 versions - Latest release: about 5 years ago - 1 dependent repository - 15 downloads last month - 2 stars on GitHub - 2 maintainers
parquet-loader 0.0.4
Parquet file Load and Read from minio & S3
4 versions - Latest release: about 2 years ago - 1 dependent repository - 15 downloads last month - 3 stars on GitHub - 2 maintainers
overturemapsdownloader 0.1.9
Overture Maps Downloader simplifies geospatial data manipulation
6 versions - Latest release: 15 days ago - 108 downloads last month - 87 stars on GitHub - 2 maintainers
regallager 0.0.1
A consistent table management library in python
1 version - Latest release: almost 3 years ago - 1 dependent repository - 7 downloads last month - 161 stars on GitHub - 2 maintainers
scd2 1.0.0
slowly changing dimension type 2 with pandas or parquet
1 version - Latest release: over 1 year ago - 272 downloads last month - 2 stars on GitLab.com - 2 maintainers
atoti-azure 0.8.12
Plugin to load CSV and Parquet files from Azure Blob Storage into Atoti tables
31 versions - Latest release: 8 days ago - 1 dependent package - 1 dependent repository - 304 downloads last month - 214 stars on GitHub - 2 maintainers
hybridbackend-tf115-cu100 0.7.0.dev1666332077
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster
4 versions - Latest release: over 1 year ago - 3 downloads last month - 144 stars on GitHub - 2 maintainers
db2ixf 0.16.1
Parsing and processing of IBM eXchange format (IXF)
36 versions - Latest release: 2 months ago - 139 downloads last month - 14 stars on GitHub - 2 maintainers
hybridbackend-deeprec2212-cu114 0.8.0.dev1679289143
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster
1 version - Latest release: about 1 year ago - 8 downloads last month - 144 stars on GitHub - 2 maintainers
grai_source_dbt 0.3.5
42 versions - Latest release: 3 months ago - 514 downloads last month - 270 stars on GitHub - 2 maintainers
grai_source_cube 0.0.2
4 versions - Latest release: about 2 months ago - 198 downloads last month - 270 stars on GitHub - 2 maintainers
iterabledata 1.0.2
Iterable data processing Python library
1 version - Latest release: over 1 year ago - 7 downloads last month - 13 stars on GitHub - 2 maintainers
polars-partitions 0.1.2
Simplified work with partitions based on Polars library
3 versions - Latest release: 3 months ago - 22 downloads last month - 0 stars on GitHub - 2 maintainers
crazybin 1.0.0
Much better than hexbins plots! Use any kind of tile to visualize data and pictures!
1 version - Latest release: 7 days ago - 84 downloads last month - 0 stars on GitHub - 2 maintainers
parquet2csv 0.8
No judgment, but I looked this up on StackOverflow twice and decided not to have to look it up th...
1 version - Latest release: about 1 year ago - 19 downloads last month - 0 stars on GitHub - 2 maintainers
atoti-gcp 0.8.12
Plugin to load CSV and Parquet files from Google Cloud Storage into Atoti tables
31 versions - Latest release: 8 days ago - 1 dependent package - 1 dependent repository - 281 downloads last month - 214 stars on GitHub - 2 maintainers
data-toolset 0.1.7
Upgrade from avro-tools and parquet-tools jars to a more user-friendly Python package.
7 versions - Latest release: 7 months ago - 63 downloads last month - 1 star on GitHub - 2 maintainers
firespark 0.0.32
FireSpark data processing utility library
16 versions - Latest release: almost 4 years ago - 1 dependent repository - 140 downloads last month - 1,752 stars on GitHub - 2 maintainers
Top 2.3% on pypi.org
petastorm 0.12.1
Petastorm is a library enabling the use of Parquet storage from Tensorflow, Pytorch, and other Py...
86 versions - Latest release: over 1 year ago - 4 dependent packages - 26 dependent repositories - 36.9 thousand downloads last month - 1,752 stars on GitHub - 2 maintainers
aws-parquet 0.5.0
An object-oriented interface for defining parquet datasets for AWS built on top of awswrangler an...
5 versions - Latest release: 11 months ago - 28 downloads last month - 3 stars on GitHub - 2 maintainers
expanse 0.2.4
turn apple health export.xml into parquet
4 versions - Latest release: 15 days ago - 626 downloads last month - 67 stars on GitHub - 2 maintainers
joinem 0.1.5
CLI for fast, flexible concatenation of tabular data using polars.
2 versions - Latest release: 3 months ago - 1.44 thousand downloads last month - 3 stars on GitHub - 2 maintainers
dfcsv2parquet 0.10
converts large CSV files into smaller, Pandas-compatible Parquet files
1 version - Latest release: 11 months ago - 21 downloads last month - 0 stars on GitHub - 2 maintainers
Top 9.6% on pypi.org
influxdb3-python 0.4.0
Community Python client for InfluxDB 3.0
24 versions - Latest release: 23 days ago - 4 dependent packages - 1 dependent repository - 10.8 thousand downloads last month - 18 stars on GitHub - 2 maintainers
csvtoparquetlib 0.1.4
2 versions - Latest release: over 5 years ago - 23 downloads last month - 2 maintainers
pdf2dataset 0.5.3
Easily convert a subdirectory with big volume of PDF documents into a dataset, supports extractin...
15 versions - Latest release: over 3 years ago - 1 dependent repository - 158 downloads last month - 17 stars on GitHub - 2 maintainers
csvtoparquet 0.1.5
4 versions - Latest release: over 5 years ago - 60 downloads last month - 2 maintainers
hybridbackend-cpu-legacy 0.5.4
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster
5 versions - Latest release: almost 2 years ago - 1 dependent repository - 25 downloads last month - 144 stars on GitHub - 2 maintainers
roapi-http 0.6.0
Create full-fledged APIs for slowly moving datasets without writing a single line of code.
17 versions - Latest release: about 2 years ago - 1 dependent repository - 427 downloads last month - 3,089 stars on GitHub - 1 maintainer
Top 3.2% on pypi.org
awkward0 0.15.5
Manipulate arrays of complex data structures as easily as Numpy.
6 versions - Latest release: over 3 years ago - 5 dependent packages - 15 dependent repositories - 14.9 thousand downloads last month - 217 stars on GitHub - 1 maintainer
quilt-stack-installer 1.0.0
Quilt Data installation tool
2 versions - Latest release: over 4 years ago - 1 dependent repository - 8 downloads last month - 1,311 stars on GitHub - 1 maintainer
risk-command-center 1.0.37
Risk Command Center, manage your risk easily.
2 versions - Latest release: almost 2 years ago - 1 dependent repository - 10 downloads last month - 11 stars on GitHub - 1 maintainer
awkward-numba 0.14.0
Allows awkward arrays to be used in Numba-compiled code and optimizes awkward methods with JIT co...
68 versions - Latest release: over 3 years ago - 1 dependent repository - 427 downloads last month - 216 stars on GitHub - 1 maintainer
catalystcoop.pudl-catalog 2022.11.30 💰
A catalog of open data related to the US energy system.
4 versions - Latest release: over 1 year ago - 51 downloads last month - 9 stars on GitHub - 1 maintainer
parquet-metadata 0.0.1
A tool to show metadata about a Parquet file
1 version - Latest release: over 5 years ago - 1 dependent repository - 91.8 thousand downloads last month - 11 stars on GitHub - 1 maintainer
Top 6.5% on pypi.org
kglab 0.6.6 💰
A simple abstraction layer in Python for building knowledge graphs
26 versions - Latest release: over 1 year ago - 1 dependent package - 3 dependent repositories - 441 downloads last month - 561 stars on GitHub - 1 maintainer
unicov 0.0.1
Universal File conversion python library to convert any file to any type of file
1 version - Latest release: over 1 year ago - 15 downloads last month - 1 star on GitHub - 1 maintainer
ddump 0.2.0
A data dump tool
8 versions - Latest release: 10 days ago - 1 dependent repository - 55 downloads last month - 11 stars on GitHub - 1 maintainer
columnq-cli 0.5.2
Create full-fledged APIs for slowly moving datasets without writing a single line of code.
9 versions - Latest release: 12 days ago - 1 dependent repository - 533 downloads last month - 3,089 stars on GitHub - 1 maintainer
hybridbackend-tf115-cu114 0.8.0.dev1679539959
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster
24 versions - Latest release: about 1 year ago - 80 downloads last month - 144 stars on GitHub - 1 maintainer
arrowdantic 0.2.3
Arrow, pydantic style
8 versions - Latest release: over 1 year ago - 1 dependent repository - 393 downloads last month - 74 stars on GitHub - 1 maintainer
safeserializer 0.230202.1
Hopefully safe and deterministic serializer to binary format, including Pandas data
1 version - Latest release: over 1 year ago - 2 dependent packages - 1 dependent repository - 19 downloads last month - 1 star on GitHub - 1 maintainer