Ecosyste.ms: Packages
An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.
pypi.org "parquet" keyword
airflow-provider-xlsx 1.0.1
Airflow operators for reading and writing XLSX files9 versions - Latest release: about 2 years ago - 1 dependent repositories - 15.1 thousand downloads last month - 4 stars on GitHub - 1 maintainer
analytics-command-center 3.0.14
Command Center for Data Ingestion, Advanced Analytics and Artificial Intelligence process1 version - Latest release: over 2 years ago - 26 downloads last month - 11 stars on GitHub - 1 maintainer
aporia-importer 1.0.6
Import data from cloud storage to Aporia1 version - Latest release: almost 3 years ago - 1 dependent repositories - 5 downloads last month - 7 stars on GitHub - 1 maintainer
arrowdantic 0.2.3
Arrow, pydantic style8 versions - Latest release: over 1 year ago - 1 dependent repositories - 393 downloads last month - 74 stars on GitHub - 1 maintainer
atlas-db 0.2.11
turn apple health export.xml into parquet4 versions - Latest release: 5 days ago - 408 downloads last month - 67 stars on GitHub - 2 maintainers
atoti-aws 0.8.12
Plugin to load CSV and Parquet files from AWS S3 into Atoti tables31 versions - Latest release: 8 days ago - 1 dependent package - 2 dependent repositories - 365 downloads last month - 214 stars on GitHub - 2 maintainers
atoti-azure 0.8.12
Plugin to load CSV and Parquet files from Azure Blob Storage into Atoti tables31 versions - Latest release: 8 days ago - 1 dependent package - 1 dependent repositories - 304 downloads last month - 214 stars on GitHub - 2 maintainers
atoti-gcp 0.8.12
Plugin to load CSV and Parquet files from Google Cloud Storage into Atoti tables31 versions - Latest release: 8 days ago - 1 dependent package - 1 dependent repositories - 281 downloads last month - 214 stars on GitHub - 2 maintainers
Top 3.2% on pypi.org
6 versions - Latest release: over 3 years ago - 5 dependent packages - 15 dependent repositories - 14.9 thousand downloads last month - 217 stars on GitHub - 1 maintainer
awkward0 0.15.5
Manipulate arrays of complex data structures as easily as Numpy.6 versions - Latest release: over 3 years ago - 5 dependent packages - 15 dependent repositories - 14.9 thousand downloads last month - 217 stars on GitHub - 1 maintainer
awkward-numba 0.14.0
Allows awkward arrays to be used in Numba-compiled code and optimizes awkward methods with JIT co...68 versions - Latest release: over 3 years ago - 1 dependent repositories - 427 downloads last month - 216 stars on GitHub - 1 maintainer
aws-parquet 0.5.0
An object-oriented interface for defining parquet datasets for AWS built on top of awswrangler an...5 versions - Latest release: 11 months ago - 28 downloads last month - 3 stars on GitHub - 2 maintainers
bids2table 0.1.0
Efficiently index large-scale BIDS datasets and derivatives3 versions - Latest release: 9 days ago - 1 dependent repositories - 410 downloads last month - 11 stars on GitHub - 2 maintainers
catalystcoop.pudl-catalog 2022.11.30 💰
A catalog of open data related to the US energy system.4 versions - Latest release: over 1 year ago - 51 downloads last month - 9 stars on GitHub - 1 maintainer
columnq-cli 0.5.2
Create full-fledged APIs for slowly moving datasets without writing a single line of code.9 versions - Latest release: 12 days ago - 1 dependent repositories - 533 downloads last month - 3,089 stars on GitHub - 1 maintainer
crazybin 1.0.0
Much better than hexbins plots! Use any kind of tile to visualize data and pictures!1 version - Latest release: 7 days ago - 84 downloads last month - 0 stars on GitHub - 2 maintainers
cryo 0.3.2
cryo is the easiest way to extract blockchain data to parquet, csv, json, or a python dataframe.3 versions - Latest release: 4 months ago - 2 dependent repositories - 919 downloads last month - 975 stars on GitHub - 1 maintainer
cryo-python 0.3.0
cryo is the easiest way to extract blockchain data to parquet, csv, json, or a python dataframe.5 versions - Latest release: 4 months ago - 145 downloads last month - 975 stars on GitHub - 1 maintainer
csv2parquet 0.0.9
A tool to convert CSVs to Parquet files10 versions - Latest release: over 4 years ago - 1 dependent repositories - 781 downloads last month - 61 stars on GitHub - 2 maintainers
csvcli 1.0.2
A light-weight command-line tool to browse and query CSV, Excel and Apache Parquet files, regardl...2 versions - Latest release: over 3 years ago - 1 dependent repositories - 46 downloads last month - 3 stars on GitHub - 2 maintainers
csv-parakeet 1.1.1
Parquet to CSV command line tool3 versions - Latest release: 7 months ago - 19 downloads last month - 0 stars on GitHub - 2 maintainers
csvtoparquet 0.1.5
4 versions - Latest release: over 5 years ago - 60 downloads last month - 2 maintainerscsvtoparquetlib 0.1.4
2 versions - Latest release: over 5 years ago - 23 downloads last month - 2 maintainersdatasette-parquet 0.6.1
Read Parquet files in Datasette7 versions - Latest release: 10 months ago - 230 downloads last month - 38 stars on GitHub - 2 maintainers
data-toolset 0.1.7
Upgrade from avro-tools and parquet-tools jars to a more user-friendly Python package.7 versions - Latest release: 7 months ago - 63 downloads last month - 1 stars on GitHub - 2 maintainers
db2ixf 0.16.1
Parsing and processing of IBM eXchange format (IXF)36 versions - Latest release: 2 months ago - 139 downloads last month - 14 stars on GitHub - 2 maintainers
dbd 0.8.9
dbd is a data loading and transformation tool that enables data analysts and engineers to load an...31 versions - Latest release: about 2 years ago - 1 dependent repositories - 177 downloads last month - 51 stars on GitHub - 2 maintainers
ddump 0.2.0
A data dump tool8 versions - Latest release: 10 days ago - 1 dependent repositories - 55 downloads last month - 11 stars on GitHub - 1 maintainer
dfcsv2parquet 0.10
converts large CSV files into smaller, Pandas-compatible Parquet files1 version - Latest release: 11 months ago - 21 downloads last month - 0 stars on GitHub - 2 maintainers
easy-s3 1.0.7
This package helps you use S3 easily.9 versions - Latest release: over 3 years ago - 1 dependent repositories - 59 downloads last month - 3 stars on GitHub - 2 maintainers
Top 7.4% on pypi.org
35 versions - Latest release: over 4 years ago - 6 dependent repositories - 207 downloads last month - 395 stars on GitHub - 2 maintainers
elasticsearch-loader 0.6.0
A pythonic tool for batch loading data files (json, parquet, csv, tsv) into ElasticSearch35 versions - Latest release: over 4 years ago - 6 dependent repositories - 207 downloads last month - 395 stars on GitHub - 2 maintainers
elbow 0.1.1
Lift special-purpose data into common tabular formats for analytics 💪5 versions - Latest release: 3 months ago - 1 dependent package - 1 dependent repositories - 207 downloads last month - 0 stars on GitHub - 2 maintainers
esl-redis 0.6.0
elasticsearch_loader plugin for redis12 versions - Latest release: over 4 years ago - 1 dependent repositories - 23 downloads last month - 395 stars on GitHub - 1 maintainer
esl-s3 0.6.0
elasticsearch_loader plugin for AWS s310 versions - Latest release: over 4 years ago - 1 dependent repositories - 8 downloads last month - 395 stars on GitHub - 1 maintainer
expanse 0.2.4
turn apple health export.xml into parquet4 versions - Latest release: 15 days ago - 626 downloads last month - 67 stars on GitHub - 2 maintainers
expressionable 1.2
A tool for managing large datasets3 versions - Latest release: about 5 years ago - 2 dependent repositories - 37 downloads last month - 2 stars on GitHub - 4 maintainers
expressionable-cli 1.0.0
A command-line tool for transforming large data sets3 versions - Latest release: about 5 years ago - 1 dependent repositories - 28 downloads last month - 2 stars on GitHub - 4 maintainers
featherplot 0.0.6
featherplot6 versions - Latest release: 9 months ago - 4 downloads last month - 0 stars on GitHub - 1 maintainer
firespark 0.0.32
FireSpark data processing utility library16 versions - Latest release: almost 4 years ago - 1 dependent repositories - 140 downloads last month - 1,752 stars on GitHub - 2 maintainers
grai-cli 0.2.6
19 versions - Latest release: 7 months ago - 166 downloads last month - 270 stars on GitHub - 4 maintainers
Top 6.1% on pypi.org
grai-client 0.3.5
61 versions - Latest release: 6 months ago - 18 dependent packages - 3 dependent repositories - 775 downloads last month - 270 stars on GitHub - 4 maintainers
Top 9.9% on pypi.org
grai-graph 0.2.5
24 versions - Latest release: 8 months ago - 2 dependent packages - 1 dependent repositories - 319 downloads last month - 270 stars on GitHub - 4 maintainersgrai_schemas 0.2.11
61 versions - Latest release: 6 months ago - 1.29 thousand downloads last month - 270 stars on GitHub - 4 maintainersgrai_source_bigquery 0.2.4
29 versions - Latest release: 8 months ago - 395 downloads last month - 270 stars on GitHub - 4 maintainersgrai_source_cube 0.0.2
4 versions - Latest release: about 2 months ago - 198 downloads last month - 270 stars on GitHub - 2 maintainersgrai_source_dbt 0.3.5
42 versions - Latest release: 3 months ago - 514 downloads last month - 270 stars on GitHub - 2 maintainersgrai_source_dbt_cloud 0.1.5
18 versions - Latest release: 3 months ago - 300 downloads last month - 270 stars on GitHub - 4 maintainersgrai_source_fivetran 0.1.2
18 versions - Latest release: 8 months ago - 283 downloads last month - 270 stars on GitHub - 4 maintainersgrai_source_flat_file 0.2.2
18 versions - Latest release: 5 months ago - 280 downloads last month - 270 stars on GitHub - 4 maintainersgrai_source_looker 0.0.3
11 versions - Latest release: 8 months ago - 244 downloads last month - 270 stars on GitHub - 4 maintainersgrai_source_mssql 0.1.3
21 versions - Latest release: 3 months ago - 288 downloads last month - 270 stars on GitHub - 4 maintainersgrai_source_mysql 0.1.1
19 versions - Latest release: 8 months ago - 290 downloads last month - 270 stars on GitHub - 4 maintainersgrai-source-openlineage 0.1.0a1
1 version - Latest release: 7 months ago - 173 downloads last month - 270 stars on GitHub - 4 maintainers
Top 9.6% on pypi.org
grai-source-postgres 0.2.4
30 versions - Latest release: 6 months ago - 1 dependent package - 1 dependent repositories - 434 downloads last month - 270 stars on GitHub - 4 maintainersgrai_source_redshift 0.1.1
18 versions - Latest release: 8 months ago - 289 downloads last month - 270 stars on GitHub - 4 maintainersgrai_source_snowflake 0.1.2
29 versions - Latest release: 8 months ago - 409 downloads last month - 270 stars on GitHub - 4 maintainersgraphique 1.6
GraphQL service for arrow tables and parquet data sets.17 versions - Latest release: 10 days ago - 1 dependent repositories - 422 downloads last month - 70 stars on GitHub - 1 maintainer
hops-petastorm 0.9.4
Petastorm is a library enabling the use of Parquet storage from Tensorflow, Pytorch, and other Py...6 versions - Latest release: over 3 years ago - 1 dependent repositories - 93 downloads last month - 1,752 stars on GitHub - 3 maintainers
hybridbackend 0.5.2.post1 removed
Efficient training of deep recommenders on cloud.1 version - Latest release: over 2 years ago - 105 stars on GitHub
hybridbackend-cpu 0.5.4 removed
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster8 versions - Latest release: almost 2 years ago - 1 dependent repositories - 105 stars on GitHub
hybridbackend-cpu-legacy 0.5.4
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster5 versions - Latest release: almost 2 years ago - 1 dependent repositories - 25 downloads last month - 144 stars on GitHub - 2 maintainers
hybridbackend-cpu-legacy-nightly 0.5.2.post1.dev2160826157 removed
Efficient training of deep recommenders on cloud.6 versions - Latest release: about 2 years ago - 105 stars on GitHub
hybridbackend-cpu-nightly 0.5.3.post1.dev2180131484 removed
Efficient training of deep recommenders on cloud.6 versions - Latest release: about 2 years ago - 105 stars on GitHub
hybridbackend-cu114 0.6.0a2 removed
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster2 versions - Latest release: about 2 years ago - 105 stars on GitHub
hybridbackend-cu114-tf115 0.6.1a0 removed
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster1 version - Latest release: almost 2 years ago - 105 stars on GitHub
hybridbackend-deeprec2208-cu114 0.7.0.dev1672985131
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster1 version - Latest release: over 1 year ago - 8 downloads last month - 144 stars on GitHub - 1 maintainer
hybridbackend-deeprec2212-cu114 0.8.0.dev1679289143
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster1 version - Latest release: about 1 year ago - 8 downloads last month - 144 stars on GitHub - 2 maintainers
hybridbackend-nightly 0.6.0a0.dev2182810798 removed
A High-Performance Framework for GPU-centric Training of Wide-and-deep Recommender Systems4 versions - Latest release: about 2 years ago - 105 stars on GitHub
hybridbackend-tf115-cpu 1.0.0
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster10 versions - Latest release: 10 months ago - 1 dependent repositories - 42 downloads last month - 144 stars on GitHub - 3 maintainers
hybridbackend-tf115-cu100 0.7.0.dev1666332077
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster4 versions - Latest release: over 1 year ago - 3 downloads last month - 144 stars on GitHub - 2 maintainers
hybridbackend-tf115-cu114 0.8.0.dev1679539959
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster24 versions - Latest release: about 1 year ago - 80 downloads last month - 144 stars on GitHub - 1 maintainer
hybridbackend-tf115-cu116 0.7.0.dev1672506489
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster5 versions - Latest release: over 1 year ago - 17 downloads last month - 144 stars on GitHub - 2 maintainers
hybridbackend-tf115-cu118 0.8.0.dev1678154818
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster2 versions - Latest release: about 1 year ago - 13 downloads last month - 144 stars on GitHub - 1 maintainer
hybridbackend-tf115-cu121 1.0.0
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster2 versions - Latest release: 10 months ago - 6 downloads last month - 144 stars on GitHub - 1 maintainer
imctermite 2.0.16
Enables extraction of measurement data from binary files with extension 'raw' used by proprietary...19 versions - Latest release: 9 months ago - 1 dependent repositories - 820 downloads last month - 27 stars on GitHub - 1 maintainer
Top 9.6% on pypi.org
24 versions - Latest release: 23 days ago - 4 dependent packages - 1 dependent repositories - 10.8 thousand downloads last month - 18 stars on GitHub - 2 maintainers
influxdb3-python 0.4.0
Community Python client for InfluxDB 3.024 versions - Latest release: 23 days ago - 4 dependent packages - 1 dependent repositories - 10.8 thousand downloads last month - 18 stars on GitHub - 2 maintainers
iterabledata 1.0.2
Iterable data processing Python library1 version - Latest release: over 1 year ago - 7 downloads last month - 13 stars on GitHub - 2 maintainers
joinem 0.1.5
CLI for fast, flexbile concatenation of tabular data using polars.2 versions - Latest release: 3 months ago - 1.44 thousand downloads last month - 3 stars on GitHub - 2 maintainers
kartothek 5.3.0
A consistent table management library in python41 versions - Latest release: over 2 years ago - 1 dependent repositories - 440 downloads last month - 161 stars on GitHub - 6 maintainers
Top 6.5% on pypi.org
26 versions - Latest release: over 1 year ago - 1 dependent package - 3 dependent repositories - 441 downloads last month - 561 stars on GitHub - 1 maintainer
kglab 0.6.6 💰
A simple abstraction layer in Python for building knowledge graphs26 versions - Latest release: over 1 year ago - 1 dependent package - 3 dependent repositories - 441 downloads last month - 561 stars on GitHub - 1 maintainer
lakeshack 0.2.3
Query parquet files using pyarrow or S3 Select by first gathering file metadata into a database5 versions - Latest release: 6 months ago - 26 downloads last month - 1 stars on GitHub - 2 maintainers
lazydf 0.220726.3
Hopefully safe and deterministic serializer to binary format, including Pandas data8 versions - Latest release: over 1 year ago - 2 dependent repositories - 14 downloads last month - 1 stars on GitHub - 2 maintainers
lonboard 0.9.1
Python library for fast, interactive geospatial vector data visualization in Jupyter.28 versions - Latest release: 3 days ago - 1 dependent package - 1 dependent repositories - 2.83 thousand downloads last month - 432 stars on GitHub - 4 maintainers
microdrill 0.0.3
Simple Apache Drill alternative using PySpark3 versions - Latest release: about 8 years ago - 2 dependent repositories - 13 downloads last month - 7 stars on GitHub - 3 maintainers
overturemapsdownloader 0.1.9
Overture Maps Downloader simplifies geospatial data manipulation6 versions - Latest release: 15 days ago - 108 downloads last month - 87 stars on GitHub - 2 maintainers
parquet2csv 0.8
No judgment, but I looked this up on StackOverflow twice and decided not to have to look it up th...1 version - Latest release: about 1 year ago - 19 downloads last month - 0 stars on GitHub - 2 maintainers
parquet2lance 0.4.1
The Python wrapper for the Rust parquet2lance19 versions - Latest release: 2 months ago - 111 downloads last month - 3,269 stars on GitHub - 2 maintainers
parquet-csv 0.2.0
Parquet from and to CSV format converter5 versions - Latest release: almost 3 years ago - 1 dependent repositories - 59 downloads last month - 2 stars on GitHub - 2 maintainers
parquet-dataset 0.0.1.dev4
Dataset for parquet group3 versions - Latest release: about 2 years ago - 1 dependent repositories - 18 downloads last month - 3 stars on GitHub - 2 maintainers
parquet-loader 0.0.4
Parquet file Load and Read from minio & S34 versions - Latest release: about 2 years ago - 1 dependent repositories - 15 downloads last month - 3 stars on GitHub - 2 maintainers
parquet-metadata 0.0.1
A tool to show metadata about a Parquet file1 version - Latest release: over 5 years ago - 1 dependent repositories - 91.8 thousand downloads last month - 11 stars on GitHub - 1 maintainer
parquet-performance 0.0.2
Performance of parquet files using pandas2 versions - Latest release: about 4 years ago - 1 dependent repositories - 12 downloads last month - 0 stars on GitHub - 2 maintainers
parquet-to-hyper 1.1.4
Create and publish tableau hyper files from parquet files.6 versions - Latest release: 6 months ago - 20 downloads last month - 0 stars on GitHub - 2 maintainers
Top 5.3% on pypi.org
21 versions - Latest release: about 2 months ago - 1 dependent package - 3 dependent repositories - 60.1 thousand downloads last month - 139 stars on GitHub - 2 maintainers
parquet-tools 0.2.16
Easy install parquet-tools21 versions - Latest release: about 2 months ago - 1 dependent package - 3 dependent repositories - 60.1 thousand downloads last month - 139 stars on GitHub - 2 maintainers
pdf2dataset 0.5.3
Easily convert a subdirectory with big volume of PDF documents into a dataset, supports extractin...15 versions - Latest release: over 3 years ago - 1 dependent repositories - 158 downloads last month - 17 stars on GitHub - 2 maintainers
perspective-parquet 0.1.1 💰
Parquet viewer for perspective in JupyterLab2 versions - Latest release: about 1 year ago - 55 downloads last month - 30 stars on GitHub - 2 maintainers
Top 2.3% on pypi.org
86 versions - Latest release: over 1 year ago - 4 dependent packages - 26 dependent repositories - 36.9 thousand downloads last month - 1,752 stars on GitHub - 2 maintainers
petastorm 0.12.1
Petastorm is a library enabling the use of Parquet storage from Tensorflow, Pytorch, and other Py...86 versions - Latest release: over 1 year ago - 4 dependent packages - 26 dependent repositories - 36.9 thousand downloads last month - 1,752 stars on GitHub - 2 maintainers
polario 0.3.1
Polars IO11 versions - Latest release: 9 months ago - 125 downloads last month - 0 stars on GitHub - 2 maintainers
polars-partitions 0.1.2
Simplified work with partitions based on Polars library3 versions - Latest release: 3 months ago - 22 downloads last month - 0 stars on GitHub - 2 maintainers
procmondf 0.10
provides a convenient and efficient solution for capturing and analyzing system activity logs usi...1 version - Latest release: 11 months ago - 21 downloads last month - 0 stars on GitHub - 2 maintainers
pyarrowfs-adlgen2 0.2.4
Use pyarrow with Azure Data Lake gen210 versions - Latest release: about 1 year ago - 2 dependent repositories - 136 thousand downloads last month - 23 stars on GitHub - 2 maintainers
Related Keywords
python
44
data
25
data-science
22
deep-learning
20
csv
20
redshift
19
mysql
19
postgresql
19
snowflake
19
hacktoberfest
19
open-source
18
mssql
18
fivetran
18
django
18
dbt
18
datalineage
18
dataengineering
18
data-lineage
18
arrow
17
gpu
17
hybrid-parallelism
16
recommender-system
16
json
11
pandas
11
pyarrow
11
deep learning
10
recommendation system
9
python3
8
serialization
8
sql
8
system
7
hdf5
7
apache-arrow
7
recommendation
7
learning
7
deep
7
data-engineering
7
data-visualization
7
s3
6
rust
6
cloud
5
dataframe
5
database
5
etl
5
columnar
5
polars
4
apache
4
msgpack
4
merge
4
parquet-files
4
graphql
4
kallisto
4
apache-parquet
4
gene-expression
4
gct
4
data-version-control
4
filter
4
arff
4
data-versioning
4
analytics
4
tsv
4
dask
4
machine-learning
4
pyspark
4
pytorch
4
pivot-tables
4
transforming-files
4
transcription
4
pickle
4
excel
4
tabular-data
4
stata
4
salmon
4
what-if-analysis
3
cloud-native
3
datafusion
3
datasets
3
delta-lake
3
duckdb
3
in-memory-database
3
query
3
query-frontends
3
rest-api
3
dataset
3
static-datasets
3
influxdb
3
sysml
3
tensorflow
3
IBM
3
hive
3
parquet-tools
3
olap
3
multidimensional-analysis
3
cube
3
elt
3
logstash
3
elasticsearch-loader
3
charts
3
elasticsearch
3
elastic
3