Ecosyste.ms: Packages
An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.
pypi.org "parquet" keyword
swemaps 0.1.0
Sweden in GeoParquet for easy usage.1 version - Latest release: 4 days ago - 127 downloads last month - 0 stars on GitHub - 2 maintainers
csv-parakeet 1.1.1
Parquet to CSV command line tool3 versions - Latest release: 7 months ago - 19 downloads last month - 0 stars on GitHub - 2 maintainers
elbow 0.1.1
Lift special-purpose data into common tabular formats for analytics 💪5 versions - Latest release: 3 months ago - 1 dependent package - 1 dependent repositories - 207 downloads last month - 0 stars on GitHub - 2 maintainers
ts-store 0.0.1
Flexible storage for time series.1 version - Latest release: about 1 month ago - 76 downloads last month - 0 stars on GitHub - 2 maintainers
parquet-performance 0.0.2
Performance of parquet files using pandas2 versions - Latest release: about 4 years ago - 1 dependent repositories - 12 downloads last month - 0 stars on GitHub - 2 maintainers
parquet-to-hyper 1.1.4
Create and publish tableau hyper files from parquet files.6 versions - Latest release: 6 months ago - 91 downloads last month - 0 stars on GitHub - 2 maintainers
parquet2csv 0.8
No judgment, but I looked this up on StackOverflow twice and decided not to have to look it up th...1 version - Latest release: about 1 year ago - 19 downloads last month - 0 stars on GitHub - 2 maintainers
procmondf 0.10
provides a convenient and efficient solution for capturing and analyzing system activity logs usi...1 version - Latest release: 11 months ago - 21 downloads last month - 0 stars on GitHub - 2 maintainers
featherplot 0.0.6
featherplot6 versions - Latest release: 9 months ago - 4 downloads last month - 0 stars on GitHub - 1 maintainer
dfcsv2parquet 0.10
converts large CSV files into smaller, Pandas-compatible Parquet files1 version - Latest release: 11 months ago - 21 downloads last month - 0 stars on GitHub - 2 maintainers
polario 0.3.1
Polars IO11 versions - Latest release: 9 months ago - 125 downloads last month - 0 stars on GitHub - 2 maintainers
polars-partitions 0.1.2
Simplified work with partitions based on Polars library3 versions - Latest release: 3 months ago - 22 downloads last month - 0 stars on GitHub - 2 maintainers
crazybin 1.0.0
Much better than hexbins plots! Use any kind of tile to visualize data and pictures!1 version - Latest release: 8 days ago - 84 downloads last month - 0 stars on GitHub - 2 maintainers
safeserializer 0.230202.1
Hopefully safe and deterministic serializer to binary format, including Pandas data1 version - Latest release: over 1 year ago - 2 dependent packages - 1 dependent repositories - 19 downloads last month - 1 stars on GitHub - 1 maintainer
lazydf 0.220726.3
Hopefully safe and deterministic serializer to binary format, including Pandas data8 versions - Latest release: over 1 year ago - 2 dependent repositories - 14 downloads last month - 1 stars on GitHub - 2 maintainers
lakeshack 0.2.3
Query parquet files using pyarrow or S3 Select by first gathering file metadata into a database5 versions - Latest release: 6 months ago - 26 downloads last month - 1 stars on GitHub - 2 maintainers
unicov 0.0.1
Universal File conversion python library to convert any file to any type of file1 version - Latest release: over 1 year ago - 15 downloads last month - 1 stars on GitHub - 1 maintainer
data-toolset 0.1.7
Upgrade from avro-tools and parquet-tools jars to a more user-friendly Python package.7 versions - Latest release: 7 months ago - 63 downloads last month - 1 stars on GitHub - 2 maintainers
expressionable-cli 1.0.0
A command-line tool for transforming large data sets3 versions - Latest release: about 5 years ago - 1 dependent repositories - 28 downloads last month - 2 stars on GitHub - 4 maintainers
shapeshifter-cli 1.0.0
A command-line tool for transforming large data sets4 versions - Latest release: about 5 years ago - 1 dependent repositories - 33 downloads last month - 2 stars on GitHub - 2 maintainers
shapeshifter 1.1.1
A tool for managing large datasets5 versions - Latest release: about 5 years ago - 1 dependent repositories - 61 downloads last month - 2 stars on GitHub - 2 maintainers
scd2 1.0.0
slowly changing dimension type 2 with pandas or parquet1 version - Latest release: over 1 year ago - 272 downloads last month - 2 stars on GitLab.com - 2 maintainers
expressionable 1.2
A tool for managing large datasets3 versions - Latest release: about 5 years ago - 2 dependent repositories - 37 downloads last month - 2 stars on GitHub - 4 maintainers
parquet-csv 0.2.0
Parquet from and to CSV format converter5 versions - Latest release: almost 3 years ago - 1 dependent repositories - 59 downloads last month - 2 stars on GitHub - 2 maintainers
aws-parquet 0.5.0
An object-oriented interface for defining parquet datasets for AWS built on top of awswrangler an...5 versions - Latest release: 11 months ago - 28 downloads last month - 3 stars on GitHub - 2 maintainers
parquet-dataset 0.0.1.dev4
Dataset for parquet group3 versions - Latest release: about 2 years ago - 1 dependent repositories - 18 downloads last month - 3 stars on GitHub - 2 maintainers
joinem 0.1.5
CLI for fast, flexbile concatenation of tabular data using polars.2 versions - Latest release: 3 months ago - 1.44 thousand downloads last month - 3 stars on GitHub - 2 maintainers
csvcli 1.0.2
A light-weight command-line tool to browse and query CSV, Excel and Apache Parquet files, regardl...2 versions - Latest release: over 3 years ago - 1 dependent repositories - 46 downloads last month - 3 stars on GitHub - 2 maintainers
parquet-loader 0.0.4
Parquet file Load and Read from minio & S34 versions - Latest release: about 2 years ago - 1 dependent repositories - 15 downloads last month - 3 stars on GitHub - 2 maintainers
easy-s3 1.0.7
This package helps you use S3 easily.9 versions - Latest release: over 3 years ago - 1 dependent repositories - 59 downloads last month - 3 stars on GitHub - 2 maintainers
airflow-provider-xlsx 1.0.1
Airflow operators for reading and writing XLSX files9 versions - Latest release: about 2 years ago - 1 dependent repositories - 15.1 thousand downloads last month - 4 stars on GitHub - 1 maintainer
aporia-importer 1.0.6
Import data from cloud storage to Aporia1 version - Latest release: almost 3 years ago - 1 dependent repositories - 5 downloads last month - 7 stars on GitHub - 1 maintainer
microdrill 0.0.3
Simple Apache Drill alternative using PySpark3 versions - Latest release: about 8 years ago - 2 dependent repositories - 13 downloads last month - 7 stars on GitHub - 3 maintainers
typeddfs 0.16.5
Pandas DataFrame subclasses that enforce structure and can self-organize.34 versions - Latest release: about 2 years ago - 1 dependent repositories - 283 downloads last month - 8 stars on GitHub - 2 maintainers
catalystcoop.pudl-catalog 2022.11.30 💰
A catalog of open data related to the US energy system.4 versions - Latest release: over 1 year ago - 51 downloads last month - 9 stars on GitHub - 1 maintainer
bids2table 0.1.0
Efficiently index large-scale BIDS datasets and derivatives3 versions - Latest release: 10 days ago - 1 dependent repositories - 410 downloads last month - 11 stars on GitHub - 2 maintainers
risk-command-center 1.0.37
Risk Command Center, manage your risk easly.2 versions - Latest release: almost 2 years ago - 1 dependent repositories - 10 downloads last month - 11 stars on GitHub - 1 maintainer
ddump 0.2.0
A data dump tool8 versions - Latest release: 10 days ago - 1 dependent repositories - 55 downloads last month - 11 stars on GitHub - 1 maintainer
parquet-metadata 0.0.1
A tool to show metadata about a Parquet file1 version - Latest release: over 5 years ago - 1 dependent repositories - 91.8 thousand downloads last month - 11 stars on GitHub - 1 maintainer
analytics-command-center 3.0.14
Command Center for Data Ingestion, Advanced Analytics and Artificial Intelligence process1 version - Latest release: over 2 years ago - 26 downloads last month - 11 stars on GitHub - 1 maintainer
iterabledata 1.0.2
Iterable data processing Python library1 version - Latest release: over 1 year ago - 7 downloads last month - 13 stars on GitHub - 2 maintainers
db2ixf 0.16.1
Parsing and processing of IBM eXchange format (IXF)36 versions - Latest release: 2 months ago - 139 downloads last month - 14 stars on GitHub - 2 maintainers
pynock 1.2.1
A proposed standard `NOCK` for a Parquet format that supports efficient distributed serialization...5 versions - Latest release: over 1 year ago - 56 downloads last month - 15 stars on GitHub - 1 maintainer
pdf2dataset 0.5.3
Easily convert a subdirectory with big volume of PDF documents into a dataset, supports extractin...15 versions - Latest release: over 3 years ago - 1 dependent repositories - 158 downloads last month - 17 stars on GitHub - 2 maintainers
Top 9.6% on pypi.org
24 versions - Latest release: 24 days ago - 4 dependent packages - 1 dependent repositories - 10.8 thousand downloads last month - 18 stars on GitHub - 2 maintainers
influxdb3-python 0.4.0
Community Python client for InfluxDB 3.024 versions - Latest release: 24 days ago - 4 dependent packages - 1 dependent repositories - 10.8 thousand downloads last month - 18 stars on GitHub - 2 maintainers
pyarrowfs-adlgen2 0.2.4
Use pyarrow with Azure Data Lake gen210 versions - Latest release: about 1 year ago - 2 dependent repositories - 136 thousand downloads last month - 23 stars on GitHub - 2 maintainers
imctermite 2.0.16
Enables extraction of measurement data from binary files with extension 'raw' used by proprietary...19 versions - Latest release: 9 months ago - 1 dependent repositories - 820 downloads last month - 27 stars on GitHub - 1 maintainer
perspective-parquet 0.1.1 💰
Parquet viewer for perspective in JupyterLab2 versions - Latest release: about 1 year ago - 55 downloads last month - 30 stars on GitHub - 2 maintainers
datasette-parquet 0.6.1
Read Parquet files in Datasette7 versions - Latest release: 10 months ago - 230 downloads last month - 38 stars on GitHub - 2 maintainers
pyinflux3 0.9.2
Community Python client for InfluxDB IOx24 versions - Latest release: 12 months ago - 127 downloads last month - 44 stars on GitHub - 2 maintainers
pyinflux3-cli 0.9.2
Community Python client for InfluxDB IOx (CLI)9 versions - Latest release: 12 months ago - 1 dependent package - 59 downloads last month - 44 stars on GitHub - 2 maintainers
dbd 0.8.9
dbd is a data loading and transformation tool that enables data analysts and engineers to load an...31 versions - Latest release: about 2 years ago - 1 dependent repositories - 177 downloads last month - 51 stars on GitHub - 2 maintainers
csv2parquet 0.0.9
A tool to convert CSVs to Parquet files10 versions - Latest release: over 4 years ago - 1 dependent repositories - 781 downloads last month - 61 stars on GitHub - 2 maintainers
expanse 0.2.4
turn apple health export.xml into parquet4 versions - Latest release: 15 days ago - 626 downloads last month - 67 stars on GitHub - 2 maintainers
graphique 1.6
GraphQL service for arrow tables and parquet data sets.17 versions - Latest release: 11 days ago - 1 dependent repositories - 422 downloads last month - 70 stars on GitHub - 1 maintainer
atlas-db 0.2.11
turn apple health export.xml into parquet4 versions - Latest release: 6 days ago - 646 downloads last month - 70 stars on GitHub - 2 maintainers
arrowdantic 0.2.3
Arrow, pydantic style8 versions - Latest release: over 1 year ago - 1 dependent repositories - 393 downloads last month - 74 stars on GitHub - 1 maintainer
overturemapsdownloader 0.1.9
Overture Maps Downloader simplifies geospatial data manipulation6 versions - Latest release: 15 days ago - 108 downloads last month - 87 stars on GitHub - 2 maintainers
hybridbackend-cu114 0.6.0a2 removed
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster2 versions - Latest release: about 2 years ago - 105 stars on GitHub
hybridbackend-cpu 0.5.4 removed
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster8 versions - Latest release: almost 2 years ago - 1 dependent repositories - 105 stars on GitHub
hybridbackend 0.5.2.post1 removed
Efficient training of deep recommenders on cloud.1 version - Latest release: over 2 years ago - 105 stars on GitHub
hybridbackend-cu114-tf115 0.6.1a0 removed
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster1 version - Latest release: almost 2 years ago - 105 stars on GitHub
hybridbackend-cpu-nightly 0.5.3.post1.dev2180131484 removed
Efficient training of deep recommenders on cloud.6 versions - Latest release: about 2 years ago - 105 stars on GitHub
hybridbackend-cpu-legacy-nightly 0.5.2.post1.dev2160826157 removed
Efficient training of deep recommenders on cloud.6 versions - Latest release: about 2 years ago - 105 stars on GitHub
hybridbackend-nightly 0.6.0a0.dev2182810798 removed
A High-Performance Framework for GPU-centric Training of Wide-and-deep Recommender Systems4 versions - Latest release: about 2 years ago - 105 stars on GitHub
vdf-io 0.1.246 💰
This library uses a universal format for vector datasets to easily export and import data from al...96 versions - Latest release: 6 days ago - 1.63 thousand downloads last month - 118 stars on GitHub - 2 maintainers
Top 5.3% on pypi.org
21 versions - Latest release: about 2 months ago - 1 dependent package - 3 dependent repositories - 60.1 thousand downloads last month - 139 stars on GitHub - 2 maintainers
parquet-tools 0.2.16
Easy install parquet-tools21 versions - Latest release: about 2 months ago - 1 dependent package - 3 dependent repositories - 60.1 thousand downloads last month - 139 stars on GitHub - 2 maintainers
hybridbackend-cpu-legacy 0.5.4
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster5 versions - Latest release: almost 2 years ago - 1 dependent repositories - 25 downloads last month - 144 stars on GitHub - 2 maintainers
hybridbackend-deeprec2208-cu114 0.7.0.dev1672985131
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster1 version - Latest release: over 1 year ago - 8 downloads last month - 144 stars on GitHub - 1 maintainer
hybridbackend-tf115-cu100 0.7.0.dev1666332077
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster4 versions - Latest release: over 1 year ago - 3 downloads last month - 144 stars on GitHub - 2 maintainers
hybridbackend-tf115-cu114 0.8.0.dev1679539959
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster24 versions - Latest release: about 1 year ago - 80 downloads last month - 144 stars on GitHub - 1 maintainer
hybridbackend-deeprec2212-cu114 0.8.0.dev1679289143
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster1 version - Latest release: about 1 year ago - 8 downloads last month - 144 stars on GitHub - 2 maintainers
hybridbackend-tf115-cpu 1.0.0
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster10 versions - Latest release: 10 months ago - 1 dependent repositories - 42 downloads last month - 144 stars on GitHub - 3 maintainers
hybridbackend-tf115-cu118 0.8.0.dev1678154818
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster2 versions - Latest release: about 1 year ago - 13 downloads last month - 144 stars on GitHub - 1 maintainer
hybridbackend-tf115-cu116 0.7.0.dev1672506489
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster5 versions - Latest release: over 1 year ago - 17 downloads last month - 144 stars on GitHub - 2 maintainers
hybridbackend-tf115-cu121 1.0.0
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster2 versions - Latest release: 10 months ago - 6 downloads last month - 144 stars on GitHub - 1 maintainer
kartothek 5.3.0
A consistent table management library in python41 versions - Latest release: over 2 years ago - 1 dependent repositories - 440 downloads last month - 161 stars on GitHub - 6 maintainers
regallager 0.0.1
A consistent table management library in python1 version - Latest release: almost 3 years ago - 1 dependent repositories - 7 downloads last month - 161 stars on GitHub - 2 maintainers
atoti-azure 0.8.12
Plugin to load CSV and Parquet files from Azure Blob Storage into Atoti tables31 versions - Latest release: 9 days ago - 1 dependent package - 1 dependent repositories - 304 downloads last month - 214 stars on GitHub - 2 maintainers
atoti-aws 0.8.12
Plugin to load CSV and Parquet files from AWS S3 into Atoti tables31 versions - Latest release: 9 days ago - 1 dependent package - 2 dependent repositories - 365 downloads last month - 214 stars on GitHub - 2 maintainers
atoti-gcp 0.8.12
Plugin to load CSV and Parquet files from Google Cloud Storage into Atoti tables31 versions - Latest release: 9 days ago - 1 dependent package - 1 dependent repositories - 281 downloads last month - 214 stars on GitHub - 2 maintainers
awkward-numba 0.14.0
Allows awkward arrays to be used in Numba-compiled code and optimizes awkward methods with JIT co...68 versions - Latest release: over 3 years ago - 1 dependent repositories - 427 downloads last month - 216 stars on GitHub - 1 maintainer
Top 3.2% on pypi.org
6 versions - Latest release: over 3 years ago - 5 dependent packages - 15 dependent repositories - 14.9 thousand downloads last month - 217 stars on GitHub - 1 maintainer
awkward0 0.15.5
Manipulate arrays of complex data structures as easily as Numpy.6 versions - Latest release: over 3 years ago - 5 dependent packages - 15 dependent repositories - 14.9 thousand downloads last month - 217 stars on GitHub - 1 maintainer
grai_source_cube 0.0.2
4 versions - Latest release: 2 months ago - 198 downloads last month - 270 stars on GitHub - 2 maintainersgrai_source_flat_file 0.2.2
18 versions - Latest release: 5 months ago - 280 downloads last month - 270 stars on GitHub - 4 maintainersgrai_source_snowflake 0.1.2
29 versions - Latest release: 8 months ago - 409 downloads last month - 270 stars on GitHub - 4 maintainersthe-guide 0.1.36
1 version - Latest release: 9 months ago - 15 downloads last month - 270 stars on GitHub - 4 maintainersgrai_source_dbt 0.3.5
42 versions - Latest release: 3 months ago - 514 downloads last month - 270 stars on GitHub - 2 maintainers
Top 9.6% on pypi.org
grai-source-postgres 0.2.4
30 versions - Latest release: 6 months ago - 1 dependent package - 1 dependent repositories - 434 downloads last month - 270 stars on GitHub - 4 maintainersgrai-source-openlineage 0.1.0a1
1 version - Latest release: 7 months ago - 173 downloads last month - 270 stars on GitHub - 4 maintainersgrai_source_redshift 0.1.1
18 versions - Latest release: 8 months ago - 289 downloads last month - 270 stars on GitHub - 4 maintainersgrai_source_mssql 0.1.3
21 versions - Latest release: 3 months ago - 288 downloads last month - 270 stars on GitHub - 4 maintainersgrai_source_mysql 0.1.1
19 versions - Latest release: 8 months ago - 290 downloads last month - 270 stars on GitHub - 4 maintainersgrai-cli 0.2.6
19 versions - Latest release: 7 months ago - 166 downloads last month - 270 stars on GitHub - 4 maintainers
Top 6.1% on pypi.org
grai-client 0.3.5
61 versions - Latest release: 6 months ago - 18 dependent packages - 3 dependent repositories - 775 downloads last month - 270 stars on GitHub - 4 maintainersgrai_source_bigquery 0.2.4
29 versions - Latest release: 8 months ago - 395 downloads last month - 270 stars on GitHub - 4 maintainersgrai_schemas 0.2.11
61 versions - Latest release: 6 months ago - 1.29 thousand downloads last month - 270 stars on GitHub - 4 maintainersgrai_source_fivetran 0.1.2
18 versions - Latest release: 8 months ago - 283 downloads last month - 270 stars on GitHub - 4 maintainers
Top 9.9% on pypi.org
grai-graph 0.2.5
24 versions - Latest release: 8 months ago - 2 dependent packages - 1 dependent repositories - 319 downloads last month - 270 stars on GitHub - 4 maintainersgrai_source_looker 0.0.3
11 versions - Latest release: 8 months ago - 244 downloads last month - 270 stars on GitHub - 4 maintainers
Related Keywords
python
44
data
25
data-science
22
csv
20
deep-learning
20
hacktoberfest
19
postgresql
19
mysql
19
redshift
19
snowflake
19
data-lineage
18
dataengineering
18
datalineage
18
dbt
18
django
18
fivetran
18
mssql
18
open-source
18
gpu
17
arrow
17
recommender-system
16
hybrid-parallelism
16
json
11
pandas
11
pyarrow
11
deep learning
10
recommendation system
9
sql
8
python3
8
serialization
8
data-visualization
7
apache-arrow
7
data-engineering
7
system
7
recommendation
7
learning
7
hdf5
7
deep
7
s3
6
rust
6
dataframe
5
database
5
etl
5
columnar
5
cloud
5
stata
4
salmon
4
tabular-data
4
pickle
4
msgpack
4
merge
4
kallisto
4
gene-expression
4
gct
4
machine-learning
4
transcription
4
transforming-files
4
tsv
4
apache
4
apache-parquet
4
pytorch
4
parquet-files
4
analytics
4
graphql
4
dask
4
polars
4
data-versioning
4
pyspark
4
pivot-tables
4
excel
4
filter
4
data-version-control
4
arff
4
cube
3
tensorflow
3
sysml
3
delta-lake
3
in-memory-database
3
static-datasets
3
dataset
3
query
3
elastic
3
query-frontends
3
elasticsearch
3
rest-api
3
elasticsearch-loader
3
logstash
3
parquet-tools
3
influxdb
3
datafusion
3
multidimensional-analysis
3
olap
3
what-if-analysis
3
duckdb
3
cloud-native
3
IBM
3
charts
3
source
3
table
3
load
3