Ecosyste.ms: Packages
An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.
pypi.org "parquet" keyword
microdrill 0.0.3
Simple Apache Drill alternative using PySpark3 versions - Latest release: about 8 years ago - 2 dependent repositories - 13 downloads last month - 7 stars on GitHub - 3 maintainers
csvtoparquetlib 0.1.4
2 versions - Latest release: over 5 years ago - 23 downloads last month - 2 maintainerscsvtoparquet 0.1.5
4 versions - Latest release: over 5 years ago - 60 downloads last month - 2 maintainersparquet-metadata 0.0.1
A tool to show metadata about a Parquet file1 version - Latest release: over 5 years ago - 1 dependent repositories - 91.8 thousand downloads last month - 11 stars on GitHub - 1 maintainer
Top 3.9% on pypi.org
50 versions - Latest release: over 5 years ago - 6 dependent packages - 24 dependent repositories - 340 downloads last month - 1,311 stars on GitHub - 1 maintainer
quilt 2.9.15
Quilt is a data package manager50 versions - Latest release: over 5 years ago - 6 dependent packages - 24 dependent repositories - 340 downloads last month - 1,311 stars on GitHub - 1 maintainer
shapeshifter-cli 1.0.0
A command-line tool for transforming large data sets4 versions - Latest release: about 5 years ago - 1 dependent repositories - 15 downloads last month - 2 stars on GitHub - 2 maintainers
shapeshifter 1.1.1
A tool for managing large datasets5 versions - Latest release: about 5 years ago - 1 dependent repositories - 61 downloads last month - 2 stars on GitHub - 2 maintainers
expressionable-cli 1.0.0
A command-line tool for transforming large data sets3 versions - Latest release: about 5 years ago - 1 dependent repositories - 28 downloads last month - 2 stars on GitHub - 4 maintainers
expressionable 1.2
A tool for managing large datasets3 versions - Latest release: about 5 years ago - 2 dependent repositories - 37 downloads last month - 2 stars on GitHub - 4 maintainers
quilt-installer 0.0.0a5
Quilt Data installation tool4 versions - Latest release: over 4 years ago - 1 dependent repositories - 28 downloads last month - 1,311 stars on GitHub - 1 maintainer
quilt-stack-installer 1.0.0
Quilt Data installation tool2 versions - Latest release: over 4 years ago - 1 dependent repositories - 8 downloads last month - 1,311 stars on GitHub - 1 maintainer
csv2parquet 0.0.9
A tool to convert CSVs to Parquet files10 versions - Latest release: over 4 years ago - 1 dependent repositories - 781 downloads last month - 61 stars on GitHub - 2 maintainers
Top 7.4% on pypi.org
35 versions - Latest release: over 4 years ago - 6 dependent repositories - 207 downloads last month - 395 stars on GitHub - 2 maintainers
elasticsearch-loader 0.6.0
A pythonic tool for batch loading data files (json, parquet, csv, tsv) into ElasticSearch35 versions - Latest release: over 4 years ago - 6 dependent repositories - 207 downloads last month - 395 stars on GitHub - 2 maintainers
esl-redis 0.6.0
elasticsearch_loader plugin for redis12 versions - Latest release: over 4 years ago - 1 dependent repositories - 23 downloads last month - 395 stars on GitHub - 1 maintainer
esl-s3 0.6.0
elasticsearch_loader plugin for AWS s310 versions - Latest release: over 4 years ago - 1 dependent repositories - 8 downloads last month - 395 stars on GitHub - 1 maintainer
parquet-performance 0.0.2
Performance of parquet files using pandas2 versions - Latest release: about 4 years ago - 1 dependent repositories - 12 downloads last month - 0 stars on GitHub - 2 maintainers
firespark 0.0.32
FireSpark data processing utility library16 versions - Latest release: almost 4 years ago - 1 dependent repositories - 140 downloads last month - 1,752 stars on GitHub - 2 maintainers
pdf2dataset 0.5.3
Easily convert a subdirectory with big volume of PDF documents into a dataset, supports extractin...15 versions - Latest release: over 3 years ago - 1 dependent repositories - 158 downloads last month - 17 stars on GitHub - 2 maintainers
hops-petastorm 0.9.4
Petastorm is a library enabling the use of Parquet storage from Tensorflow, Pytorch, and other Py...6 versions - Latest release: over 3 years ago - 1 dependent repositories - 93 downloads last month - 1,752 stars on GitHub - 3 maintainers
awkward-numba 0.14.0
Allows awkward arrays to be used in Numba-compiled code and optimizes awkward methods with JIT co...68 versions - Latest release: over 3 years ago - 1 dependent repositories - 427 downloads last month - 216 stars on GitHub - 1 maintainer
csvcli 1.0.2
A light-weight command-line tool to browse and query CSV, Excel and Apache Parquet files, regardl...2 versions - Latest release: over 3 years ago - 1 dependent repositories - 46 downloads last month - 3 stars on GitHub - 2 maintainers
easy-s3 1.0.7
This package helps you use S3 easily.9 versions - Latest release: over 3 years ago - 1 dependent repositories - 59 downloads last month - 3 stars on GitHub - 2 maintainers
Top 3.2% on pypi.org
6 versions - Latest release: over 3 years ago - 5 dependent packages - 15 dependent repositories - 14.9 thousand downloads last month - 217 stars on GitHub - 1 maintainer
awkward0 0.15.5
Manipulate arrays of complex data structures as easily as Numpy.6 versions - Latest release: over 3 years ago - 5 dependent packages - 15 dependent repositories - 14.9 thousand downloads last month - 217 stars on GitHub - 1 maintainer
regallager 0.0.1
A consistent table management library in python1 version - Latest release: almost 3 years ago - 1 dependent repositories - 7 downloads last month - 161 stars on GitHub - 2 maintainers
aporia-importer 1.0.6
Import data from cloud storage to Aporia1 version - Latest release: almost 3 years ago - 1 dependent repositories - 5 downloads last month - 7 stars on GitHub - 1 maintainer
parquet-csv 0.2.0
Parquet from and to CSV format converter5 versions - Latest release: almost 3 years ago - 1 dependent repositories - 59 downloads last month - 2 stars on GitHub - 2 maintainers
analytics-command-center 3.0.14
Command Center for Data Ingestion, Advanced Analytics and Artificial Intelligence process1 version - Latest release: over 2 years ago - 26 downloads last month - 11 stars on GitHub - 1 maintainer
kartothek 5.3.0
A consistent table management library in python41 versions - Latest release: over 2 years ago - 1 dependent repositories - 440 downloads last month - 161 stars on GitHub - 6 maintainers
hybridbackend 0.5.2.post1 removed
Efficient training of deep recommenders on cloud.1 version - Latest release: over 2 years ago - 105 stars on GitHub
Top 8.2% on pypi.org
35 versions - Latest release: about 2 years ago - 3 dependent repositories - 842 downloads last month - 536 stars on GitHub - 2 maintainers
pystore 0.1.23 💰
Fast data store for Pandas timeseries data35 versions - Latest release: about 2 years ago - 3 dependent repositories - 842 downloads last month - 536 stars on GitHub - 2 maintainers
dbd 0.8.9
dbd is a data loading and transformation tool that enables data analysts and engineers to load an...31 versions - Latest release: about 2 years ago - 1 dependent repositories - 177 downloads last month - 51 stars on GitHub - 2 maintainers
typeddfs 0.16.5
Pandas DataFrame subclasses that enforce structure and can self-organize.34 versions - Latest release: about 2 years ago - 1 dependent repositories - 283 downloads last month - 8 stars on GitHub - 2 maintainers
parquet-loader 0.0.4
Parquet file Load and Read from minio & S34 versions - Latest release: about 2 years ago - 1 dependent repositories - 15 downloads last month - 3 stars on GitHub - 2 maintainers
parquet-dataset 0.0.1.dev4
Dataset for parquet group3 versions - Latest release: about 2 years ago - 1 dependent repositories - 18 downloads last month - 3 stars on GitHub - 2 maintainers
roapi-http 0.6.0
Create full-fledged APIs for slowly moving datasets without writing a single line of code.17 versions - Latest release: about 2 years ago - 1 dependent repositories - 427 downloads last month - 3,089 stars on GitHub - 1 maintainer
airflow-provider-xlsx 1.0.1
Airflow operators for reading and writing XLSX files9 versions - Latest release: about 2 years ago - 1 dependent repositories - 15.1 thousand downloads last month - 4 stars on GitHub - 1 maintainer
hybridbackend-cpu-legacy-nightly 0.5.2.post1.dev2160826157 removed
Efficient training of deep recommenders on cloud.6 versions - Latest release: about 2 years ago - 105 stars on GitHub
hybridbackend-cpu-nightly 0.5.3.post1.dev2180131484 removed
Efficient training of deep recommenders on cloud.6 versions - Latest release: about 2 years ago - 105 stars on GitHub
hybridbackend-nightly 0.6.0a0.dev2182810798 removed
A High-Performance Framework for GPU-centric Training of Wide-and-deep Recommender Systems4 versions - Latest release: about 2 years ago - 105 stars on GitHub
hybridbackend-cu114 0.6.0a2 removed
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster2 versions - Latest release: about 2 years ago - 105 stars on GitHub
risk-command-center 1.0.37
Risk Command Center, manage your risk easly.2 versions - Latest release: almost 2 years ago - 1 dependent repositories - 10 downloads last month - 11 stars on GitHub - 1 maintainer
hybridbackend-cpu-legacy 0.5.4
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster5 versions - Latest release: almost 2 years ago - 1 dependent repositories - 25 downloads last month - 144 stars on GitHub - 2 maintainers
hybridbackend-cpu 0.5.4 removed
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster8 versions - Latest release: almost 2 years ago - 1 dependent repositories - 105 stars on GitHub
hybridbackend-cu114-tf115 0.6.1a0 removed
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster1 version - Latest release: almost 2 years ago - 105 stars on GitHub
pynock 1.2.1
A proposed standard `NOCK` for a Parquet format that supports efficient distributed serialization...5 versions - Latest release: over 1 year ago - 5 downloads last month - 15 stars on GitHub - 1 maintainer
hybridbackend-tf115-cu100 0.7.0.dev1666332077
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster4 versions - Latest release: over 1 year ago - 3 downloads last month - 144 stars on GitHub - 2 maintainers
unicov 0.0.1
Universal File conversion python library to convert any file to any type of file1 version - Latest release: over 1 year ago - 15 downloads last month - 1 stars on GitHub - 1 maintainer
Top 6.5% on pypi.org
26 versions - Latest release: over 1 year ago - 1 dependent package - 3 dependent repositories - 441 downloads last month - 561 stars on GitHub - 1 maintainer
kglab 0.6.6 💰
A simple abstraction layer in Python for building knowledge graphs26 versions - Latest release: over 1 year ago - 1 dependent package - 3 dependent repositories - 441 downloads last month - 561 stars on GitHub - 1 maintainer
arrowdantic 0.2.3
Arrow, pydantic style8 versions - Latest release: over 1 year ago - 1 dependent repositories - 393 downloads last month - 74 stars on GitHub - 1 maintainer
Top 2.3% on pypi.org
86 versions - Latest release: over 1 year ago - 4 dependent packages - 26 dependent repositories - 36.9 thousand downloads last month - 1,752 stars on GitHub - 2 maintainers
petastorm 0.12.1
Petastorm is a library enabling the use of Parquet storage from Tensorflow, Pytorch, and other Py...86 versions - Latest release: over 1 year ago - 4 dependent packages - 26 dependent repositories - 36.9 thousand downloads last month - 1,752 stars on GitHub - 2 maintainers
catalystcoop.pudl-catalog 2022.11.30 💰
A catalog of open data related to the US energy system.4 versions - Latest release: over 1 year ago - 51 downloads last month - 9 stars on GitHub - 1 maintainer
iterabledata 1.0.2
Iterable data processing Python library1 version - Latest release: over 1 year ago - 7 downloads last month - 13 stars on GitHub - 2 maintainers
hybridbackend-tf115-cu116 0.7.0.dev1672506489
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster5 versions - Latest release: over 1 year ago - 17 downloads last month - 144 stars on GitHub - 2 maintainers
hybridbackend-deeprec2208-cu114 0.7.0.dev1672985131
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster1 version - Latest release: over 1 year ago - 8 downloads last month - 144 stars on GitHub - 1 maintainer
lazydf 0.220726.3
Hopefully safe and deterministic serializer to binary format, including Pandas data8 versions - Latest release: over 1 year ago - 2 dependent repositories - 14 downloads last month - 1 stars on GitHub - 2 maintainers
scd2 1.0.0
slowly changing dimension type 2 with pandas or parquet1 version - Latest release: over 1 year ago - 272 downloads last month - 2 stars on GitLab.com - 2 maintainers
safeserializer 0.230202.1
Hopefully safe and deterministic serializer to binary format, including Pandas data1 version - Latest release: over 1 year ago - 2 dependent packages - 1 dependent repositories - 19 downloads last month - 1 stars on GitHub - 1 maintainer
parquet2csv 0.8
No judgment, but I looked this up on StackOverflow twice and decided not to have to look it up th...1 version - Latest release: about 1 year ago - 19 downloads last month - 0 stars on GitHub - 2 maintainers
pyarrowfs-adlgen2 0.2.4
Use pyarrow with Azure Data Lake gen210 versions - Latest release: about 1 year ago - 2 dependent repositories - 136 thousand downloads last month - 23 stars on GitHub - 2 maintainers
hybridbackend-tf115-cu118 0.8.0.dev1678154818
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster2 versions - Latest release: about 1 year ago - 13 downloads last month - 144 stars on GitHub - 1 maintainer
hybridbackend-deeprec2212-cu114 0.8.0.dev1679289143
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster1 version - Latest release: about 1 year ago - 8 downloads last month - 144 stars on GitHub - 2 maintainers
hybridbackend-tf115-cu114 0.8.0.dev1679539959
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster24 versions - Latest release: about 1 year ago - 80 downloads last month - 144 stars on GitHub - 1 maintainer
perspective-parquet 0.1.1 💰
Parquet viewer for perspective in JupyterLab2 versions - Latest release: about 1 year ago - 55 downloads last month - 30 stars on GitHub - 2 maintainers
pyinflux3-cli 0.9.2
Community Python client for InfluxDB IOx (CLI)9 versions - Latest release: 12 months ago - 1 dependent package - 59 downloads last month - 44 stars on GitHub - 2 maintainers
pyinflux3 0.9.2
Community Python client for InfluxDB IOx24 versions - Latest release: 12 months ago - 127 downloads last month - 44 stars on GitHub - 2 maintainers
aws-parquet 0.5.0
An object-oriented interface for defining parquet datasets for AWS built on top of awswrangler an...5 versions - Latest release: 11 months ago - 28 downloads last month - 3 stars on GitHub - 2 maintainers
procmondf 0.10
provides a convenient and efficient solution for capturing and analyzing system activity logs usi...1 version - Latest release: 11 months ago - 21 downloads last month - 0 stars on GitHub - 2 maintainers
dfcsv2parquet 0.10
converts large CSV files into smaller, Pandas-compatible Parquet files1 version - Latest release: 11 months ago - 21 downloads last month - 0 stars on GitHub - 2 maintainers
hybridbackend-tf115-cpu 1.0.0
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster10 versions - Latest release: 10 months ago - 1 dependent repositories - 42 downloads last month - 144 stars on GitHub - 3 maintainers
hybridbackend-tf115-cu121 1.0.0
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster2 versions - Latest release: 10 months ago - 6 downloads last month - 144 stars on GitHub - 1 maintainer
datasette-parquet 0.6.1
Read Parquet files in Datasette7 versions - Latest release: 10 months ago - 230 downloads last month - 38 stars on GitHub - 2 maintainers
featherplot 0.0.6
featherplot6 versions - Latest release: 9 months ago - 4 downloads last month - 0 stars on GitHub - 1 maintainer
imctermite 2.0.16
Enables extraction of measurement data from binary files with extension 'raw' used by proprietary...19 versions - Latest release: 9 months ago - 1 dependent repositories - 820 downloads last month - 27 stars on GitHub - 1 maintainer
the-guide 0.1.36
1 version - Latest release: 9 months ago - 15 downloads last month - 270 stars on GitHub - 4 maintainerspolario 0.3.1
Polars IO11 versions - Latest release: 9 months ago - 125 downloads last month - 0 stars on GitHub - 2 maintainers
Top 9.9% on pypi.org
grai-graph 0.2.5
24 versions - Latest release: 8 months ago - 2 dependent packages - 1 dependent repositories - 319 downloads last month - 270 stars on GitHub - 4 maintainersgrai_source_snowflake 0.1.2
29 versions - Latest release: 8 months ago - 409 downloads last month - 270 stars on GitHub - 4 maintainersgrai_source_looker 0.0.3
11 versions - Latest release: 8 months ago - 244 downloads last month - 270 stars on GitHub - 4 maintainersgrai_source_redshift 0.1.1
18 versions - Latest release: 8 months ago - 289 downloads last month - 270 stars on GitHub - 4 maintainersgrai_source_mysql 0.1.1
19 versions - Latest release: 8 months ago - 290 downloads last month - 270 stars on GitHub - 4 maintainersgrai_source_bigquery 0.2.4
29 versions - Latest release: 8 months ago - 395 downloads last month - 270 stars on GitHub - 4 maintainersgrai_source_fivetran 0.1.2
18 versions - Latest release: 8 months ago - 283 downloads last month - 270 stars on GitHub - 4 maintainersgrai-cli 0.2.6
19 versions - Latest release: 7 months ago - 166 downloads last month - 270 stars on GitHub - 4 maintainersgrai-source-openlineage 0.1.0a1
1 version - Latest release: 7 months ago - 173 downloads last month - 270 stars on GitHub - 4 maintainerscsv-parakeet 1.1.1
Parquet to CSV command line tool3 versions - Latest release: 7 months ago - 19 downloads last month - 0 stars on GitHub - 2 maintainers
data-toolset 0.1.7
Upgrade from avro-tools and parquet-tools jars to a more user-friendly Python package.7 versions - Latest release: 7 months ago - 63 downloads last month - 1 stars on GitHub - 2 maintainers
parquet-to-hyper 1.1.4
Create and publish tableau hyper files from parquet files.6 versions - Latest release: 6 months ago - 20 downloads last month - 0 stars on GitHub - 2 maintainers
Top 9.6% on pypi.org
grai-source-postgres 0.2.4
30 versions - Latest release: 6 months ago - 1 dependent package - 1 dependent repositories - 434 downloads last month - 270 stars on GitHub - 4 maintainersgrai_schemas 0.2.11
61 versions - Latest release: 6 months ago - 1.29 thousand downloads last month - 270 stars on GitHub - 4 maintainers
Top 6.1% on pypi.org
grai-client 0.3.5
61 versions - Latest release: 6 months ago - 18 dependent packages - 3 dependent repositories - 775 downloads last month - 270 stars on GitHub - 4 maintainerslakeshack 0.2.3
Query parquet files using pyarrow or S3 Select by first gathering file metadata into a database5 versions - Latest release: 6 months ago - 26 downloads last month - 1 stars on GitHub - 2 maintainers
Top 2.2% on pypi.org
61 versions - Latest release: 5 months ago - 27 dependent packages - 44 dependent repositories - 47.8 thousand downloads last month - 1,311 stars on GitHub - 1 maintainer
quilt3 5.4.0
Quilt: where data comes together61 versions - Latest release: 5 months ago - 27 dependent packages - 44 dependent repositories - 47.8 thousand downloads last month - 1,311 stars on GitHub - 1 maintainer
grai_source_flat_file 0.2.2
18 versions - Latest release: 5 months ago - 280 downloads last month - 270 stars on GitHub - 4 maintainerscryo-python 0.3.0
cryo is the easiest way to extract blockchain data to parquet, csv, json, or a python dataframe.5 versions - Latest release: 4 months ago - 145 downloads last month - 975 stars on GitHub - 1 maintainer
cryo 0.3.2
cryo is the easiest way to extract blockchain data to parquet, csv, json, or a python dataframe.3 versions - Latest release: 4 months ago - 2 dependent repositories - 919 downloads last month - 975 stars on GitHub - 1 maintainer
elbow 0.1.1
Lift special-purpose data into common tabular formats for analytics 💪5 versions - Latest release: 3 months ago - 1 dependent package - 1 dependent repositories - 207 downloads last month - 0 stars on GitHub - 2 maintainers
grai_source_dbt 0.3.5
42 versions - Latest release: 3 months ago - 514 downloads last month - 270 stars on GitHub - 2 maintainersgrai_source_dbt_cloud 0.1.5
18 versions - Latest release: 3 months ago - 300 downloads last month - 270 stars on GitHub - 4 maintainersgrai_source_mssql 0.1.3
21 versions - Latest release: 3 months ago - 288 downloads last month - 270 stars on GitHub - 4 maintainersjoinem 0.1.5
CLI for fast, flexbile concatenation of tabular data using polars.2 versions - Latest release: 3 months ago - 1.44 thousand downloads last month - 3 stars on GitHub - 2 maintainers
Related Keywords
python
44
data
25
data-science
22
csv
20
deep-learning
20
hacktoberfest
19
postgresql
19
mysql
19
snowflake
19
redshift
19
open-source
18
mssql
18
fivetran
18
django
18
data-lineage
18
dbt
18
dataengineering
18
datalineage
18
arrow
17
gpu
17
recommender-system
16
hybrid-parallelism
16
pyarrow
11
pandas
11
json
11
deep learning
10
recommendation system
9
sql
8
python3
8
serialization
8
apache-arrow
7
system
7
data-visualization
7
learning
7
deep
7
data-engineering
7
recommendation
7
hdf5
7
s3
6
rust
6
dataframe
5
columnar
5
database
5
cloud
5
etl
5
arff
4
parquet-files
4
pytorch
4
polars
4
analytics
4
graphql
4
dask
4
data-version-control
4
filter
4
apache-parquet
4
excel
4
pivot-tables
4
data-versioning
4
gct
4
pyspark
4
apache
4
gene-expression
4
kallisto
4
merge
4
msgpack
4
pickle
4
salmon
4
stata
4
tabular-data
4
transcription
4
transforming-files
4
tsv
4
machine-learning
4
datafusion
3
datasets
3
delta-lake
3
in-memory-database
3
elt
3
query
3
sysml
3
cloud-native
3
blob-storage
3
atoti
3
load
3
table
3
source
3
charts
3
cube
3
hive
3
multidimensional-analysis
3
olap
3
what-if-analysis
3
dataset
3
logstash
3
elasticsearch-loader
3
elasticsearch
3
elastic
3
tensorflow
3
parquet-tools
3
IBM
3