Ecosyste.ms: Packages
An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.
pypi.org "parquet" keyword
safeserializer 0.230202.1
Hopefully safe and deterministic serializer to binary format, including Pandas data1 version - Latest release: over 1 year ago - 2 dependent packages - 1 dependent repositories - 19 downloads last month - 1 stars on GitHub - 1 maintainer
Top 2.3% on pypi.org
86 versions - Latest release: over 1 year ago - 4 dependent packages - 26 dependent repositories - 36.9 thousand downloads last month - 1,752 stars on GitHub - 2 maintainers
petastorm 0.12.1
Petastorm is a library enabling the use of Parquet storage from Tensorflow, Pytorch, and other Py...86 versions - Latest release: over 1 year ago - 4 dependent packages - 26 dependent repositories - 36.9 thousand downloads last month - 1,752 stars on GitHub - 2 maintainers
aporia-importer 1.0.6
Import data from cloud storage to Aporia1 version - Latest release: almost 3 years ago - 1 dependent repositories - 26 downloads last month - 7 stars on GitHub - 1 maintainer
pynock 1.2.1
A proposed standard `NOCK` for a Parquet format that supports efficient distributed serialization...5 versions - Latest release: over 1 year ago - 56 downloads last month - 15 stars on GitHub - 1 maintainer
imctermite 2.0.16
Enables extraction of measurement data from binary files with extension 'raw' used by proprietary...19 versions - Latest release: 10 months ago - 1 dependent repositories - 820 downloads last month - 27 stars on GitHub - 1 maintainer
hops-petastorm 0.9.4
Petastorm is a library enabling the use of Parquet storage from Tensorflow, Pytorch, and other Py...6 versions - Latest release: over 3 years ago - 1 dependent repositories - 93 downloads last month - 1,752 stars on GitHub - 3 maintainers
featherplot 0.0.6
featherplot6 versions - Latest release: 10 months ago - 4 downloads last month - 0 stars on GitHub - 1 maintainer
analytics-command-center 3.0.14
Command Center for Data Ingestion, Advanced Analytics and Artificial Intelligence process1 version - Latest release: over 2 years ago - 26 downloads last month - 11 stars on GitHub - 1 maintainer
airflow-provider-xlsx 1.0.1
Airflow operators for reading and writing XLSX files9 versions - Latest release: about 2 years ago - 1 dependent repositories - 15.1 thousand downloads last month - 4 stars on GitHub - 1 maintainer
graphique 1.6
GraphQL service for arrow tables and parquet data sets.17 versions - Latest release: 22 days ago - 1 dependent repositories - 422 downloads last month - 70 stars on GitHub - 1 maintainer
hybridbackend-deeprec2208-cu114 0.7.0.dev1672985131
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster1 version - Latest release: over 1 year ago - 8 downloads last month - 144 stars on GitHub - 1 maintainer
hybridbackend-tf115-cu118 0.8.0.dev1678154818
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster2 versions - Latest release: about 1 year ago - 13 downloads last month - 144 stars on GitHub - 1 maintainer
catalystcoop.pudl-catalog 2022.11.30 💰
A catalog of open data related to the US energy system.4 versions - Latest release: over 1 year ago - 51 downloads last month - 9 stars on GitHub - 1 maintainer
ddump 0.2.0
A data dump tool8 versions - Latest release: 22 days ago - 1 dependent repositories - 55 downloads last month - 11 stars on GitHub - 1 maintainer
hybridbackend-tf115-cu114 0.8.0.dev1679539959
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster24 versions - Latest release: about 1 year ago - 80 downloads last month - 144 stars on GitHub - 1 maintainer
microdrill 0.0.3
Simple Apache Drill alternative using PySpark3 versions - Latest release: about 8 years ago - 2 dependent repositories - 13 downloads last month - 7 stars on GitHub - 3 maintainers
hybridbackend-tf115-cu121 1.0.0
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster2 versions - Latest release: 10 months ago - 6 downloads last month - 144 stars on GitHub - 1 maintainer
hybridbackend-cu114-tf115 0.6.1a0 removed
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster1 version - Latest release: almost 2 years ago - 105 stars on GitHub
hybridbackend-cu114 0.6.0a2 removed
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster2 versions - Latest release: about 2 years ago - 105 stars on GitHub
hybridbackend-nightly 0.6.0a0.dev2182810798 removed
A High-Performance Framework for GPU-centric Training of Wide-and-deep Recommender Systems4 versions - Latest release: about 2 years ago - 105 stars on GitHub
hybridbackend-cpu-legacy-nightly 0.5.2.post1.dev2160826157 removed
Efficient training of deep recommenders on cloud.6 versions - Latest release: about 2 years ago - 105 stars on GitHub
hybridbackend-cpu-nightly 0.5.3.post1.dev2180131484 removed
Efficient training of deep recommenders on cloud.6 versions - Latest release: about 2 years ago - 105 stars on GitHub
hybridbackend 0.5.2.post1 removed
Efficient training of deep recommenders on cloud.1 version - Latest release: over 2 years ago - 105 stars on GitHub
hybridbackend-cpu 0.5.4 removed
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster8 versions - Latest release: almost 2 years ago - 1 dependent repositories - 105 stars on GitHub
Related Keywords
python
44
data
25
data-science
22
csv
20
deep-learning
20
hacktoberfest
19
mysql
19
postgresql
19
redshift
19
snowflake
19
data-lineage
18
dataengineering
18
datalineage
18
dbt
18
django
18
fivetran
18
mssql
18
open-source
18
gpu
17
arrow
17
recommender-system
16
hybrid-parallelism
16
pandas
12
json
11
pyarrow
11
deep learning
10
recommendation system
9
serialization
8
python3
8
sql
8
system
7
data-engineering
7
recommendation
7
hdf5
7
apache-arrow
7
data-visualization
7
learning
7
deep
7
s3
6
rust
6
cloud
5
etl
5
dataframe
5
database
5
columnar
5
gct
4
pivot-tables
4
polars
4
gene-expression
4
kallisto
4
merge
4
msgpack
4
pickle
4
salmon
4
apache
4
stata
4
tabular-data
4
transcription
4
transforming-files
4
tsv
4
pyspark
4
parquet-files
4
pytorch
4
excel
4
apache-parquet
4
analytics
4
dask
4
filter
4
arff
4
data-version-control
4
data-versioning
4
machine-learning
4
graphql
4
cube
3
rest-api
3
query-frontends
3
query
3
in-memory-database
3
delta-lake
3
datasets
3
datafusion
3
tensorflow
3
cloud-native
3
blob-storage
3
sysml
3
influxdb
3
static-datasets
3
multidimensional-analysis
3
olap
3
logstash
3
elasticsearch-loader
3
hive
3
elasticsearch
3
elastic
3
parquet-tools
3
dataset
3
what-if-analysis
3
aws
3
charts
3
elt
3