proxy.golang.org "pyspark" keyword
View the packages on the proxy.golang.org package registry that are tagged with the "pyspark" keyword.
Top 6.7% on proxy.golang.org
9 versions - Latest release: about 2 months ago - 1,086 stars on GitHub
github.com/graphframes/graphframes v0.9.3
GraphFrames is a package for Apache Spark which provides DataFrame-based Graphs9 versions - Latest release: about 2 months ago - 1,086 stars on GitHub
Top 5.9% on proxy.golang.org
11 versions - Latest release: 7 months ago - 673 stars on GitHub
github.com/mrpowers/chispa v0.11.1
PySpark test helper methods with beautiful error messages11 versions - Latest release: 7 months ago - 673 stars on GitHub
Top 6.7% on proxy.golang.org
67 versions - Latest release: about 1 month ago - 5,170 stars on GitHub
github.com/microsoft/SynapseML v1.0.15
Simple and Distributed Machine Learning67 versions - Latest release: about 1 month ago - 5,170 stars on GitHub
Top 6.7% on proxy.golang.org
67 versions - Latest release: about 1 month ago - 5,170 stars on GitHub
github.com/microsoft/synapseml v1.0.15
Simple and Distributed Machine Learning67 versions - Latest release: about 1 month ago - 5,170 stars on GitHub
Top 5.9% on proxy.golang.org
11 versions - Latest release: 7 months ago - 673 stars on GitHub
github.com/MrPowers/chispa v0.11.1
PySpark test helper methods with beautiful error messages11 versions - Latest release: 7 months ago - 673 stars on GitHub
Top 6.7% on proxy.golang.org
48 versions - Latest release: about 1 month ago - 605 stars on GitHub
github.com/capitalone/datacompy v0.18.1
Pandas, Polars, Spark, and Snowpark DataFrame comparison for humans and more!48 versions - Latest release: about 1 month ago - 605 stars on GitHub
Top 5.7% on proxy.golang.org
2 versions - Latest release: 6 months ago - 0 stars on GitHub
github.com/nessi-dev/nessi v1.0.0
A Python-based data processing and analysis tool built with PySpark and Delta Lake2 versions - Latest release: 6 months ago - 0 stars on GitHub
Top 8.2% on proxy.golang.org
2 versions - Latest release: over 7 years ago - 401 stars on GitHub
github.com/CamDavidsonPilon/tdigest v0.5.2 💰
t-Digest data structure in Python. Useful for percentiles and quantiles, including distributed en...2 versions - Latest release: over 7 years ago - 401 stars on GitHub
Top 8.2% on proxy.golang.org
2 versions - Latest release: over 7 years ago - 401 stars on GitHub
github.com/camdavidsonpilon/tdigest v0.5.2 💰
t-Digest data structure in Python. Useful for percentiles and quantiles, including distributed en...2 versions - Latest release: over 7 years ago - 401 stars on GitHub
Top 6.7% on proxy.golang.org
22 versions - Latest release: about 7 years ago - 6,169 stars on GitHub
github.com/ibis-project/ibis v0.14.0
the portable Python dataframe library22 versions - Latest release: about 7 years ago - 6,169 stars on GitHub
Top 9.6% on proxy.golang.org
31 versions - Latest release: over 2 years ago - 1,519 stars on GitHub
github.com/hi-primus/optimus v23.5.0-beta+incompatible
:truck: Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and P...31 versions - Latest release: over 2 years ago - 1,519 stars on GitHub
Top 5.6% on proxy.golang.org
46 versions - Latest release: almost 3 years ago - 1,860 stars on GitHub
github.com/uber/petastorm v0.12.1
Petastorm library enables single machine or distributed training and evaluation of deep learning ...46 versions - Latest release: almost 3 years ago - 1,860 stars on GitHub
Top 5.9% on proxy.golang.org
5 versions - Latest release: 11 months ago - 675 stars on GitHub
github.com/mrpowers-io/quinn v0.10.3
pyspark methods to enhance developer productivity 📣 👯 🎉5 versions - Latest release: 11 months ago - 675 stars on GitHub
Top 6.7% on proxy.golang.org
31 versions - Latest release: over 2 years ago - 1,447 stars on GitHub
github.com/ironmussa/optimus v23.5.0-beta+incompatible
:truck: Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and P...31 versions - Latest release: over 2 years ago - 1,447 stars on GitHub
Top 6.7% on proxy.golang.org
31 versions - Latest release: over 2 years ago - 1,447 stars on GitHub
github.com/ironmussa/Optimus v23.5.0-beta+incompatible
:truck: Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and P...31 versions - Latest release: over 2 years ago - 1,447 stars on GitHub
Top 5.6% on proxy.golang.org
25 versions - Latest release: 3 months ago - 229 stars on GitHub
github.com/g-research/spark-extension v2.14.2+incompatible
A library that provides useful extensions to Apache Spark and PySpark.25 versions - Latest release: 3 months ago - 229 stars on GitHub
Top 5.6% on proxy.golang.org
25 versions - Latest release: 3 months ago - 229 stars on GitHub
github.com/G-Research/spark-extension v2.14.2+incompatible
A library that provides useful extensions to Apache Spark and PySpark.25 versions - Latest release: 3 months ago - 229 stars on GitHub
Top 8.2% on proxy.golang.org
24 versions - Latest release: over 5 years ago - 54 stars on GitHub
github.com/tubular/sparkly v2.8.2+incompatible
Helpers & syntactic sugar for PySpark.24 versions - Latest release: over 5 years ago - 54 stars on GitHub
Top 9.6% on proxy.golang.org
30 versions - Latest release: over 1 year ago - 1,042 stars on GitHub
github.com/logicalclocks/hopsworks v3.7.0+incompatible
Hopsworks - Data-Intensive AI platform with a Feature Store30 versions - Latest release: over 1 year ago - 1,042 stars on GitHub
Related Keywords
spark
10
python
9
machine-learning
7
data-science
7
data-analysis
4
dask
4
scala
4
apache-spark
4
big-data
4
big-data-cleaning
3
ml
3
bigdata
3
data-wrangling
3
cudf
3
dask-cudf
3
deep-learning
3
data-preparation
3
data-extraction
3
data-profiling
3
data-exploration
3
data-cleansing
3
azure
3
data-cleaning
3
data-transformation
3
data-cleaner
3
pyarrow
2
quantile
2
gr-oss
2
percentile
2
mapreduce
2
java
2
estimate
2
distributed-computing
2
snowflake
2
dataframes
2
testing
2
ai
2
cognitive-services
2
databricks
2
http
2
lightgbm
2
microsoft
2
model-deployment
2
onnx
2
opencv
2
synapse
2
pandas
2
polars
2
bigquery
1
network-motif
1
graphs
1
parquet
1
parquet-files
1
pytorch
1
sysml
1
tensorflow
1
dataframe
1
connected-components
1
aws
1
feature-engineering
1
feature-management
1
feature-store
1
gcp
1
governance
1
hopsworks
1
kserve
1
mlops
1
model-serving
1
serverless
1
clickhouse
1
database
1
datafusion
1
duckdb
1
impala
1
mssql
1
mysql
1
postgresql
1
compare
1
sql
1
sqlite
1
trino
1
data
1
fugue
1
delta-lake
1
data-processing
1
data-engineering
1
numpy
1
snowpark
1
networks
1
network-motifs
1