Ecosyste.ms: Packages
An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.
pypi.org "Spark" keyword
spark-frame 0.5.0
A library containing various utility functions for playing with PySpark DataFrames11 versions - Latest release: 15 days ago - 176 downloads last month - 10 stars on GitHub - 1 maintainer
Top 8.1% on pypi.org
36 versions - Latest release: about 1 month ago - 18 dependent packages - 44 dependent repositories - 10.6 thousand downloads last month - 7 stars on GitHub - 3 maintainers
hopsworks 3.7.0
HOPSWORKS: An environment independent client to interact with the Hopsworks API36 versions - Latest release: about 1 month ago - 18 dependent packages - 44 dependent repositories - 10.6 thousand downloads last month - 7 stars on GitHub - 3 maintainers
Top 2.9% on pypi.org
167 versions - Latest release: 15 days ago - 3 dependent packages - 29 dependent repositories - 4.57 thousand downloads last month - 2,058 stars on GitHub - 1 maintainer
graphistry 0.33.8
A visual graph analytics library for extracting, transforming, displaying, and sharing big graphs...167 versions - Latest release: 15 days ago - 3 dependent packages - 29 dependent repositories - 4.57 thousand downloads last month - 2,058 stars on GitHub - 1 maintainer
onetl 0.10.2
One ETL tool to rule them all14 versions - Latest release: about 2 months ago - 797 downloads last month - 58 stars on GitHub - 2 maintainers
oracle-ml-insights 1.1.0
ML Observability Insights Library2 versions - Latest release: 13 days ago - 132 downloads last month - 147 stars on GitHub - 4 maintainers
oracle-mlm-insights 1.0.2.dev131 removed
ML Observability Insights Library3 versions - Latest release: 5 months ago - 202 downloads last month - 2 maintainers
johnsnowlabs-my-mehmet 4.4.25
The John Snow Labs Library gives you access to all of John Snow Labs Enterprise And Open Source p...1 version - Latest release: 11 months ago - 22 downloads last month - 814 stars on GitHub - 1 maintainer
johnsnowlabs-by-kshitiz 5.0.1
The John Snow Labs Library gives you access to all of John Snow Labs Enterprise And Open Source p...31 versions - Latest release: 9 months ago - 45 downloads last month - 814 stars on GitHub - 1 maintainer
dtstools 0.0.9
This package aims to provide features for working with Delta Lake.9 versions - Latest release: 10 months ago - 773 downloads last month - 1 stars on GitHub - 1 maintainer
nanbi 0.0.1
A framework that allows the definition of data transformations in a composable way, agnostic of d...1 version - Latest release: about 1 year ago - 5 downloads last month - 1 stars on GitHub - 1 maintainer
sparksnake 0.2.2
Improving the development of Spark applications deployed as jobs on AWS services like Glue and EMR24 versions - Latest release: 10 months ago - 23.3 thousand downloads last month - 12 stars on GitHub - 1 maintainer
gluesnake 0.1.1 removed
Funcionalidades Spark criadas para facilitar a criação de jobs Glue na AWS6 versions - Latest release: about 1 year ago - 557 downloads last month - 0 stars on GitHub - 1 maintainer
johnsnowlabs-tmp 4.4.25
The John Snow Labs Library gives you access to all of John Snow Labs Enterprise And Open Source p...13 versions - Latest release: 11 months ago - 176 downloads last month - 814 stars on GitHub - 1 maintainer
vee-ap-generation-pyspark-app 1.0.1 removed
PySpark App on PIE Spark1 version - Latest release: over 1 year ago - 1 maintainer
Top 5.6% on pypi.org
16 versions - Latest release: 12 months ago - 68 downloads last month - 1 maintainer
johnsnowlabs-for-databricks-tmp 4.4.24 removed
The John Snow Labs Library gives you access to all of John Snow Labs Enterprise And Open Source p...16 versions - Latest release: 12 months ago - 68 downloads last month - 1 maintainer
tarunsingh 0.0.1
Spark datasets match after Datacompy results1 version - Latest release: almost 2 years ago - 7 downloads last month - 1 maintainer
zstreams 0.0.2 π°
Zeek Analysis Tools2 versions - Latest release: over 3 years ago - 1 dependent repositories - 18 downloads last month - 1 stars on GitHub - 1 maintainer
Top 7.2% on pypi.org
12 versions - Latest release: 4 months ago - 3 dependent repositories - 361 downloads last month - 415 stars on GitHub - 1 maintainer
zat 0.4.7 π°
Zeek Analysis Tools12 versions - Latest release: 4 months ago - 3 dependent repositories - 361 downloads last month - 415 stars on GitHub - 1 maintainer
td-pyspark-ea 20.12.0
Treasure Data extension for pyspark4 versions - Latest release: over 3 years ago - 1 dependent repositories - 43 downloads last month - 2 maintainers
sparkdataset 1.0.0
Provides instant access to many popular datasets right from Pyspark (in dataframe structure).1 version - Latest release: over 2 years ago - 1 dependent repositories - 9 downloads last month - 34 stars on GitHub - 1 maintainer
sparkapi 0.1
A Nice Python API to CISCO spark1 version - Latest release: 9 months ago - 1 dependent repositories - 0 stars on GitHub - 1 maintainer
Top 9.9% on pypi.org
11 versions - Latest release: about 2 years ago - 1 dependent package - 1 dependent repositories - 46.3 thousand downloads last month - 60 stars on GitHub - 1 maintainer
soda-spark 0.3.3
Soda SQL API for PySpark data frame11 versions - Latest release: about 2 years ago - 1 dependent package - 1 dependent repositories - 46.3 thousand downloads last month - 60 stars on GitHub - 1 maintainer
shui 0.8.1
Spark-Hadoop Unix Installer11 versions - Latest release: 2 months ago - 1 dependent repositories - 106 downloads last month - 0 stars on GitHub - 1 maintainer
rstudio-spark-install 0.8.0
Utility to setup various versions of Apache Spark on multiple platforms.1 version - Latest release: almost 7 years ago - 1 dependent repositories - 12 downloads last month - 16 stars on GitHub - 1 maintainer
pyspark-json-model 0.0.3
JSON to Relational data model through Pyspark using Databricks2 versions - Latest release: over 2 years ago - 1 dependent repositories - 20 downloads last month - 1 maintainer
maggy 1.1.2
Distribution transparent Machine Learning experiments on Apache Spark21 versions - Latest release: almost 2 years ago - 1 dependent repositories - 131 downloads last month - 89 stars on GitHub - 3 maintainers
ispark 1.0.4
Spark human utils4 versions - Latest release: over 5 years ago - 1 dependent repositories - 38 downloads last month - 1 maintainer
hops 3.7.0.0
Client library for interacting with Hopsworks, a full-stack platform for scale-out data science.60 versions - Latest release: 3 months ago - 1 dependent repositories - 525 downloads last month - 27 stars on GitHub - 3 maintainers
johnsnowlabs-for-databricks-by-ckl 5.1.8rc16
The John Snow Labs Library gives you access to all of John Snow Labs Enterprise And Open Source p...43 versions - Latest release: 6 months ago - 300 downloads last month - 814 stars on GitHub - 1 maintainer
johnsnowlabs-by-ckl 5.0.29
The John Snow Labs Library gives you access to all of John Snow Labs Enterprise And Open Source p...49 versions - Latest release: 8 months ago - 300 downloads last month - 814 stars on GitHub - 1 maintainer
cars-forge 1.0.2
Create an on-demand/spot fleet of single or cluster EC2 instances.4 versions - Latest release: over 1 year ago - 39 downloads last month - 9 stars on GitHub - 1 maintainer
johnsnowlabs-for-databricks 5.3.5
The John Snow Labs Library gives you access to all of John Snow Labs Enterprise And Open Source p...135 versions - Latest release: 3 days ago - 2.28 thousand downloads last month - 818 stars on GitHub - 2 maintainers
Top 5.4% on pypi.org
141 versions - Latest release: 3 days ago - 1 dependent package - 3 dependent repositories - 28.1 thousand downloads last month - 45 stars on GitHub - 2 maintainers
johnsnowlabs 5.3.5
The John Snow Labs Library gives you access to all of John Snow Labs Enterprise And Open Source p...141 versions - Latest release: 3 days ago - 1 dependent package - 3 dependent repositories - 28.1 thousand downloads last month - 45 stars on GitHub - 2 maintainers
flintrock 2.1.0
A command-line tool for launching Apache Spark clusters.14 versions - Latest release: 6 months ago - 1 dependent repositories - 376 downloads last month - 630 stars on GitHub - 1 maintainer
hyperleaup 0.1.2
Create and publish Tableau Hyper files from Apache Spark DataFrames and Spark SQL.3 versions - Latest release: 8 months ago - 29.4 thousand downloads last month - 29 stars on GitHub - 1 maintainer
cloudtik 1.6.0
CloudTik: a cloud scale platform for distributed analytics and AI on public clouds17 versions - Latest release: 2 months ago - 1 dependent repositories - 329 downloads last month - 2 stars on GitHub - 1 maintainer
Top 5.4% on pypi.org
23 versions - Latest release: about 1 month ago - 1 dependent package - 3 dependent repositories - 15 thousand downloads last month - 2 maintainers
td-pyspark 24.4.1
Treasure Data extension for pyspark23 versions - Latest release: about 1 month ago - 1 dependent package - 3 dependent repositories - 15 thousand downloads last month - 2 maintainers
Top 5.4% on pypi.org
163 versions - Latest release: 13 days ago - 1 dependent package - 15 dependent repositories - 10.8 thousand downloads last month - 51 stars on GitHub - 1 maintainer
hsfs 3.7.6
HSFS: An environment independent client to interact with the Hopsworks Featurestore163 versions - Latest release: 13 days ago - 1 dependent package - 15 dependent repositories - 10.8 thousand downloads last month - 51 stars on GitHub - 1 maintainer
Top 6.8% on pypi.org
13 versions - Latest release: over 3 years ago - 19 dependent repositories - 415 downloads last month - 415 stars on GitHub - 1 maintainer
bat 0.3.9 π°
Zeek Analysis Tools13 versions - Latest release: over 3 years ago - 19 dependent repositories - 415 downloads last month - 415 stars on GitHub - 1 maintainer
pyzeek 0.3.9 π°
Zeek Analysis Tools2 versions - Latest release: over 3 years ago - 1 dependent repositories - 47 downloads last month - 415 stars on GitHub - 1 maintainer
smartframes 1.1.0
Enhanced Python Dataframes for Spark/PySpark3 versions - Latest release: over 8 years ago - 2 dependent repositories - 31 downloads last month - 6 stars on GitHub - 1 maintainer
Top 6.4% on pypi.org
5 versions - Latest release: over 1 year ago - 60 downloads last month - 1 maintainer
jsl-lib-tmp 4.2.3rc10 removed
The John Snow Labs Library gives you access to all of John Snow Labs Enterprise And Open Source p...5 versions - Latest release: over 1 year ago - 60 downloads last month - 1 maintainer
ckls-test-module 4.2.5rc3 removed
The John Snow Labs Library gives you access to all of John Snow Labs Enterprise And Open Source p...4 versions - Latest release: over 1 year ago - 168 downloads last month
jsl-lib-tmp-for-databricks 4.2.3rc12 removed
The John Snow Labs Library gives you access to all of John Snow Labs Enterprise And Open Source p...3 versions - Latest release: over 1 year ago - 37 downloads last month - 1 maintainer
cloudtiktest 0.0.0 removed
CloudTest is a cloud scaling platform for scaling your distributed analytics and AI cluster easil...1 version - Latest release: almost 2 years ago - 0 stars on
jsl-tmp 4.2.3rc16 removed
The John Snow Labs Library gives you access to all of John Snow Labs Enterprise And Open Source p...118 versions - Latest release: over 1 year ago - 505 downloads last month
azure-devops-spark 0.0.0.38 removed
A productive library to extract data from Azure Devops and apply agility metrics.35 versions - Latest release: almost 2 years ago
Related Keywords
spark
13
NLP
12
OCR
12
Finance
12
Legal
12
Medical
12
John
12
Snow
12
Labs
12
python
12
pandas
11
Python
8
seq2seq
7
t5
7
PySpark
7
bert-embedding
6
dependency-parsing
6
entity-resolution
6
language-detection
6
lemmatizer
6
named-entity-recognition
6
natural-language-understanding
6
nlu
6
sentence-embeddings
6
sentiment-analysis
6
sentiment-classifier
6
spell-checker
6
streamlit
6
text-classification
6
text-summarization
6
text-translation
6
transformers
6
Distributed
5
Security
4
Machine Learning
4
Networking
4
Bro
4
data-analysis
4
Zeek
4
aws
4
pyspark
4
Kafka
4
Cloud
4
Parquet
4
bro
3
AWS
3
kafka
3
networking
3
scikit-learn
3
security
3
zeek
3
zeek-analysis
3
Hadoop
3
machine-learning
3
Scikit-Learn
3
Hopsworks
2
Analytic
2
AI
2
TreasureData
2
hacktoberfest
2
ec2
2
DataFrame
2
TensorFlow
2
PIE
2
glue
2
emr
2
Feature Store
2
MLOps
2
deep-learning
2
Databricks
2
data-science
2
Jupyter
2
ai
2
Dask
2
data
2
Drift Detection
2
Data Science
2
ML Observability
2
ML Monitoring
2
Oracle
2
OCI
2
Oracle Cloud Infrastructure
2
DataOps
2
Apache
2
Keras
1
Training
1
Optimization
1
pydantic
1
PyTorch
1
ablation
1
ablation-studies
1
ablation-study
1
automl
1
blackbox-optimization
1
standard
1
CISCO
1
API
1
Data quality
1
Soda
1
data-engineering
1