Ecosyste.ms: Packages
An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.
pypi.org "Spark" keyword
hyperleaup 0.1.2
Create and publish Tableau Hyper files from Apache Spark DataFrames and Spark SQL.
3 versions - Latest release: 9 months ago - 21.6 thousand downloads last month - 29 stars on GitHub - 1 maintainer
dtstools 0.0.9
This package aims to provide features for working with Delta Lake.
9 versions - Latest release: 11 months ago - 439 downloads last month - 1 star on GitHub - 1 maintainer
cloudtik 1.6.0
CloudTik: a cloud scale platform for distributed analytics and AI on public clouds
17 versions - Latest release: 3 months ago - 1 dependent repository - 177 downloads last month - 2 stars on GitHub - 1 maintainer
rstudio-spark-install 0.8.0
Utility to set up various versions of Apache Spark on multiple platforms.
1 version - Latest release: almost 7 years ago - 1 dependent repository - 12 downloads last month - 16 stars on GitHub - 1 maintainer
oracle-ml-insights 1.1.0
ML Observability Insights Library
2 versions - Latest release: about 1 month ago - 46 downloads last month - 151 stars on GitHub - 4 maintainers
johnsnowlabs-tmp 4.4.25
The John Snow Labs Library gives you access to all of John Snow Labs Enterprise And Open Source p...
13 versions - Latest release: 12 months ago - 157 downloads last month - 825 stars on GitHub - 1 maintainer
johnsnowlabs-for-databricks 5.3.5
The John Snow Labs Library gives you access to all of John Snow Labs Enterprise And Open Source p...
140 versions - Latest release: 29 days ago - 2.61 thousand downloads last month - 825 stars on GitHub - 2 maintainers
johnsnowlabs-for-databricks-by-ckl 5.1.8rc16
The John Snow Labs Library gives you access to all of John Snow Labs Enterprise And Open Source p...
43 versions - Latest release: 7 months ago - 255 downloads last month - 825 stars on GitHub - 1 maintainer
johnsnowlabs-my-mehmet 4.4.25
The John Snow Labs Library gives you access to all of John Snow Labs Enterprise And Open Source p...
1 version - Latest release: 12 months ago - 18 downloads last month - 825 stars on GitHub - 1 maintainer
johnsnowlabs-by-ckl 5.0.29
The John Snow Labs Library gives you access to all of John Snow Labs Enterprise And Open Source p...
49 versions - Latest release: 9 months ago - 254 downloads last month - 822 stars on GitHub - 1 maintainer
johnsnowlabs-by-kshitiz 5.0.1
The John Snow Labs Library gives you access to all of John Snow Labs Enterprise And Open Source p...
31 versions - Latest release: 10 months ago - 87 downloads last month - 825 stars on GitHub - 1 maintainer
hops 3.7.0.0
Client library for interacting with Hopsworks, a full-stack platform for scale-out data science.
60 versions - Latest release: 4 months ago - 1 dependent repository - 246 downloads last month - 27 stars on GitHub - 3 maintainers
td-pyspark 24.4.1
Treasure Data extension for pyspark
23 versions - Latest release: 2 months ago - 1 dependent package - 3 dependent repositories - 23.7 thousand downloads last month - 2 maintainers
Top 5.4% on pypi.org
ispark 1.0.4
Spark human utils
4 versions - Latest release: over 5 years ago - 1 dependent repository - 14 downloads last month - 1 maintainer
johnsnowlabs 5.3.5
The John Snow Labs Library gives you access to all of John Snow Labs Enterprise And Open Source p...
143 versions - Latest release: 29 days ago - 1 dependent package - 3 dependent repositories - 22.3 thousand downloads last month - 45 stars on GitHub - 2 maintainers
Top 5.4% on pypi.org
sparksnake 0.2.2
Improving the development of Spark applications deployed as jobs on AWS services like Glue and EMR
24 versions - Latest release: 11 months ago - 23.6 thousand downloads last month - 12 stars on GitHub - 1 maintainer
td-pyspark-ea 20.12.0
Treasure Data extension for pyspark
4 versions - Latest release: over 3 years ago - 1 dependent repository - 95 downloads last month - 2 maintainers
soda-spark 0.3.3
Soda SQL API for PySpark data frame
11 versions - Latest release: about 2 years ago - 1 dependent package - 1 dependent repository - 45.7 thousand downloads last month - 61 stars on GitHub - 1 maintainer
Top 9.9% on pypi.org
nanbi 0.0.1
A framework that allows the definition of data transformations in a composable way, agnostic of d...
1 version - Latest release: about 1 year ago - 11 downloads last month - 1 star on GitHub - 1 maintainer
bat 0.3.9
Zeek Analysis Tools
13 versions - Latest release: over 3 years ago - 19 dependent repositories - 359 downloads last month - 415 stars on GitHub - 1 maintainer
Top 6.8% on pypi.org
zat 0.4.7
Zeek Analysis Tools
12 versions - Latest release: 5 months ago - 3 dependent repositories - 410 downloads last month - 415 stars on GitHub - 1 maintainer
Top 7.2% on pypi.org
pyzeek 0.3.9
Zeek Analysis Tools
2 versions - Latest release: over 3 years ago - 1 dependent repository - 24 downloads last month - 415 stars on GitHub - 1 maintainer
smartframes 1.1.0
Enhanced Python Dataframes for Spark/PySpark
3 versions - Latest release: over 8 years ago - 2 dependent repositories - 41 downloads last month - 6 stars on GitHub - 1 maintainer
tarunsingh 0.0.1
Spark datasets match after Datacompy results
1 version - Latest release: about 2 years ago - 12 downloads last month - 1 maintainer
sparkdataset 1.0.0
Provides instant access to many popular datasets right from Pyspark (in dataframe structure).
1 version - Latest release: over 2 years ago - 1 dependent repository - 9 downloads last month - 34 stars on GitHub - 1 maintainer
onetl 0.10.2
One ETL tool to rule them all
16 versions - Latest release: 3 months ago - 564 downloads last month - 58 stars on GitHub - 2 maintainers
maggy 1.1.2
Distribution transparent Machine Learning experiments on Apache Spark
21 versions - Latest release: about 2 years ago - 1 dependent repository - 106 downloads last month - 89 stars on GitHub - 3 maintainers
graphistry 0.33.8
A visual graph analytics library for extracting, transforming, displaying, and sharing big graphs...
167 versions - Latest release: about 1 month ago - 3 dependent packages - 29 dependent repositories - 4.18 thousand downloads last month - 2,071 stars on GitHub - 1 maintainer
Top 2.9% on pypi.org
zstreams 0.0.2
Zeek Analysis Tools
2 versions - Latest release: over 3 years ago - 1 dependent repository - 22 downloads last month - 1 star on GitHub - 1 maintainer
pyspark-json-model 0.0.3
JSON to Relational data model through Pyspark using Databricks
2 versions - Latest release: over 2 years ago - 1 dependent repository - 17 downloads last month - 1 maintainer
sparkapi 0.1
A Nice Python API to CISCO spark
1 version - Latest release: 10 months ago - 1 dependent repository - 0 stars on GitHub - 1 maintainer
cars-forge 1.0.2
Create an on-demand/spot fleet of single or cluster EC2 instances.
4 versions - Latest release: over 1 year ago - 52 downloads last month - 9 stars on GitHub - 1 maintainer
hsfs 3.7.6
HSFS: An environment independent client to interact with the Hopsworks Featurestore
163 versions - Latest release: about 1 month ago - 1 dependent package - 15 dependent repositories - 10.8 thousand downloads last month - 51 stars on GitHub - 1 maintainer
Top 5.4% on pypi.org
spark-frame 0.5.0
A library containing various utility functions for playing with PySpark DataFrames
11 versions - Latest release: about 1 month ago - 172 downloads last month - 10 stars on GitHub - 1 maintainer
hopsworks 3.7.0
HOPSWORKS: An environment independent client to interact with the Hopsworks API
36 versions - Latest release: about 2 months ago - 18 dependent packages - 44 dependent repositories - 10.6 thousand downloads last month - 7 stars on GitHub - 3 maintainers
Top 8.1% on pypi.org
oracle-mlm-insights 1.0.2.dev131 removed
ML Observability Insights Library
3 versions - Latest release: 6 months ago - 202 downloads last month - 2 maintainers
gluesnake 0.1.1 removed
Spark features created to make it easier to build Glue jobs on AWS
6 versions - Latest release: over 1 year ago - 557 downloads last month - 0 stars on GitHub - 1 maintainer
vee-ap-generation-pyspark-app 1.0.1 removed
PySpark App on PIE Spark
1 version - Latest release: over 1 year ago - 1 maintainer
johnsnowlabs-for-databricks-tmp 4.4.24 removed
The John Snow Labs Library gives you access to all of John Snow Labs Enterprise And Open Source p...
16 versions - Latest release: about 1 year ago - 68 downloads last month - 1 maintainer
Top 5.6% on pypi.org
shui 0.8.1
Spark-Hadoop Unix Installer
11 versions - Latest release: 3 months ago - 1 dependent repository - 106 downloads last month - 0 stars on GitHub - 1 maintainer
flintrock 2.1.0
A command-line tool for launching Apache Spark clusters.
14 versions - Latest release: 7 months ago - 1 dependent repository - 376 downloads last month - 630 stars on GitHub - 1 maintainer
jsl-lib-tmp 4.2.3rc10 removed
The John Snow Labs Library gives you access to all of John Snow Labs Enterprise And Open Source p...
5 versions - Latest release: over 1 year ago - 60 downloads last month - 1 maintainer
Top 6.4% on pypi.org
ckls-test-module 4.2.5rc3 removed
The John Snow Labs Library gives you access to all of John Snow Labs Enterprise And Open Source p...
4 versions - Latest release: over 1 year ago - 168 downloads last month
jsl-lib-tmp-for-databricks 4.2.3rc12 removed
The John Snow Labs Library gives you access to all of John Snow Labs Enterprise And Open Source p...
3 versions - Latest release: over 1 year ago - 37 downloads last month - 1 maintainer
cloudtiktest 0.0.0 removed
CloudTest is a cloud scaling platform for scaling your distributed analytics and AI cluster easil...
1 version - Latest release: almost 2 years ago - 0 stars on
jsl-tmp 4.2.3rc16 removed
The John Snow Labs Library gives you access to all of John Snow Labs Enterprise And Open Source p...
118 versions - Latest release: over 1 year ago - 505 downloads last month
azure-devops-spark 0.0.0.38 removed
A productive library to extract data from Azure Devops and apply agility metrics.
35 versions - Latest release: about 2 years ago
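Each entry above carries a stats portion made of " - "-separated fields (version count, latest release, downloads, stars, maintainers). As a rough sketch only, that portion could be turned into structured data as below. The function names, the dictionary keys, and the "million" scale are assumptions made for illustration; they are not part of the Ecosyste.ms service, and only "thousand" actually appears in this listing.

```python
import re

# Spelled-out magnitudes, e.g. "21.6 thousand".
# "million" is an assumption; only "thousand" appears in the listing.
SCALE = {"thousand": 1_000, "million": 1_000_000}

# Field labels mapped to dictionary keys (key names are this sketch's own choice).
LABELS = {
    "version": "versions",
    "versions": "versions",
    "dependent package": "dependent_packages",
    "dependent packages": "dependent_packages",
    "dependent repository": "dependent_repositories",
    "dependent repositories": "dependent_repositories",
    "downloads last month": "downloads_last_month",
    "star on GitHub": "github_stars",
    "stars on GitHub": "github_stars",
    "maintainer": "maintainers",
    "maintainers": "maintainers",
}

# A number (optionally followed by a magnitude word), a space, then the label.
FIELD = re.compile(r"([\d.,]+(?: thousand| million)?) (.+)")


def parse_count(text):
    """Convert '439', '2,071' or '21.6 thousand' to an int."""
    parts = text.replace(",", "").split()
    value = float(parts[0])
    if len(parts) > 1:
        value *= SCALE[parts[1]]
    return round(value)


def parse_stats(line):
    """Parse the stats portion of one entry into a dict.

    Fields with an unrecognised or truncated label are skipped.
    """
    stats = {}
    for field in line.split(" - "):
        field = field.strip()
        if field.startswith("Latest release:"):
            # Kept as relative text; the listing gives no absolute date.
            stats["latest_release"] = field.split(":", 1)[1].strip()
            continue
        match = FIELD.fullmatch(field)
        if match and match.group(2) in LABELS:
            stats[LABELS[match.group(2)]] = parse_count(match.group(1))
    return stats


if __name__ == "__main__":
    print(parse_stats(
        "3 versions - Latest release: 9 months ago - "
        "21.6 thousand downloads last month - 29 stars on GitHub - 1 maintainer"
    ))
```

Download figures given in thousands are rounded in the listing, so the parsed integers are approximations rather than exact counts.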
Related Keywords
spark (13)
python (12)
NLP (12)
OCR (12)
Labs (12)
Snow (12)
John (12)
Medical (12)
Legal (12)
Finance (12)
pandas (11)
Python (8)
seq2seq (7)
t5 (7)
PySpark (7)
sentiment-classifier (6)
sentiment-analysis (6)
spell-checker (6)
streamlit (6)
text-classification (6)
text-summarization (6)
sentence-embeddings (6)
nlu (6)
natural-language-understanding (6)
bert-embedding (6)
dependency-parsing (6)
entity-resolution (6)
language-detection (6)
lemmatizer (6)
named-entity-recognition (6)
text-translation (6)
transformers (6)
Distributed (5)
Cloud (4)
Zeek (4)
Bro (4)
Networking (4)
Kafka (4)
aws (4)
Parquet (4)
data-analysis (4)
Security (4)
Machine Learning (4)
pyspark (4)
scikit-learn (3)
security (3)
zeek (3)
zeek-analysis (3)
AWS (3)
networking (3)
kafka (3)
bro (3)
Scikit-Learn (3)
machine-learning (3)
Hadoop (3)
data (2)
TreasureData (2)
glue (2)
TensorFlow (2)
emr (2)
DataFrame (2)
Databricks (2)
Analytic (2)
AI (2)
ai (2)
PIE (2)
Jupyter (2)
data-science (2)
deep-learning (2)
DataOps (2)
MLOps (2)
Apache (2)
Feature Store (2)
Oracle Cloud Infrastructure (2)
OCI (2)
Oracle (2)
ML Monitoring (2)
ML Observability (2)
Data Science (2)
Drift Detection (2)
Dask (2)
Hopsworks (2)
hacktoberfest (2)
ec2 (2)
graph (1)
gpu (1)
csv (1)
graph-visualization (1)
graphistry (1)
jupyter (1)
neo4j (1)
Spot (1)
hyperparameter-optimization (1)
hyperparameter-search (1)
hyperparameter-tuning (1)
cugraph (1)
cudf (1)
dask (1)
GPU (1)
Graph (1)