Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "data-science" keyword

Top 1.5% on pypi.org
fastbook 0.0.29
Deep Learning for Coders, 2020
27 versions - Latest release: over 1 year ago - 5 dependent packages - 55 dependent repositories - 16.6 thousand downloads last month - 20,784 stars on GitHub - 1 maintainer
feature-engineering-polars 0.4.0
Feature engineering done with Polars
9 versions - Latest release: 5 months ago - 159 downloads last month - 5 stars on GitHub - 2 maintainers
raimitigations 1.1.1
Python library for implementing and exploring mitigations for Responsible AI.
5 versions - Latest release: about 1 year ago - 1 dependent repositories - 144 downloads last month - 52 stars on GitHub - 4 maintainers
tpot-sh 0.9.5
Tree-based Pipeline Optimization Tool - Successive Halving
1 version - Latest release: over 4 years ago - 1 dependent repositories - 33 downloads last month - 9,508 stars on GitHub - 2 maintainers
Top 1.2% on pypi.org
tpot 0.12.2
Tree-based Pipeline Optimization Tool
62 versions - Latest release: 2 months ago - 4 dependent packages - 227 dependent repositories - 43.8 thousand downloads last month - 9,508 stars on GitHub - 3 maintainers
simple-encoders 0.1.6
Simple encoders to pre-process categoric variables for machine learning systems
7 versions - Latest release: about 2 years ago - 1 dependent repositories - 59 downloads last month - 0 stars on GitHub - 2 maintainers
geochemistrypi 0.5.0
A highly automated machine learning Python framework for data-driven geochemistry discovery
7 versions - Latest release: 4 months ago - 91 downloads last month - 64 stars on GitHub - 2 maintainers
skbase 0.4.6 πŸ’°
Base classes for sklearn-like parametric objects
10 versions - Latest release: 11 months ago - 242 downloads last month - 14 stars on GitHub - 2 maintainers
Top 2.2% on pypi.org
scikit-base 0.7.7 πŸ’°
Base classes for sklearn-like parametric objects
22 versions - Latest release: 21 days ago - 4 dependent packages - 80 dependent repositories - 692 thousand downloads last month - 14 stars on GitHub - 2 maintainers
Top 7.7% on pypi.org
openstef 3.4.25
Open short term energy forecaster
134 versions - Latest release: 8 days ago - 1 dependent package - 3 dependent repositories - 3.39 thousand downloads last month - 74 stars on GitHub - 1 maintainer
Top 9.7% on pypi.org
refuel-autolabel 0.0.16
Label, clean and enrich text datasets with LLMs
16 versions - Latest release: 7 months ago - 1 dependent repositories - 334 downloads last month - 1,788 stars on GitHub - 5 maintainers
Top 3.9% on pypi.org
mlem 0.4.14
Version and deploy your models following GitOps principles
40 versions - Latest release: 10 months ago - 1 dependent package - 23 dependent repositories - 5.31 thousand downloads last month - 712 stars on GitHub - 10 maintainers
Top 3.2% on pypi.org
meteostat 1.6.7 πŸ’°
Access and analyze historical weather and climate data with Python.
47 versions - Latest release: 7 months ago - 7 dependent packages - 41 dependent repositories - 156 thousand downloads last month - 352 stars on GitHub - 2 maintainers
Top 2.0% on pypi.org
leafmap 0.31.9 πŸ’°
A Python package for geospatial analysis and interactive mapping in a Jupyter environment.
134 versions - Latest release: 22 days ago - 11 dependent packages - 196 dependent repositories - 29.8 thousand downloads last month - 2,904 stars on GitHub - 1 maintainer
openghg 1.0.0
OpenGHG - a cloud platform for greenhouse gas data analysis
14 versions - Latest release: about 2 years ago - 1 dependent repositories - 202 downloads last month - 21 stars on GitHub - 1 maintainer
Top 9.4% on pypi.org
pyoptimus 0.1.0
Optimus is the missing framework for cleaning and pre-processing data in a distributed fashion.
32 versions - Latest release: over 1 year ago - 1 dependent repositories - 346 downloads last month - 1,441 stars on GitHub - 2 maintainers
Top 9.6% on pypi.org
gamma-facet 2.1.1
Human-explainable AI.
26 versions - Latest release: 6 months ago - 1 dependent repositories - 377 downloads last month - 482 stars on GitHub - 2 maintainers
Top 3.1% on pypi.org
modal 0.62.142
Python client library for Modal
938 versions - Latest release: 2 days ago - 3 dependent packages - 9 dependent repositories - 255 thousand downloads last month - 227 stars on GitHub - 8 maintainers
Top 1.8% on pypi.org
pymupdfb 1.24.1
MuPDF shared libraries for PyMuPDF.
13 versions - Latest release: about 1 month ago - 2 dependent packages - 133 dependent repositories - 1.92 million downloads last month - 4,025 stars on GitHub - 2 maintainers
Top 1.4% on pypi.org
pymupdf 1.24.2
A high performance Python library for data extraction, analysis, conversion & manipulation of PDF...
113 versions - Latest release: 21 days ago - 134 dependent packages - 1,798 dependent repositories - 2.78 million downloads last month - 4,025 stars on GitHub - 1 maintainer
muler 0.4.0
A Python package for working with data from various echelle spectrographs
12 versions - Latest release: 10 months ago - 2 dependent repositories - 38 downloads last month - 13 stars on GitHub - 2 maintainers
Top 7.4% on pypi.org
bacalhau-apiclient 1.2.2
A Python client for the Bacalhau public API - https://github.com/bacalhau-project/bacalhau/tree/m...
33 versions - Latest release: 2 months ago - 1 dependent package - 1 dependent repositories - 1.11 thousand downloads last month - 610 stars on GitHub - 1 maintainer
Top 8.0% on pypi.org
bacalhau-sdk 1.2.1
Compute over Data framework for public, transparent, and optionally verifiable computation using ...
22 versions - Latest release: 3 months ago - 3 dependent packages - 1 dependent repositories - 955 downloads last month - 610 stars on GitHub - 1 maintainer
cubist 0.1.3
A Python package for fitting Quinlan's Cubist regression model.
16 versions - Latest release: about 2 months ago - 1 dependent repositories - 3.82 thousand downloads last month - 35 stars on GitHub - 2 maintainers
cxapit 1.0.17
A Python client for the Bacalha public API
17 versions - Latest release: 27 days ago - 949 downloads last month - 606 stars on GitHub - 2 maintainers
genalog 0.1.0
Tools for generating analog document (images) from raw text
3 versions - Latest release: almost 3 years ago - 1 dependent repositories - 48 downloads last month - 293 stars on GitHub - 1 maintainer
prequel 0.1.8
An interpreted relational query language that compiles to SQL
1 version - Latest release: over 3 years ago - 1 dependent repositories - 17 downloads last month - 599 stars on GitHub - 2 maintainers
preql-lang 0.2.1
An interpreted relational query language that compiles to SQL
11 versions - Latest release: about 3 years ago - 2 dependent repositories - 117 downloads last month - 599 stars on GitHub - 1 maintainer
prql 0.1.12
An interpreted relational query language that compiles to SQL
6 versions - Latest release: over 3 years ago - 1 dependent repositories - 332 downloads last month - 599 stars on GitHub - 2 maintainers
Top 7.4% on pypi.org
preql 0.2.19
An interpreted relational query language that compiles to SQL
19 versions - Latest release: over 1 year ago - 3 dependent packages - 2 dependent repositories - 226 downloads last month - 599 stars on GitHub - 1 maintainer
py-data-juicer 0.2.0
A One-Stop Data Processing System for Large Language Models.
5 versions - Latest release: 2 months ago - 96 downloads last month - 1,321 stars on GitHub - 2 maintainers
Top 4.3% on pypi.org
graspologic 3.3.0
A set of python modules for graph statistics
219 versions - Latest release: 7 months ago - 1 dependent package - 8 dependent repositories - 3.44 thousand downloads last month - 327 stars on GitHub - 2 maintainers
cancer_data 0.3.6
Preprocessing for various cancer genomics datasets
13 versions - Latest release: about 1 month ago - 211 downloads last month - 14 stars on GitHub - 2 maintainers
Top 4.0% on pypi.org
poutyne 1.17.1 πŸ’°
A simplified framework and utilities for PyTorch.
30 versions - Latest release: 10 months ago - 2 dependent packages - 9 dependent repositories - 5.04 thousand downloads last month - 557 stars on GitHub - 2 maintainers
logdissect 3.1.1
Robust CLI syslog forensics tool
18 versions - Latest release: about 6 years ago - 1 dependent repositories - 270 downloads last month - 137 stars on GitHub - 2 maintainers
buzzard 0.6.5
GIS files manipulations
16 versions - Latest release: over 3 years ago - 2 dependent repositories - 84 downloads last month - 37 stars on GitHub - 8 maintainers
whyqd 1.1.3
data wrangling simplicity, complete audit transparency, and at speed
25 versions - Latest release: 2 months ago - 1 dependent repositories - 238 downloads last month - 32 stars on GitHub - 2 maintainers
sliceguard 0.0.35
A library for detecting critical data slices in structured and unstructured data based on feature...
33 versions - Latest release: 5 months ago - 1 dependent package - 1 dependent repositories - 210 downloads last month - 51 stars on GitHub - 2 maintainers
datafog 3.0.1
Scan, redact, and manage PII in your documents before they get uploaded to a Retrieval Augmented ...
55 versions - Latest release: 1 day ago - 572 downloads last month - 5 stars on GitHub - 2 maintainers
safe-ds-runner 0.14.1
Execute Safe-DS programs that were compiled to Python.
17 versions - Latest release: 6 days ago - 698 downloads last month - 2 stars on GitHub - 2 maintainers
dataanalysistoolkit 1.2.1
The DataAnalysisToolkit project is a Python-based data analysis tool designed to streamline vario...
6 versions - Latest release: 1 day ago - 185 downloads last month - 2 stars on GitHub - 1 maintainer
aymara 0.4.1
Python bindings to the LIMA linguistic analyzer
22 versions - Latest release: almost 2 years ago - 1 dependent repositories - 306 downloads last month - 102 stars on GitHub - 1 maintainer
Top 7.9% on pypi.org
paretoset 1.2.3
Compute the Pareto (non-dominated) set, i.e., skyline operator/query.
8 versions - Latest release: 11 months ago - 3 dependent packages - 9 dependent repositories - 9.42 thousand downloads last month - 44 stars on GitHub - 1 maintainer
Top 2.6% on pypi.org
gspread-pandas 3.3.0
A package to easily open an instance of a Google spreadsheet and interact with worksheets through...
69 versions - Latest release: 3 months ago - 6 dependent packages - 70 dependent repositories - 183 thousand downloads last month - 380 stars on GitHub - 1 maintainer
zero-true 0.1.4
A collaborative notebook built for data scientists
75 versions - Latest release: 7 days ago - 1.1 thousand downloads last month - 33 stars on GitHub - 2 maintainers
covsirphy 3.1.1 πŸ’°
COVID-19 data analysis with phase-dependent SIR-derived ODE models
59 versions - Latest release: 3 months ago - 1 dependent repositories - 366 downloads last month - 101 stars on GitHub - 2 maintainers
Top 0.7% on pypi.org
wandb 0.17.0
A CLI and library for interacting with the Weights & Biases API.
266 versions - Latest release: about 11 hours ago - 401 dependent packages - 9,299 dependent repositories - 12.8 million downloads last month - 8,194 stars on GitHub - 8 maintainers
pyoats 0.1.4
Quick and Easy Time Series Outlier Detection
5 versions - Latest release: 1 day ago - 170 downloads last month - 98 stars on GitHub - 2 maintainers
tsforecasting 1.2.92
TSForecasting is an Automated Time Series Forecasting Framework
22 versions - Latest release: about 11 hours ago - 333 downloads last month - 24 stars on GitHub - 2 maintainers
mlimputer 1.0.66
MLimputer - Missing Data Imputation Framework for Supervised Machine Learning
16 versions - Latest release: about 11 hours ago - 155 downloads last month - 5 stars on GitHub - 1 maintainer
leila 0.2
LibrerΓ­a para medir la calidad de los datos en conjuntos de datos estructurados
2 versions - Latest release: over 2 years ago - 2 dependent repositories - 267 downloads last month - 59 stars on GitHub - 2 maintainers
Top 6.5% on pypi.org
nfstream 6.5.3 πŸ’°
A Flexible Network Data Analysis Framework
75 versions - Latest release: over 1 year ago - 6 dependent repositories - 5.96 thousand downloads last month - 1,008 stars on GitHub - 1 maintainer
aim-with-auth-support 3.14.4
A super-easy way to record, search and compare AI experiments.
17 versions - Latest release: over 1 year ago - 98 downloads last month - 4,813 stars on GitHub - 1 maintainer
spectrafit 1.0.0
Fast fitting of 2D- and 3D-Spectra with established routines
97 versions - Latest release: 7 months ago - 1 dependent repositories - 650 downloads last month - 19 stars on GitHub - 1 maintainer
Top 4.1% on pypi.org
astro-sdk-python 1.8.0
Astro SDK allows rapid and clean development of {Extract, Load, Transform} workflows using Python...
50 versions - Latest release: 3 months ago - 2 dependent packages - 7 dependent repositories - 102 thousand downloads last month - 320 stars on GitHub - 4 maintainers
uto 0.0.1a1
Universal Transfer Operator for Apache Airflow to transfer between files, tables, dataframes, apis.
1 version - Latest release: over 1 year ago - 20 downloads last month - 320 stars on GitHub - 1 maintainer
cleanlab-studio 2.0.4
Client interface for all things Cleanlab Studio
79 versions - Latest release: 1 day ago - 1 dependent repositories - 3.22 thousand downloads last month - 21 stars on GitHub - 5 maintainers
apache-airflow-provider-transfers 0.1.0
This project contains the Universal Transfer Operator which can transfer all the data that could ...
1 version - Latest release: about 1 year ago - 71 downloads last month - 320 stars on GitHub - 2 maintainers
universal-transfer-operator 0.0.1a1
Universal Transfer Operator for Apache Airflow to transfer between files, tables, dataframes, apis.
1 version - Latest release: over 1 year ago - 14 downloads last month - 319 stars on GitHub - 1 maintainer
mitosheet-private 0.3.0
The mitosheet_private package is a wrapper around the mitosheet package.
18 versions - Latest release: 9 months ago - 1 dependent repositories - 132 downloads last month - 2,079 stars on GitHub - 2 maintainers
bnpm 0.5.3
A library of useful modules for data analysis.
18 versions - Latest release: 13 days ago - 448 downloads last month - 2 stars on GitHub - 2 maintainers
Top 3.5% on pypi.org
woodwork 0.30.0
a data typing library for machine learning
60 versions - Latest release: 28 days ago - 10 dependent packages - 33 dependent repositories - 59 thousand downloads last month - 139 stars on GitHub - 11 maintainers
Top 2.4% on pypi.org
ploomber 0.23.2
Write maintainable, production-ready pipelines using Jupyter or your favorite text editor. Develo...
115 versions - Latest release: 3 months ago - 5 dependent packages - 28 dependent repositories - 9.07 thousand downloads last month - 3,384 stars on GitHub - 2 maintainers
obscure_stats 0.2.3
Collection of lesser-known statistical functions
14 versions - Latest release: about 2 months ago - 247 downloads last month - 34 stars on GitHub - 1 maintainer
marimo 0.5.0
A library for making reactive notebooks and apps
129 versions - Latest release: about 13 hours ago - 1 dependent repositories - 14.4 thousand downloads last month - 3,902 stars on GitHub - 2 maintainers
Top 1.0% on pypi.org
ydata-profiling 4.8.3
Generate profile report for pandas DataFrame
19 versions - Latest release: about 13 hours ago - 20 dependent packages - 79 dependent repositories - 1.3 million downloads last month - 11,645 stars on GitHub - 1 maintainer
Top 1.4% on pypi.org
lancedb 0.6.12
lancedb
58 versions - Latest release: about 13 hours ago - 42 dependent packages - 453 dependent repositories - 215 thousand downloads last month - 2,702 stars on GitHub - 5 maintainers
mlrun-pipelines-kfp-common-experiment 0.1.3
MLRun Pipelines package for providing KFP 1.8 compatibility
4 versions - Latest release: 25 days ago - 642 downloads last month - 1,307 stars on GitHub - 2 maintainers
mlrun-pipelines-kfp-v1-8-experiment 0.1.3
MLRun Pipelines package for providing KFP 1.8 compatibility
3 versions - Latest release: 25 days ago - 570 downloads last month - 1,307 stars on GitHub - 4 maintainers
mlrun-pipelines-kfp-v2-experiment 0.1.2
MLRun Pipelines package for providing KFP 2.* compatibility
3 versions - Latest release: 18 days ago - 305 downloads last month - 1,306 stars on GitHub - 2 maintainers
sqldatamodel 0.4.3
SQLDataModel is a lightweight dataframe library designed for efficient data extraction, transform...
43 versions - Latest release: about 13 hours ago - 1.06 thousand downloads last month - 6 stars on GitHub - 1 maintainer
Top 5.2% on pypi.org
edward2 0.0.2
Edward2
2 versions - Latest release: about 3 years ago - 1 dependent package - 74 dependent repositories - 161 downloads last month - 666 stars on GitHub - 2 maintainers
impl 0.1.2
LIME-based library for interpretation of local models
3 versions - Latest release: over 5 years ago - 50 downloads last month - 0 stars on GitHub - 2 maintainers
example-package-elisno 2.6.24
The standard package for data-centric AI, machine learning with label errors, and automatically f...
7 versions - Latest release: 2 months ago - 50 downloads last month - 8,694 stars on GitHub - 1 maintainer
numerblox 1.3.1
Solid Numerai Pipelines
74 versions - Latest release: about 1 month ago - 1 dependent repositories - 567 downloads last month - 98 stars on GitHub - 5 maintainers
Top 1.5% on pypi.org
flyteidl 1.12.0
IDL for Flyte Platform
276 versions - Latest release: 1 day ago - 33 dependent packages - 22 dependent repositories - 255 thousand downloads last month - 4,312 stars on GitHub - 4 maintainers
flaightkit 0.4.0
Flyte SDK for Python (Latch fork)
4 versions - Latest release: almost 2 years ago - 1 dependent package - 1 dependent repositories - 43 downloads last month - 4,787 stars on GitHub - 2 maintainers
Top 3.4% on pypi.org
bytewax 0.19.1
Python Stream Processing
30 versions - Latest release: about 1 month ago - 3 dependent packages - 20 dependent repositories - 5.94 thousand downloads last month - 918 stars on GitHub - 2 maintainers
pykale 0.1.2 πŸ’°
Knowledge-aware machine learning from multiple sources in Python
12 versions - Latest release: 10 months ago - 1 dependent repositories - 127 downloads last month - 427 stars on GitHub - 1 maintainer
Top 1.3% on pypi.org
deeplake 3.9.4
Activeloop Deep Lake
140 versions - Latest release: about 14 hours ago - 27 dependent packages - 1,384 dependent repositories - 50.3 thousand downloads last month - 7,732 stars on GitHub - 5 maintainers
piperider-nightly 0.42.0.20240507
PiperRider CLI
534 versions - Latest release: about 15 hours ago - 4.31 thousand downloads last month - 450 stars on GitHub - 1 maintainer
Top 2.1% on pypi.org
cleanlab 2.6.4
The standard package for data-centric AI, machine learning with label errors, and automatically f...
29 versions - Latest release: about 14 hours ago - 8 dependent packages - 19 dependent repositories - 20.7 thousand downloads last month - 8,694 stars on GitHub - 5 maintainers
Top 5.1% on pypi.org
obsidiantools 0.10.0
Obsidian Tools - a Python interface for Obsidian.md vaults
7 versions - Latest release: over 1 year ago - 1 dependent package - 50 dependent repositories - 2.19 thousand downloads last month - 345 stars on GitHub - 1 maintainer
Top 1.4% on pypi.org
evidently 0.4.20
Open-source tools to analyze, monitor, and debug machine learning model in production.
100 versions - Latest release: 5 days ago - 6 dependent packages - 340 dependent repositories - 1.47 million downloads last month - 4,598 stars on GitHub - 2 maintainers
Top 9.2% on pypi.org
yadg 5.0.3
yet another datagram
29 versions - Latest release: about 1 month ago - 2 dependent packages - 2 dependent repositories - 394 downloads last month - 31 stars on GitHub - 1 maintainer
Top 9.0% on pypi.org
gopup 0.3.8
GoPUP database
38 versions - Latest release: over 1 year ago - 1 dependent repositories - 414 downloads last month - 2,524 stars on GitHub - 1 maintainer
Top 1.5% on pypi.org
apache-superset 4.0.0
A modern, enterprise-ready business intelligence web application
53 versions - Latest release: 30 days ago - 5 dependent packages - 22 dependent repositories - 158 thousand downloads last month - 58,575 stars on GitHub - 5 maintainers
funix 0.5.7 πŸ’°
Building web apps without manually creating widgets
28 versions - Latest release: 11 days ago - 1 dependent repositories - 427 downloads last month - 61 stars on GitHub - 3 maintainers
geostructures 0.8.1
A lightweight implementation of shapes drawn across a geo-temporal plane.
12 versions - Latest release: about 1 month ago - 119 downloads last month - 5 stars on GitHub - 2 maintainers
Top 8.8% on pypi.org
pipeline-ai 2.1.9
Pipelines for machine learning workloads.
202 versions - Latest release: 5 days ago - 2 dependent packages - 1 dependent repositories - 3.01 thousand downloads last month - 112 stars on GitHub - 1 maintainer
ethicml 1.3.0
EthicML is a library for performing and assessing algorithmic fairness. Unlike other libraries, E...
49 versions - Latest release: 6 months ago - 1 dependent package - 4 dependent repositories - 507 downloads last month - 24 stars on GitHub - 2 maintainers
gigaleaf 0.1.6
An opinionated package for integrating Gigantum and Overleaf Projects
7 versions - Latest release: almost 4 years ago - 1 dependent repositories - 54 downloads last month - 2 maintainers
oracle-ml-insights 1.1.0
ML Observability Insights Library
2 versions - Latest release: 5 days ago - 125 downloads last month - 146 stars on GitHub - 4 maintainers
copy2hash 0.5
Copy or rename any file(s) to a hash-secured filename via terminal
1 version - Latest release: almost 4 years ago - 1 dependent repositories - 13 downloads last month - 2 stars on GitHub - 2 maintainers
pywarm 0.4.1
A cleaner way to build neural networks for PyTorch.
5 versions - Latest release: over 4 years ago - 2 dependent repositories - 47 downloads last month - 185 stars on GitHub - 2 maintainers
atlantic 1.1.25
Atlantic is an automated preprocessing framework for Supervised Machine Learning
39 versions - Latest release: 3 months ago - 1 dependent package - 141 downloads last month - 10 stars on GitHub - 1 maintainer
longtermbiosignals 2.0.2
Python library for easy managing and processing of large Long-Term Biosignals.
4 versions - Latest release: about 2 months ago - 18 downloads last month - 4 stars on GitHub - 1 maintainer
pretzelai 0.2.1
AI code completion
5 versions - Latest release: about 16 hours ago - 231 downloads last month - 1,300 stars on GitHub - 1 maintainer
explainy 0.2.9
explainy is a library for generating explanations for machine learning models in Python. It uses ...
17 versions - Latest release: about 1 month ago - 1 dependent repositories - 124 downloads last month - 16 stars on GitHub - 2 maintainers
sdesk 0.2.3
ScienceDesk helper library
6 versions - Latest release: over 3 years ago - 1 dependent repositories - 37 downloads last month - 1 stars on GitHub - 2 maintainers
Related Keywords
python 1,525 machine-learning 1,197 data-analysis 373 deep-learning 370 mlops 325 data-engineering 322 data 305 pandas 267 data-visualization 252 etl 242 analytics 215 data-pipelines 209 workflow 199 pytorch 190 orchestration 171 statistics 167 ai 162 data-integration 155 scikit-learn 152 visualization 150 scheduler 149 data-orchestrator 147 python3 143 elt 142 artificial-intelligence 139 ml 136 automation 126 jupyter 123 pipeline 119 hacktoberfest 116 machine learning 115 automl 114 apache 113 natural-language-processing 106 tensorflow 106 data-mining 105 airflow 104 dag 103 time-series 100 database 98 workflow-engine 97 data science 93 sql 92 hyperparameter-optimization 91 apache-airflow 90 nlp 90 snowflake 86 workflow-orchestration 85 feature-engineering 83 metadata 78 computer-vision 77 jupyter-notebook 76 data-lineage 72 workflow-automation 71 numpy 70 kubernetes 68 data-structures 67 dagster 66 integration 65 data-analytics 64 dataops 64 automated-machine-learning 64 forecasting 64 neural-network 63 airflow-provider 63 matplotlib 62 dataframe 58 big-data 57 reinforcement-learning 57 data-quality 57 neural-networks 56 trino 54 tabular-data 54 keras 50 warehouse 49 machinelearning 49 data-warehouse 49 classification 49 spark 48 pipelines 47 data-engineer 47 data-engineering-pipeline 47 learning 46 science 46 eda 45 machine 44 hyperparameter-tuning 44 optimization 41 notebook 40 exploratory-data-analysis 40 dataset 40 flask 39 aws 38 developer-tools 38 datascience 38 distributed 38 pandas-dataframe 37 ensemble-learning 37 regression 37 plotting 36