Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "data science" keyword

Top 1.0% on pypi.org
yellowbrick 1.2.1 ๐Ÿ’ฐ
A suite of visual analysis and diagnostic tools for machine learning.
24 versions - Latest release: over 3 years ago - 24 dependent packages - 1,085 dependent repositories - 564 thousand downloads last month - 4,194 stars on GitHub - 3 maintainers
Top 1.0% on pypi.org
kedro 0.19.5
Kedro helps you build production-ready data and analytics pipelines
50 versions - Latest release: 28 days ago - 39 dependent packages - 402 dependent repositories - 501 thousand downloads last month - 9,337 stars on GitHub - 1 maintainer
Top 4.8% on pypi.org
tsdownsample 0.1.4 ๐Ÿ’ฐ
Time series downsampling in rust
17 versions - Latest release: over 1 year ago - 3 dependent packages - 39 dependent repositories - 406 thousand downloads last month - 122 stars on GitHub - 2 maintainers
Top 1.0% on pypi.org
missingno 0.5.2
Missing data visualization module for Python.
26 versions - Latest release: about 1 year ago - 32 dependent packages - 1,920 dependent repositories - 266 thousand downloads last month - 3,816 stars on GitHub - 1 maintainer
Top 1.7% on pypi.org
pyldavis 3.4.1
Interactive topic model visualization. Port of the R package.
26 versions - Latest release: about 1 year ago - 10 dependent packages - 134 dependent repositories - 159 thousand downloads last month - 1,756 stars on GitHub - 2 maintainers
Top 1.5% on pypi.org
scikit-learn-intelex 2024.4.0
Intel(R) Extension for Scikit-learn is a seamless way to speed up your Scikit-learn application.
27 versions - Latest release: 7 days ago - 18 dependent packages - 615 dependent repositories - 120 thousand downloads last month - 1,152 stars on GitHub - 2 maintainers
Top 1.9% on pypi.org
daal4py 2024.4.0
daal4py is a Convenient Python API to the Intelยฎ oneAPI Data Analytics Library (oneDAL)
29 versions - Latest release: 7 days ago - 2 dependent packages - 433 dependent repositories - 120 thousand downloads last month - 1,152 stars on GitHub - 3 maintainers
Top 2.5% on pypi.org
kedro-viz 9.0.0
Kedro-Viz helps visualise Kedro data and analytics pipelines
71 versions - Latest release: about 1 month ago - 4 dependent packages - 131 dependent repositories - 89.5 thousand downloads last month - 635 stars on GitHub - 1 maintainer
Top 1.7% on pypi.org
scikit-posthocs 0.9.0
Statistical post-hoc analysis and outlier detection algorithms
23 versions - Latest release: 3 months ago - 16 dependent packages - 79 dependent repositories - 82.5 thousand downloads last month - 309 stars on GitHub - 1 maintainer
Top 1.0% on pypi.org
featuretools 1.31.0
a framework for automated feature engineering
105 versions - Latest release: 6 days ago - 23 dependent packages - 286 dependent repositories - 65.7 thousand downloads last month - 7,062 stars on GitHub - 8 maintainers
Top 3.5% on pypi.org
woodwork 0.31.0
a data typing library for machine learning
61 versions - Latest release: 6 days ago - 11 dependent packages - 33 dependent repositories - 61.2 thousand downloads last month - 139 stars on GitHub - 7 maintainers
Top 2.3% on pypi.org
geoplot 0.5.1
High-level geospatial plotting for Python.
20 versions - Latest release: about 2 years ago - 3 dependent packages - 100 dependent repositories - 48 thousand downloads last month - 1,118 stars on GitHub - 1 maintainer
Top 2.8% on pypi.org
dataprep 0.4.5
Dataprep: Data Preparation in Python
33 versions - Latest release: almost 2 years ago - 1 dependent package - 55 dependent repositories - 43.6 thousand downloads last month - 1,902 stars on GitHub - 4 maintainers
Top 1.2% on pypi.org
tpot 0.12.2
Tree-based Pipeline Optimization Tool
62 versions - Latest release: 3 months ago - 4 dependent packages - 227 dependent repositories - 42.4 thousand downloads last month - 9,521 stars on GitHub - 3 maintainers
Top 8.6% on pypi.org
pypots 0.4.1 ๐Ÿ’ฐ
A Python Toolbox for Data Mining on Partially-Observed Time Series
26 versions - Latest release: about 1 month ago - 1 dependent package - 1 dependent repositories - 29.9 thousand downloads last month - 711 stars on GitHub - 1 maintainer
Top 4.5% on pypi.org
nlpcloud 1.1.46
Python client for the NLP Cloud API
42 versions - Latest release: 3 months ago - 28 dependent packages - 574 dependent repositories - 19.9 thousand downloads last month - 66 stars on GitHub - 1 maintainer
Top 1.4% on pypi.org
pyvespa 0.40.0
Python API for vespa.ai
62 versions - Latest release: about 2 months ago - 24 dependent packages - 434 dependent repositories - 19.6 thousand downloads last month - 69 stars on GitHub - 3 maintainers
Top 4.4% on pypi.org
kedro-mlflow 0.12.2
A kedro-plugin to use mlflow in your kedro projects
32 versions - Latest release: about 1 month ago - 3 dependent packages - 21 dependent repositories - 15.9 thousand downloads last month - 189 stars on GitHub - 1 maintainer
Top 6.1% on pypi.org
nlp-primitives 2.12.0
natural language processing primitives for Featuretools
26 versions - Latest release: 3 months ago - 5 dependent packages - 11 dependent repositories - 12.3 thousand downloads last month - 36 stars on GitHub - 8 maintainers
Top 9.9% on pypi.org
xlcalculator 0.5.0
Converts MS Excel formulas to Python and evaluates them.
28 versions - Latest release: over 1 year ago - 1 dependent repositories - 12.1 thousand downloads last month - 105 stars on GitHub - 2 maintainers
Top 3.7% on pypi.org
composeml 0.10.1
a framework for automated prediction engineering
21 versions - Latest release: over 1 year ago - 4 dependent packages - 11 dependent repositories - 10.2 thousand downloads last month - 467 stars on GitHub - 9 maintainers
outerbounds 0.3.66
More Data Science, Less Administration
111 versions - Latest release: 2 days ago - 1 dependent package - 1 dependent repositories - 9.51 thousand downloads last month - 1 maintainer
Top 9.5% on pypi.org
upgini 1.1.286 ๐Ÿ’ฐ
Intelligent data search & enrichment for Machine Learning
706 versions - Latest release: 3 days ago - 1 dependent repositories - 9.19 thousand downloads last month - 290 stars on GitHub - 3 maintainers
Top 2.8% on pypi.org
pmlb 1.0.1
A Python wrapper for the Penn Machine Learning Benchmark data repository.
9 versions - Latest release: over 3 years ago - 9 dependent packages - 28 dependent repositories - 9.18 thousand downloads last month - 783 stars on GitHub - 3 maintainers
Top 9.3% on pypi.org
pigeonxt-jupyter 0.7.3
Quickly annotate data in Jupyter notebooks.
11 versions - Latest release: over 1 year ago - 1 dependent repositories - 6.82 thousand downloads last month - 263 stars on GitHub - 1 maintainer
Top 3.3% on pypi.org
evalml 0.83.0
an AutoML library that builds, optimizes, and evaluates machine learning pipelines using domain-s...
86 versions - Latest release: 4 months ago - 4 dependent packages - 12 dependent repositories - 6.77 thousand downloads last month - 705 stars on GitHub - 5 maintainers
Top 2.8% on pypi.org
pycwt 0.4.0b0
Continuous wavelet transform module for Python.
20 versions - Latest release: about 1 year ago - 10 dependent packages - 97 dependent repositories - 5.13 thousand downloads last month - 271 stars on GitHub - 1 maintainer
Top 5.3% on pypi.org
circle-fit 0.2.1
A Circle Fitting Library for Python
5 versions - Latest release: about 1 year ago - 3 dependent packages - 72 dependent repositories - 5.06 thousand downloads last month - 53 stars on GitHub - 1 maintainer
Top 4.8% on pypi.org
caer 2.0.8 ๐Ÿ’ฐ
A lightweight Computer Vision library for high-performance AI research - Modern Computer Vision o...
114 versions - Latest release: over 2 years ago - 29 dependent repositories - 4.9 thousand downloads last month - 749 stars on GitHub - 1 maintainer
Top 3.1% on pypi.org
skrebate 0.3.4
Relief-based feature selection algorithms
13 versions - Latest release: about 7 years ago - 8 dependent packages - 51 dependent repositories - 4.43 thousand downloads last month - 394 stars on GitHub - 4 maintainers
modelaapi 0.6.262
A suite of automatic machine learning for kubernetes
870 versions - Latest release: 6 months ago - 1 dependent package - 1 dependent repositories - 3.93 thousand downloads last month - 6 stars on GitHub - 1 maintainer
Top 6.3% on pypi.org
yabox 1.1.0
Yet another black-box optimization library for Python
6 versions - Latest release: almost 4 years ago - 1 dependent package - 7 dependent repositories - 3.69 thousand downloads last month - 132 stars on GitHub - 1 maintainer
Top 4.4% on pypi.org
mljar-supervised 1.1.7
Automated Machine Learning for Humans
90 versions - Latest release: about 1 month ago - 11 dependent repositories - 3.48 thousand downloads last month - 2,923 stars on GitHub - 1 maintainer
Top 7.4% on pypi.org
xedro 0.17.6
Kedro helps you build production-ready data and analytics pipelines
1 version - Latest release: over 2 years ago - 1 dependent repositories - 2.95 thousand downloads last month - 9,337 stars on GitHub - 1 maintainer
Top 8.8% on pypi.org
sqlalchemy_exasol 5.0.0
EXASOL dialect for SQLAlchemy
49 versions - Latest release: 4 months ago - 2 dependent repositories - 2.61 thousand downloads last month - 35 stars on GitHub - 1 maintainer
pytimetk 0.4.0
The time series toolkit for Python.
5 versions - Latest release: 2 months ago - 2.41 thousand downloads last month - 609 stars on GitHub - 1 maintainer
Top 9.0% on pypi.org
renumics-spotlight 1.6.8
Visualize and maintain datasets to develop and understand data-driven algorithms.
53 versions - Latest release: 2 months ago - 4 dependent packages - 1 dependent repositories - 2.34 thousand downloads last month - 1,016 stars on GitHub - 1 maintainer
Top 9.2% on pypi.org
pythresh 0.3.6
A Python Toolbox for Outlier Detection Thresholding
27 versions - Latest release: 4 months ago - 3 dependent repositories - 2.05 thousand downloads last month - 113 stars on GitHub - 1 maintainer
premium-primitives 0.1.0
4 versions - Latest release: 5 days ago - 1 dependent package - 1.82 thousand downloads last month - 4 stars on GitHub - 3 maintainers
joinem 0.1.5
CLI for fast, flexbile concatenation of tabular data using polars.
2 versions - Latest release: 3 months ago - 1.44 thousand downloads last month - 3 stars on GitHub - 1 maintainer
Top 6.2% on pypi.org
auto_ml 2.9.10
Automated machine learning for production and analytics
78 versions - Latest release: about 6 years ago - 3 dependent repositories - 1.44 thousand downloads last month - 1,635 stars on GitHub - 1 maintainer
dataidea 0.2.7
Learn Programming For Data Science
14 versions - Latest release: 15 days ago - 1.42 thousand downloads last month - 0 stars on GitHub - 1 maintainer
opentimspy 1.0.15
opentimspy: An open-source parser of Bruker Tims Data File (.tdf).
17 versions - Latest release: about 1 year ago - 1 dependent package - 1 dependent repositories - 1.31 thousand downloads last month - 1 maintainer
Top 4.2% on pypi.org
baytune 0.5.0
Bayesian Tuning and Bandits
29 versions - Latest release: 10 months ago - 4 dependent packages - 35 dependent repositories - 1.23 thousand downloads last month - 169 stars on GitHub - 4 maintainers
rnanorm 2.1.0
Common RNA-seq normalization methods
14 versions - Latest release: 7 months ago - 1 dependent repositories - 1.11 thousand downloads last month - 52 stars on GitHub - 3 maintainers
maadstml 3.48
Multi-Agent Accelerator for Data Science (MAADS): Transactional Machine Learning
122 versions - Latest release: about 1 month ago - 1 dependent repositories - 1.1 thousand downloads last month - 12 stars on GitHub - 1 maintainer
Top 6.2% on pypi.org
lexicalrichness 0.5.1
A small module to compute textual lexical richness (aka lexical diversity).
15 versions - Latest release: 9 months ago - 5 dependent packages - 7 dependent repositories - 987 downloads last month - 75 stars on GitHub - 1 maintainer
wolta 0.2.3
Data Science Library
23 versions - Latest release: 3 days ago - 919 downloads last month - 1 maintainer
import-a 0.0.20
A folder of functions and classes that are easy to import
20 versions - Latest release: about 2 months ago - 1 dependent package - 906 downloads last month - 1 maintainer
np-mlp 0.1.0
Light-weight implementation of a MLP library using only Numpy
3 versions - Latest release: over 3 years ago - 1 dependent repositories - 887 downloads last month - 3 stars on GitHub - 1 maintainer
movie-barcodes 0.2.0
Compress every frame of a movie in a single color barcode.Transform entire movies into stunning s...
7 versions - Latest release: 9 days ago - 876 downloads last month - 1 stars on GitHub - 1 maintainer
Top 7.7% on pypi.org
kangas 2.4.9
Tool for exploring columnar data, including multimedia
94 versions - Latest release: 25 days ago - 3 dependent repositories - 827 downloads last month - 1,028 stars on GitHub - 1 maintainer
meerkatio 1.20
Personal push notification and debug tool for multi-tasking software developers
20 versions - Latest release: 3 days ago - 804 downloads last month - 1 maintainer
Top 5.6% on pypi.org
automl 2.9.9
Automated machine learning for production and analytics
19 versions - Latest release: over 6 years ago - 9 dependent repositories - 785 downloads last month - 1,635 stars on GitHub - 1 maintainer
betaboost 0.0.5
BetaBoosting: gradient boosting with a beta function.
4 versions - Latest release: over 2 years ago - 1 dependent repositories - 747 downloads last month - 13 stars on GitHub - 1 maintainer
relieff 0.1.2
ReliefF feature selection algorithms
3 versions - Latest release: about 8 years ago - 1 dependent repositories - 710 downloads last month - 3 maintainers
maads 5.2.3
Multi-Agent Accelerator for Data Science (MAADS)
110 versions - Latest release: 10 months ago - 1 dependent repositories - 649 downloads last month - 4 stars on GitHub - 1 maintainer
ai-project-setup 0.2.7
A versatile utility package designed to streamline the setup of AI project structures while seaml...
9 versions - Latest release: 2 days ago - 636 downloads last month - 0 stars on GitHub - 1 maintainer
dsplus 0.5.1
Helper functions for data science applications.
33 versions - Latest release: 10 days ago - 1 dependent package - 624 downloads last month - 1 maintainer
plynx 1.11.1
ML platform
57 versions - Latest release: 12 months ago - 1 dependent repositories - 454 downloads last month - 289 stars on GitHub - 1 maintainer
eanalytics-api-py 0.1.49
Locally download a datamining dataset from the Eulerian Technologies API
64 versions - Latest release: almost 3 years ago - 1 dependent repositories - 449 downloads last month - 1 stars on GitHub - 1 maintainer
Top 9.5% on pypi.org
augmenty 1.4.4
An augmentation library based on SpaCy for joint augmentation of text and labels.
33 versions - Latest release: 2 months ago - 4 dependent packages - 1 dependent repositories - 432 downloads last month - 147 stars on GitHub - 1 maintainer
vandal 4.0.9 ๐Ÿ’ฐ
Data science, Data manipulation and Machine learning library.
126 versions - Latest release: about 1 year ago - 2 dependent repositories - 418 downloads last month - 8 stars on GitHub - 1 maintainer
tsforecasting 1.4.0
TSForecasting is an Automated Time Series Forecasting Framework
23 versions - Latest release: 5 days ago - 410 downloads last month - 24 stars on GitHub - 1 maintainer
Top 7.6% on pypi.org
bloxs 1.0.2
Display data in an attractive way
6 versions - Latest release: almost 2 years ago - 1 dependent package - 4 dependent repositories - 408 downloads last month - 213 stars on GitHub - 1 maintainer
tmplot 0.1.2
Visualization of Topic Modeling Results
13 versions - Latest release: 4 months ago - 1 dependent package - 1 dependent repositories - 400 downloads last month - 19 stars on GitHub - 1 maintainer
outflow 0.7.0
Outflow is a framework that helps you create and execute sequential, parallel as well as distribu...
23 versions - Latest release: over 1 year ago - 1 dependent repositories - 398 downloads last month - 1 stars on GitLab.com - 4 maintainers
Top 8.6% on pypi.org
pipelinex 0.7.9
PipelineX: Python package to build ML pipelines for experimentation with Kedro, MLflow, and more
48 versions - Latest release: 6 months ago - 3 dependent repositories - 397 downloads last month - 220 stars on GitHub - 1 maintainer
pandas-wizard 1.1.0.dev0
Utility Functions, Wrappers for pandas Module
4 versions - Latest release: 30 days ago - 388 downloads last month - 1 maintainer
mercapy 0.0.0
A Mercadona SDK for Python to track product prices, amounts, and more.
4 versions - Latest release: 3 days ago - 344 downloads last month - 1 stars on GitHub - 1 maintainer
sklearn-gbmi 1.0.4
Compute Friedman and Popescu's H statistics, in order to look for interactions among variables in...
6 versions - Latest release: about 1 year ago - 1 dependent package - 1 dependent repositories - 333 downloads last month - 26 stars on GitHub - 1 maintainer
dsutils-ms 1.10
My Data Science Utils
11 versions - Latest release: 10 months ago - 321 downloads last month - 1 maintainer
niaarm 0.3.9
A minimalistic framework for numerical association rule mining
21 versions - Latest release: about 1 month ago - 2 dependent packages - 1 dependent repositories - 317 downloads last month - 14 stars on GitHub - 1 maintainer
Top 6.6% on pypi.org
pyitlib 0.2.3
A library of information-theoretic methods
8 versions - Latest release: over 1 year ago - 8 dependent packages - 23 dependent repositories - 315 downloads last month - 86 stars on GitHub - 1 maintainer
hillfit 0.1.7
Model for fitting data with the Hill equation, and exporting the contents
18 versions - Latest release: over 1 year ago - 1 dependent repositories - 311 downloads last month - 10 stars on GitHub - 2 maintainers
dsmanager 1.2.11
Data Science tools to ease access and use of data and models
54 versions - Latest release: about 1 year ago - 307 downloads last month - 0 stars on GitLab.com - 1 maintainer
Top 10.0% on pypi.org
deon 0.3.0
Deon adds an ethics checklist to your data science projects.
7 versions - Latest release: over 3 years ago - 4 dependent repositories - 306 downloads last month - 274 stars on GitHub - 1 maintainer
sk-transformers 0.11.0
A collection of various pandas & scikit-learn compatible transformers for all kinds of preprocess...
25 versions - Latest release: about 1 year ago - 1 dependent repositories - 298 downloads last month - 8 stars on GitHub - 1 maintainer
redflag 0.5.0
Safety net for machine learning pipelines.
30 versions - Latest release: 28 days ago - 1 dependent repositories - 297 downloads last month - 19 stars on GitHub - 2 maintainers
nlup 0.8
('Core libraries for natural language processing',)
4 versions - Latest release: over 5 years ago - 11 dependent repositories - 293 downloads last month - 10 stars on GitHub - 3 maintainers
alteryx-open-src-update-checker 3.1.0
an update checker for alteryx open source libraries
5 versions - Latest release: about 1 year ago - 4 dependent packages - 1 dependent repositories - 292 downloads last month - 3 stars on GitHub - 5 maintainers
aiscalator 0.1.18
AIscalate your Jupyter Notebook Prototypes into Airflow Data Products
22 versions - Latest release: almost 4 years ago - 277 downloads last month - 5 stars on GitHub - 1 maintainer
oxdc-scidb 0.2b5
A simple scientific database.
31 versions - Latest release: almost 4 years ago - 1 dependent repositories - 276 downloads last month - 0 stars on GitHub - 1 maintainer
tdprepview 1.4.1
Python Package that creates Data Preparation Pipeline in Teradata-SQL in Views
16 versions - Latest release: 27 days ago - 275 downloads last month - 1 maintainer
Top 9.1% on pypi.org
serenata-toolbox 15.1.6
Toolbox for Serenata de Amor project
35 versions - Latest release: over 4 years ago - 11 dependent repositories - 274 downloads last month - 155 stars on GitHub - 3 maintainers
astromodule 0.5.19
Astronomy Tools
46 versions - Latest release: 2 months ago - 273 downloads last month - 0 stars on GitHub - 1 maintainer
madvisor 0.3.6
An automated AI/ML solution from Marlabs
15 versions - Latest release: almost 3 years ago - 1 dependent repositories - 270 downloads last month - 1 maintainer
pydataanalysis 0.0.4 removed
Data Analysis and Visualization Functions
4 versions - Latest release: about 1 year ago - 263 downloads last month - 1 maintainer
pycomex 0.10.2
Python Computational Experiments
33 versions - Latest release: 6 months ago - 4 dependent packages - 1 dependent repositories - 260 downloads last month - 1 maintainer
tpot2 0.1.6a1
Tree-based Pipeline Optimization Tool
7 versions - Latest release: 3 days ago - 260 downloads last month - 151 stars on GitHub - 1 maintainer
pennsieve2 0.1.2
Pennsieve Python Client
7 versions - Latest release: over 1 year ago - 1 dependent package - 1 dependent repositories - 251 downloads last month - 0 stars on GitHub - 1 maintainer
django-flow-forge 0.3.7
Eliminate unnecessary system complexity in Data Ops and in Machine Learning Ops (MLOps) with this...
28 versions - Latest release: 4 days ago - 251 downloads last month - 1 stars on GitHub - 1 maintainer
h2o-autodoc 1.1.0
Create a machine learning model documentation
6 versions - Latest release: 7 months ago - 1 dependent repositories - 238 downloads last month - 1 maintainer
azulejo 0.10.1
tile phylogenetic space with subtrees
26 versions - Latest release: over 3 years ago - 1 dependent repositories - 237 downloads last month - 0 stars on GitHub - 1 maintainer
covsirphy 3.1.1 ๐Ÿ’ฐ
COVID-19 data analysis with phase-dependent SIR-derived ODE models
59 versions - Latest release: 4 months ago - 1 dependent repositories - 235 downloads last month - 101 stars on GitHub - 1 maintainer
differential-evolution 1.12.0
Differential Evolution Algorithm with OpenMDAO Driver
22 versions - Latest release: over 4 years ago - 1 dependent repositories - 235 downloads last month - 8 stars on GitHub - 1 maintainer
sritpot 1.2.0
SRI Fork of Tree-based Pipeline Optimization Tool
27 versions - Latest release: about 5 years ago - 1 dependent repositories - 233 downloads last month - 1 stars on GitHub - 1 maintainer
autoembedder 0.2.5
PyTorch autoencoder with additional embeddings layer for categorical data.
23 versions - Latest release: over 1 year ago - 225 downloads last month - 8 stars on GitHub - 1 maintainer
maadsbml 1.3.12
Multi-Agent Accelerator for Data Science (MAADS) Batch AutoML (MAADSBML)
12 versions - Latest release: about 1 month ago - 225 downloads last month - 4 stars on GitHub - 1 maintainer
mlimputer 1.0.66
MLimputer - Missing Data Imputation Framework for Supervised Machine Learning
16 versions - Latest release: 12 days ago - 207 downloads last month - 5 stars on GitHub - 1 maintainer
Related Keywords
machine learning 203 python 122 data-science 94 machine-learning 71 data analysis 63 data 50 pandas 42 statistics 32 artificial intelligence 32 scikit-learn 29 automl 27 deep learning 23 data visualization 21 data engineering 21 data-visualization 20 data-analysis 20 visualization 20 feature engineering 19 ai 19 analytics 19 data mining 18 classification 17 science 17 pipelines 16 automated machine learning 15 pipeline 14 data analytics 14 matplotlib 14 analysis 13 feature-engineering 13 numpy 13 data preprocessing 12 timeseries 12 preprocessing 12 sklearn 12 regression 12 database 12 optimization 11 python3 11 natural language processing 11 feature selection 11 AI 11 data cleaning 11 time series 11 mlops 11 pipeline optimization 10 lightgbm 10 automated-machine-learning 10 data pipelines 10 deep-learning 10 automation 10 genetic programming 9 evolutionary computation 9 hyperparameter optimization 9 workflow 9 bootcamp 9 data exploration 8 hyperparameter-optimization 8 data manipulation 8 data processing 8 bioinformatics 8 xgboost 8 jupyter 8 datasets 8 research 8 gradient boosting 7 NLP 7 model-selection 7 data-mining 7 nlp 7 predictive modeling 7 big data 7 notebook 6 multi-agent 6 dataframes 6 dataframe 6 production ready 6 kedro 6 api 6 math 6 tensorflow 6 gradient-boosting 5 feature extraction 5 production 5 ensembling 5 seaborn 5 topsis 5 feature-selection 5 feature importance 5 algorithms 5 neural network 5 sql 5 regressor 5 regressors 5 classifiers 5 classifier 5 estimators 5 predictors 5 XGBoost 5 forecasting 5