An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "data science" keyword

View the packages on the pypi.org package registry that are tagged with the "data science" keyword.

edaexcelreport 0.1.9
A Python package for generating detailed EDA reports in Excel format with structured insights and...
10 versions - Latest release: about 1 month ago - 331 downloads last month - 2 stars on GitHub - 1 maintainer
Top 1.7% on pypi.org
pyldavis 3.4.1
Interactive topic model visualization. Port of the R package.
26 versions - Latest release: almost 2 years ago - 10 dependent packages - 134 dependent repositories - 144 thousand downloads last month - 1,756 stars on GitHub - 2 maintainers
Top 1.9% on pypi.org
daal4py 2024.7.0
daal4py is a Convenient Python API to the Intel® oneAPI Data Analytics Library (oneDAL)
32 versions - Latest release: 7 months ago - 2 dependent packages - 433 dependent repositories - 66.4 thousand downloads last month - 1,216 stars on GitHub - 3 maintainers
battery-data-toolkit 0.4.4
Utilities for reading and manipulating battery testing data
13 versions - Latest release: about 1 month ago - 1 dependent repositories - 855 downloads last month - 39 stars on GitHub - 1 maintainer
splitbbox 0.0.1
Split overlapping bounding boxes in Python
1 version - Latest release: almost 5 years ago - 1 dependent repositories - 56 downloads last month - 1 stars on GitHub - 1 maintainer
transtab 0.0.5
A flexible tabular prediction model that handles variable-column input tables.
5 versions - Latest release: almost 2 years ago - 1 dependent package - 1 dependent repositories - 405 downloads last month - 185 stars on GitHub - 1 maintainer
django-flow-forge 0.9.9
Keep Data Ops and Machine Learning Ops (MLOps) simple and vendor agnostic with this Django module...
58 versions - Latest release: 4 months ago - 2.34 thousand downloads last month - 1 stars on GitHub - 1 maintainer
scitbx 0.0.92
For academic data processing and plotting etc.
92 versions - Latest release: about 6 hours ago - 1 dependent repositories - 1.87 thousand downloads last month - 1 stars on GitHub - 1 maintainer
yellowbrick-datasets 1.0
Yellowbrick datasets management and deployment scripts.
1 version - Latest release: over 6 years ago - 2 dependent repositories - 60 downloads last month - 5 stars on GitHub - 1 maintainer
mercapy 1.0.3
A Mercadona interface for Python to track product prices, amounts, and more.
10 versions - Latest release: 11 months ago - 265 downloads last month - 2 stars on GitHub - 1 maintainer
bertrand 0.0.1
(in development) Type-safe language bindings for Python/C++
1 version - Latest release: 11 months ago - 55 downloads last month - 2 stars on GitHub - 1 maintainer
scieco 0.0.0
Ecology
1 version - Latest release: about 9 hours ago - 1 maintainer
temul-toolkit 0.1.7
Functions for analysis of high resolution electron microscopy and spectroscopy data.
9 versions - Latest release: about 3 years ago - 1 dependent repositories - 430 downloads last month - 1 maintainer
Top 1.0% on pypi.org
kedro 0.19.12
Kedro helps you build production-ready data and analytics pipelines
57 versions - Latest release: about 1 month ago - 39 dependent packages - 402 dependent repositories - 536 thousand downloads last month - 9,932 stars on GitHub - 3 maintainers
Top 1.7% on pypi.org
scikit-posthocs 0.11.4
Statistical post-hoc analysis and outlier detection algorithms
30 versions - Latest release: 21 days ago - 16 dependent packages - 79 dependent repositories - 92.4 thousand downloads last month - 309 stars on GitHub - 1 maintainer
metrdsutil 0.1.9
Common exploratory data functionality for METR
2 versions - Latest release: over 5 years ago - 1 dependent repositories - 58 downloads last month - 1 maintainer
covsirphy 3.1.2 💰
COVID-19 data analysis with phase-dependent SIR-derived ODE models
60 versions - Latest release: 8 months ago - 1 dependent repositories - 1.68 thousand downloads last month - 110 stars on GitHub - 1 maintainer
dataramp 0.3.5 💰
A Data science library for data science / data analysis teams
34 versions - Latest release: about 2 months ago - 660 downloads last month - 5,099 stars on GitHub - 1 maintainer
dsplus 0.9.4
Helper functions for data science applications
75 versions - Latest release: about 13 hours ago - 1 dependent package - 3.17 thousand downloads last month - 1 maintainer
Top 1.0% on pypi.org
featuretools 1.31.0
a framework for automated feature engineering
105 versions - Latest release: 11 months ago - 23 dependent packages - 286 dependent repositories - 86.5 thousand downloads last month - 7,148 stars on GitHub - 8 maintainers
Top 9.9% on pypi.org
xlcalculator 0.5.0
Converts MS Excel formulas to Python and evaluates them.
28 versions - Latest release: about 2 years ago - 1 dependent repositories - 12.1 thousand downloads last month - 128 stars on GitHub - 2 maintainers
statistical-iv 0.3.1
Statistical IV: Statistical Hypothesis Testing for the Information Value (IV). Evaluation of the ...
13 versions - Latest release: over 1 year ago - 601 downloads last month - 11 stars on GitHub - 1 maintainer
rushdb 0.3.0
RushDB Python SDK
3 versions - Latest release: 2 months ago - 132 downloads last month - 1 maintainer
madvisor 0.3.6
An automated AI/ML solution from Marlabs
15 versions - Latest release: almost 4 years ago - 1 dependent repositories - 1.38 thousand downloads last month - 1 maintainer
advanced-value-counts 1.0.0
A package to perform an advanced version of pandas' value_counts()
2 versions - Latest release: over 2 years ago - 125 downloads last month - 3 stars on GitHub - 1 maintainer
Top 1.5% on pypi.org
scikit-learn-intelex 2025.4.0
Intel(R) Extension for Scikit-learn is a seamless way to speed up your Scikit-learn application.
35 versions - Latest release: 29 days ago - 18 dependent packages - 615 dependent repositories - 85.2 thousand downloads last month - 1,265 stars on GitHub - 2 maintainers
framework3 1.0.12
A flexible framework for machine learning pipelines
12 versions - Latest release: about 18 hours ago - 365 downloads last month - 0 stars on GitHub - 1 maintainer
Top 3.3% on pypi.org
evalml 0.84.0
an AutoML library that builds, optimizes, and evaluates machine learning pipelines using domain-s...
87 versions - Latest release: 11 months ago - 4 dependent packages - 12 dependent repositories - 5.71 thousand downloads last month - 805 stars on GitHub - 8 maintainers
eh-evalml 0.0.0
an AutoML library that builds, optimizes, and evaluates machine learning pipelines using domain-s...
1 version - Latest release: 4 months ago - 57 downloads last month - 805 stars on GitHub - 1 maintainer
evalml-automining 0.0.4
an AutoML library that builds, optimizes, and evaluates machine learning pipelines using domain-s...
4 versions - Latest release: 2 months ago - 148 downloads last month - 805 stars on GitHub - 1 maintainer
bigleaf 0.0.1
R bigleaf module in Python
1 version - Latest release: about 19 hours ago - 1 maintainer
Top 1.2% on pypi.org
tpot 1.0.0
Tree-based Pipeline Optimization Tool
63 versions - Latest release: about 2 months ago - 4 dependent packages - 227 dependent repositories - 49.9 thousand downloads last month - 9,876 stars on GitHub - 2 maintainers
morpho-toolkit 0.1.0.dev0
The MORPHO data-science toolkit.
1 version - Latest release: about 20 hours ago - 0 stars on GitHub
aiscalator 0.1.18
AIscalate your Jupyter Notebook Prototypes into Airflow Data Products
22 versions - Latest release: over 4 years ago - 928 downloads last month - 5 stars on GitHub - 1 maintainer
uflux 0.0.1
Unified FLUXes (UFLUX)
1 version - Latest release: about 1 month ago - 151 downloads last month - 0 stars on GitHub - 1 maintainer
dsmlbc5 0.0.2
Data Science Tools
2 versions - Latest release: over 3 years ago - 1 dependent repositories - 57 downloads last month - 12 stars on GitHub - 1 maintainer
easymoney 1.5.0
Data Science Tools for Monetary Information and Conversions.
2 versions - Latest release: over 8 years ago - 1 dependent repositories - 72 downloads last month - 7 stars on GitHub - 1 maintainer
pycodeml 0.0.16
Automatically train multiple regression models and return the best one.
8 versions - Latest release: 2 days ago - 843 downloads last month - 0 stars on GitHub - 1 maintainer
colda 0.0.1
Collaborative Data Analysis for All
1 version - Latest release: almost 2 years ago - 52 downloads last month - 19 stars on GitHub - 1 maintainer
Top 0.5% on pypi.org
albumentations 2.0.5 💰
Fast, flexible, and advanced augmentation library for deep learning, computer vision, and medical...
85 versions - Latest release: about 2 months ago - 198 dependent packages - 5,487 dependent repositories - 6.6 million downloads last month - 14,783 stars on GitHub - 1 maintainer
nlpurify 2.0.0a0
Text cleaning and feature extractions using NLP, Traditional approach.
4 versions - Latest release: 5 months ago - 90 downloads last month - 0 stars on GitHub - 1 maintainer
amlearn 0.3.4
Machine Learning package for amorphous materials.
13 versions - Latest release: over 5 years ago - 1 dependent repositories - 369 downloads last month - 19 stars on GitHub - 1 maintainer
Top 8.6% on pypi.org
pypots 0.8.1 💰
A Python Toolbox for Machine Learning on Partially-Observed Time Series
41 versions - Latest release: 7 months ago - 1 dependent package - 1 dependent repositories - 84.1 thousand downloads last month - 940 stars on GitHub - 1 maintainer
Top 4.2% on pypi.org
baytune 0.5.0
Bayesian Tuning and Bandits
29 versions - Latest release: over 1 year ago - 4 dependent packages - 35 dependent repositories - 1.75 thousand downloads last month - 170 stars on GitHub - 4 maintainers
pilotis-io 0.2.0
This is a lib with IO functions to use while coding Python ML projects
2 versions - Latest release: over 3 years ago - 2 dependent repositories - 84 downloads last month - 1 maintainer
cutoml 0.0.11
A lightweight automl library
15 versions - Latest release: almost 3 years ago - 1 dependent repositories - 397 downloads last month - 0 stars on GitHub - 1 maintainer
outerbounds 0.3.159
More Data Science, Less Administration
216 versions - Latest release: 1 day ago - 1 dependent package - 1 dependent repositories - 26.1 thousand downloads last month - 2 maintainers
alphabetsoup 0.5.3
tile phylogenetic space with subtrees
7 versions - Latest release: almost 5 years ago - 206 downloads last month - 0 stars on GitHub - 1 maintainer
surv-ai 0.2.0
A framework for multi-agent modeling using large language models
12 versions - Latest release: almost 2 years ago - 490 downloads last month - 105 stars on GitHub - 1 maintainer
Top 6.1% on pypi.org
nlp-primitives 2.13.0
natural language processing primitives for Featuretools
27 versions - Latest release: 11 months ago - 5 dependent packages - 11 dependent repositories - 8.3 thousand downloads last month - 38 stars on GitHub - 8 maintainers
xp 1.1
xp is a framework for building and executing computing pipelines
1 version - Latest release: about 9 years ago - 3 dependent repositories - 235 downloads last month - 56 stars on GitHub - 1 maintainer
crypticorn 2.4.7
Maximise Your Crypto Trading Profits with AI Predictions
25 versions - Latest release: 2 days ago - 2.71 thousand downloads last month - 1 maintainer
pyplotify 0.2.0
A simple class to give plots some styling. It is a very light skin over matplotlib.pyplot.
5 versions - Latest release: almost 6 years ago - 1 dependent repositories - 127 downloads last month - 3 stars on GitHub - 1 maintainer
datagif 0.1.1
Make animated gifs out of multiple data plots.
2 versions - Latest release: almost 4 years ago - 1 dependent repositories - 103 downloads last month - 2 stars on GitHub - 1 maintainer
h2o-experiment-tracking 0.0.4
Python client for H2O.ai Experiment Tracking.
4 versions - Latest release: over 2 years ago - 305 downloads last month - 1 maintainer
dsutils-ms 1.10
My Data Science Utils
11 versions - Latest release: almost 2 years ago - 638 downloads last month - 1 maintainer
data-science-common 0.1.22
UNDER CONSTRUCTION: A simple python library to facilitate analysis
12 versions - Latest release: almost 2 years ago - 1 dependent repositories - 439 downloads last month - 0 stars on GitHub - 1 maintainer
makeup 0.1.3
Make your models look pretty.
3 versions - Latest release: over 3 years ago - 2 dependent repositories - 60 downloads last month - 3 stars on GitHub - 2 maintainers
Top 1.4% on pypi.org
pyvespa 0.55.0
Python API for vespa.ai
77 versions - Latest release: about 1 month ago - 24 dependent packages - 434 dependent repositories - 39.4 thousand downloads last month - 69 stars on GitHub - 4 maintainers
generate-face 3.0
Automatically download face from www.thispersondoesnotexist.com
2 versions - Latest release: about 3 years ago - 1 dependent repositories - 55 downloads last month - 0 stars on GitHub - 1 maintainer
tdprepview 1.5.0
Python Package that creates Data Preparation Pipeline in Teradata-SQL in Views
17 versions - Latest release: 7 months ago - 482 downloads last month - 1 maintainer
data-science-kit 0.0.1
Data Science Basic Functions
1 version - Latest release: almost 4 years ago - 1 dependent repositories - 49 downloads last month - 1 stars on GitHub - 1 maintainer
langagent 3.3.9
LangAgent is a powerful multi-agent system designed to automate and streamline complex tasks, inc...
13 versions - Latest release: about 2 months ago - 304 downloads last month - 1 stars on GitHub - 1 maintainer
lykability 1.0.0
Using empythy to score likability based on sentiment analysis of recent tweets about a given person
1 version - Latest release: over 8 years ago - 1 dependent repositories - 55 downloads last month - 1 stars on GitHub - 1 maintainer
prettierplot 0.1.2
Quickly create prettier plots
9 versions - Latest release: about 5 years ago - 1 dependent repositories - 344 downloads last month - 1 stars on GitHub - 1 maintainer
ppmml 0.0.1
Python library for converting machine learning models to pmml file
1 version - Latest release: almost 7 years ago - 1 dependent repositories - 55 downloads last month - 9 stars on GitHub - 1 maintainer
random-tables 0.0.5
little module that helps to create tables for a specified schema with random content
4 versions - Latest release: almost 4 years ago - 1 dependent repositories - 193 downloads last month - 1 maintainer
jam-ai 0.2.1
Engaging with Multiple AI Agents with Jam.
12 versions - Latest release: almost 2 years ago - 434 downloads last month - 1 stars on GitHub - 1 maintainer
datasciencelab 0.1.8
Building blocks for common data science work
8 versions - Latest release: over 3 years ago - 1 dependent repositories - 230 downloads last month - 0 stars on GitHub - 1 maintainer
scikit-na 0.2.0
Missing Values Analysis for Data Science
9 versions - Latest release: 2 months ago - 1 dependent repositories - 243 downloads last month - 4 stars on GitHub - 1 maintainer
ai-assistant-manager 2.9.0
This repository provides tools and services to manage OpenAI Assistants, including creating, list...
20 versions - Latest release: 3 days ago - 1.37 thousand downloads last month - 2 stars on GitHub - 1 maintainer
Top 2.5% on pypi.org
kedro-viz 11.0.0
Kedro-Viz helps visualise Kedro data and analytics pipelines
77 versions - Latest release: about 1 month ago - 4 dependent packages - 131 dependent repositories - 405 thousand downloads last month - 707 stars on GitHub - 3 maintainers
rafm 1.1.4
rafm
10 versions - Latest release: about 3 years ago - 1 dependent repositories - 281 downloads last month - 2 stars on GitHub - 1 maintainer
dataidea 0.2.7
Learn Programming For Data Science
24 versions - Latest release: 12 months ago - 778 downloads last month - 0 stars on GitHub - 1 maintainer
pointblank 0.8.6
Find out if your data is what you think it is.
21 versions - Latest release: 3 days ago - 14 thousand downloads last month - 109 stars on GitHub - 1 maintainer
planetoids 0.1-alpha.2
Planetoids is a high level Python API for generating interactive, procedurally generated worlds f...
2 versions - Latest release: over 5 years ago - 1 dependent repositories - 57 downloads last month - 1 stars on GitHub - 1 maintainer
freq-frame 1.0.0
1 version - Latest release: almost 4 years ago - 1 dependent repositories - 34 downloads last month - 1 maintainer
suntzu 0.8.0
SunTzu is a Data Science Python Library that simplifies data tasks, empowering users with robust ...
11 versions - Latest release: 10 months ago - 412 downloads last month - 0 stars on GitHub - 1 maintainer
sri-autoflow 1.0.3
Autoflow is a tool that explores available primitives and assembles them into pipelines that disc...
4 versions - Latest release: almost 5 years ago - 1 dependent repositories - 212 downloads last month - 1 stars on gitlab.com - 1 maintainer
segmentae 1.0.27
SegmentAE: A Python Library for Anomaly Detection Optimization
9 versions - Latest release: 6 months ago - 99 downloads last month - 3 stars on GitHub - 1 maintainer
didemtaha 0.0.1
Data Science Tools
1 version - Latest release: over 3 years ago - 1 dependent repositories - 29 downloads last month - 1 maintainer
flexds 1.1
Flex is a framework for building and executing computing pipelines
1 version - Latest release: about 9 years ago - 2 dependent repositories - 35 downloads last month - 55 stars on GitHub - 1 maintainer
pipey 0.0.1a5
Declarative syntactic sugar that enables piping in python.
2 versions - Latest release: almost 5 years ago - 1 dependent package - 2 dependent repositories - 77 downloads last month - 72 stars on GitHub - 1 maintainer
tablemage 0.1.0a1
A Python package for low-code analysis of tabular data
1 version - Latest release: 2 months ago - 62 downloads last month - 9 stars on GitHub - 1 maintainer
waterflow v0.3
Dataflow package provides a data analysis pipelineframework for data transformation and machine l...
4 versions - Latest release: over 8 years ago - 1 dependent repositories - 99 downloads last month - 2 stars on GitHub - 1 maintainer
distanceclassifier 0.0.8
Distance Classifier
9 versions - Latest release: over 8 years ago - 1 dependent repositories - 245 downloads last month - 1 stars on GitHub - 1 maintainer
digen 0.0.5
DIGEN: Diverse Generative ML Benchmark
5 versions - Latest release: about 3 years ago - 2 dependent repositories - 267 downloads last month - 15 stars on GitHub - 1 maintainer
autoqtl 0.1.1a0
Automated Quantitative Trait Locus Analysis Tool
1 version - Latest release: over 1 year ago - 58 downloads last month - 8 stars on GitHub - 1 maintainer
autodatap 1.5.2
Automating Data Preprocessing
33 versions - Latest release: over 1 year ago - 1.26 thousand downloads last month - 0 stars on GitHub - 1 maintainer
dscience 0.0.1
A collection of Python snippets for the Kokel Lab
1 version - Latest release: about 5 years ago - 1 dependent repositories - 57 downloads last month - 0 stars on GitHub - 1 maintainer
pandas-excel-limitedrows 2.0.1
Pandas Extension Package used to read Excel files with limit rows
5 versions - Latest release: over 3 years ago - 1 dependent repositories - 248 downloads last month - 4 stars on GitHub - 1 maintainer
hyperband-multithreading 0.0.1
Hyperband-multithreading
1 version - Latest release: over 3 years ago - 1 dependent repositories - 55 downloads last month - 1 maintainer
caspian-ml 1.0.0
A deep learning library focused entirely around NumPy.
1 version - Latest release: 13 days ago - 145 downloads last month - 0 stars on GitHub - 1 maintainer
nsde 0.1.1
Non-dominated Sorting Differential Evolution (NSDE) Algorithm
15 versions - Latest release: about 5 years ago - 1 dependent repositories - 617 downloads last month - 0 stars on GitHub - 1 maintainer
hassans-frame 0.0.0
1 version - Latest release: almost 4 years ago - 1 dependent repositories - 31 downloads last month - 1 maintainer
arrowtextclassifier 1.0.3
ArrowTextClassifier is a simple text classification tool written in pytorch that allows you to tr...
4 versions - Latest release: 12 months ago - 197 downloads last month - 1 maintainer
maads 5.2.3
Multi-Agent Accelerator for Data Science (MAADS)
110 versions - Latest release: over 1 year ago - 1 dependent repositories - 3.84 thousand downloads last month - 4 stars on GitHub - 1 maintainer
kedro-diff 0.1.1 💰
diff commits to your kedro pipeline
2 versions - Latest release: over 3 years ago - 1 dependent repositories - 105 downloads last month - 10 stars on GitHub - 1 maintainer
atriumdb 2.4.0
Timeseries Database
9 versions - Latest release: 6 months ago - 289 downloads last month - 4 stars on GitHub - 1 maintainer
autoembedder 0.2.5
PyTorch autoencoder with additional embeddings layer for categorical data.
23 versions - Latest release: about 2 years ago - 1.04 thousand downloads last month - 8 stars on GitHub - 1 maintainer
Related Keywords
machine learning 261 python 148 data-science 114 machine-learning 91 data analysis 82 pandas 59 data 56 artificial intelligence 44 scikit-learn 37 statistics 37 deep learning 35 ai 32 automl 31 data visualization 26 feature engineering 26 data engineering 25 data mining 24 visualization 23 numpy 23 data-analysis 23 data preprocessing 22 data-visualization 20 AI 20 analytics 19 analysis 19 science 19 feature-engineering 19 classification 19 matplotlib 18 data cleaning 18 pipeline 17 data analytics 17 pipelines 17 automated machine learning 16 database 15 deep-learning 15 optimization 14 mlops 14 automation 14 preprocessing 14 research 13 time series 13 python3 13 natural language processing 13 regression 12 data processing 12 sklearn 12 timeseries 12 NLP 11 workflow 11 feature selection 11 datasets 11 data pipelines 11 data manipulation 10 xgboost 10 automated-machine-learning 10 nlp 10 data exploration 10 pytorch 10 pipeline optimization 10 data transformation 9 big data 9 bootcamp 9 lightgbm 9 predictive modeling 9 jupyter 9 clustering 9 evolutionary computation 9 genetic programming 9 model-selection 9 bioinformatics 9 hyperparameter optimization 9 data wrangling 8 hyperparameter-optimization 8 forecasting 8 api 8 ml 8 eda 8 feature-selection 8 exploratory data analysis 8 graph 7 hacktoberfest 7 Python 7 data-engineering 7 tensorflow 7 sql 7 dataframe 7 data-mining 7 neural networks 7 notebook 7 sentiment analysis 6 multi-agent 6 feature importance 6 xarray 6 data-processing 6 plot 6 seaborn 6 artificial-intelligence 6 llm 6 utilities 6