Ecosyste.ms: Packages
An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.
conda-forge.org "data-science" keyword
openrefine 3.5.0 💰
OpenRefine is a free, open source power tool for working with messy data and improving it3 versions - Latest release: over 2 years ago - 9,377 stars on GitHub
tpot-dask 0.11.7
Consider TPOT your Data Science Assistant. TPOT is a Python Automated Machine Learning tool that ...1 version - Latest release: about 3 years ago - 1 dependent package - 8,986 stars on GitHub
Top 7.3% on conda-forge.org
19 versions - Latest release: about 3 years ago - 7 dependent packages - 5 dependent repositories - 8,986 stars on GitHub
tpot 0.11.7
Consider TPOT your Data Science Assistant. TPOT is a Python Automated Machine Learning tool that ...19 versions - Latest release: about 3 years ago - 7 dependent packages - 5 dependent repositories - 8,986 stars on GitHub
tpot-full 0.11.7
Consider TPOT your Data Science Assistant. TPOT is a Python Automated Machine Learning tool that ...1 version - Latest release: about 3 years ago - 8,986 stars on GitHub
pyprocessmacro 1.0.12
A Python library for moderation, mediation and conditional process analysis.3 versions - Latest release: about 2 years ago - 67 stars on GitHub
ghostpii 1.0.11
This repository contains the Python library for interacting with Capnion's private computation AP...5 versions - Latest release: over 1 year ago - 21 stars on GitHub
girder-client 3.1.15
A data management platform for the web, developed by Kitware16 versions - Latest release: almost 2 years ago - 2 dependent packages - 402 stars on GitHub
dataprep 0.4.5
Open-source low code data preparation library in python. Collect, clean and visualization your da...8 versions - Latest release: almost 2 years ago - 4 dependent repositories - 1,571 stars on GitHub
aim 2.7.4
Aim 💫 — easy-to-use and performant open-source ML experiment tracker.3 versions - Latest release: over 2 years ago - 3,276 stars on GitHub
rpy2 3.5.6
Interface to use R from Python16 versions - Latest release: over 1 year ago - 4 dependent packages - 73 dependent repositories - 376 stars on GitHub
nlpaug 1.1.11 💰
This python library helps you with augmenting NLP for your machine learning projects. `Augmenter...7 versions - Latest release: almost 2 years ago - 3,846 stars on GitHub
sweetviz 2.1.4
Visualize and compare datasets, target values and associations, with one line of code.10 versions - Latest release: almost 2 years ago - 2 dependent repositories - 2,352 stars on GitHub
matrixprofile 1.1.10
A Python 3 library making time series data mining tasks, utilizing matrix profile algorithms, acc...1 version - Latest release: over 3 years ago - 2 dependent packages - 1 dependent repositories - 305 stars on GitHub
aimodelshare 0.0.144
23 versions - Latest release: over 1 year ago - 36 stars on GitHubd2l 0.17.6
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 400 u...2 versions - Latest release: over 1 year ago - 16,875 stars on GitHub
pyglmnet 1.1
Python implementation of elastic-net regularized generalized linear models1 version - Latest release: about 4 years ago - 258 stars on GitHub
deepgraph 0.2.3
DeepGraph is a scalable, general-purpose data analysis package. It implements a network represent...3 versions - Latest release: over 2 years ago - 272 stars on GitHub
eli5 0.13.0
ELI5 is a Python package which helps to debug machine learning classifiers and explain their pred...11 versions - Latest release: about 2 years ago - 1 dependent package - 14 dependent repositories - 2,652 stars on GitHub
dirty_cat 0.3.0
Machine learning on dirty tabular data3 versions - Latest release: over 1 year ago - 682 stars on GitHub
pyrolite 0.3.2
A set of tools for getting the most from your geochemical data.2 versions - Latest release: almost 2 years ago - 1 dependent repositories - 100 stars on GitHub
traffic 2.8.0
A toolbox for processing and analysing air traffic data7 versions - Latest release: almost 2 years ago - 1 dependent repositories - 273 stars on GitHub
kneed 0.7.0
Knee point detection in Python :chart_with_upwards_trend:8 versions - Latest release: almost 4 years ago - 1 dependent package - 9 dependent repositories - 570 stars on GitHub
dagster-mysql 1.0.17
An orchestration platform for the development, production, and observation of data assets.82 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
Top 1.6% on conda-forge.org
11 versions - Latest release: over 1 year ago - 181 dependent packages - 3,503 dependent repositories - 10,487 stars on GitHub
seaborn 0.12.1
Seaborn is a Python visualization library based on matplotlib. It provides a high-level interface...11 versions - Latest release: over 1 year ago - 181 dependent packages - 3,503 dependent repositories - 10,487 stars on GitHub
mprod-package 0.0.4a1
The mprod package provides python implementation for applying tensor-tensor (tubal) products. In ...3 versions - Latest release: almost 2 years ago - 9 stars on GitHub
dagster-snowflake 1.0.17
An orchestration platform for the development, production, and observation of data assets.108 versions - Latest release: over 1 year ago - 1 dependent package - 6,905 stars on GitHub
dagster_papertrail 0.6.5
An orchestration platform for the development, production, and observation of data assets.2 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster-papertrail 1.0.17
An orchestration platform for the development, production, and observation of data assets.114 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
mpl_sample_data 3.4.3 💰
matplotlib is a python 2D plotting library which produces publication quality figures in a variet...29 versions - Latest release: over 2 years ago - 17,056 stars on GitHub
Top 0.8% on conda-forge.org
30 versions - Latest release: over 1 year ago - 699 dependent packages - 10,108 dependent repositories - 18,105 stars on GitHub
matplotlib 3.6.2 💰
matplotlib is a python 2D plotting library which produces publication quality figures in a variet...30 versions - Latest release: over 1 year ago - 699 dependent packages - 10,108 dependent repositories - 18,105 stars on GitHub
pycompare 1.5.4
Python module for generating Bland-Altman plots8 versions - Latest release: about 2 years ago - 26 stars on GitHub
dagster-cron 0.11.15
An orchestration platform for the development, production, and observation of data assets.44 versions - Latest release: almost 3 years ago - 6,905 stars on GitHub
dagster-graphql 1.0.17
An orchestration platform for the development, production, and observation of data assets.114 versions - Latest release: over 1 year ago - 6 dependent packages - 6,905 stars on GitHub
_dvc-ssh 1.9.0
Data Version Control or DVC is an open-source tool for data science and machine learning projects.1 version - Latest release: over 3 years ago - 1 dependent package - 11,242 stars on GitHub
Top 0.1% on conda-forge.org
31 versions - Latest release: over 1 year ago - 647 dependent packages - 5,166 dependent repositories - 53,452 stars on GitHub
scikit-learn 1.1.3 💰
scikit-learn: machine learning in Python31 versions - Latest release: over 1 year ago - 647 dependent packages - 5,166 dependent repositories - 53,452 stars on GitHub
sktime-all-extras 0.12.1 💰
A unified framework for machine learning with time series13 versions - Latest release: almost 2 years ago - 2 dependent repositories - 6,633 stars on GitHub
Top 9.3% on conda-forge.org
18 versions - Latest release: over 1 year ago - 5 dependent packages - 3 dependent repositories - 6,633 stars on GitHub
sktime 0.12.1 💰
A unified framework for machine learning with time series18 versions - Latest release: over 1 year ago - 5 dependent packages - 3 dependent repositories - 6,633 stars on GitHub
dagster-datadog 1.0.17
An orchestration platform for the development, production, and observation of data assets.114 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster_dask 0.6.5
An orchestration platform for the development, production, and observation of data assets.1 version - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster-slack 1.0.17
An orchestration platform for the development, production, and observation of data assets.114 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-twilio 1.0.17
An orchestration platform for the development, production, and observation of data assets.114 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-docker 1.0.17
An orchestration platform for the development, production, and observation of data assets.86 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
typedframe 0.7.0
Typed wrappers over pandas DataFrames with schema validation1 version - Latest release: almost 2 years ago - 65 stars on GitHub
Top 10.0% on conda-forge.org
27 versions - Latest release: over 1 year ago - 3 dependent packages - 4 dependent repositories - 6,847 stars on GitHub
pyod 1.0.6 💰
A Comprehensive and Scalable Python Library for Outlier Detection (Anomaly Detection)27 versions - Latest release: over 1 year ago - 3 dependent packages - 4 dependent repositories - 6,847 stars on GitHub
Top 1.1% on conda-forge.org
22 versions - Latest release: over 1 year ago - 29 dependent packages - 327 dependent repositories - 57,664 stars on GitHub
keras 2.10.0
Deep Learning for humans22 versions - Latest release: over 1 year ago - 29 dependent packages - 327 dependent repositories - 57,664 stars on GitHub
foundry_ml 0.5.0
Simplifying the discovery and usage of machine-learning ready datasets in materials science and c...4 versions - Latest release: over 1 year ago - 52 stars on GitHub
verticapy 0.11.0
VerticaPy is a Python library that exposes sci-kit like functionality to conduct data science pro...17 versions - Latest release: over 1 year ago - 124 stars on GitHub
klib 1.0.6 💰
Easy to use Python library of customized functions for cleaning and analyzing data.30 versions - Latest release: over 1 year ago - 356 stars on GitHub
Top 0.9% on conda-forge.org
29 versions - Latest release: over 1 year ago - 1,213 dependent packages - 2,191 dependent repositories - 18,105 stars on GitHub
matplotlib-base 3.6.2 💰
matplotlib is a python 2D plotting library which produces publication quality figures in a variet...29 versions - Latest release: over 1 year ago - 1,213 dependent packages - 2,191 dependent repositories - 18,105 stars on GitHub
ptitprince 0.2.6
python version of raincloud2 versions - Latest release: over 1 year ago - 1 dependent repositories - 169 stars on GitHub
r-datacomparer 0.1.4
dataCompareR is an R package that allows users to compare two datasets and view a report on the s...2 versions - Latest release: over 2 years ago - 68 stars on GitHub
Top 1.7% on conda-forge.org
16 versions - Latest release: over 1 year ago - 130 dependent packages - 979 dependent repositories - 8,306 stars on GitHub
statsmodels 0.13.5
Statsmodels: statistical modeling and econometrics in Python16 versions - Latest release: over 1 year ago - 130 dependent packages - 979 dependent repositories - 8,306 stars on GitHub
psy-view 0.2.0
This package provides a graphical user interface to quickly visualize the contents of a netCDF file3 versions - Latest release: over 2 years ago - 2 dependent repositories - 9 stars on GitHub
spectrafit 0.12.5
SpectraFit is a command-line and Jupyter-notebook tool for quick data-fitting based on the regula...8 versions - Latest release: over 1 year ago - 2 dependent packages - 11 stars on GitHub
spectrafit-all 0.12.5
SpectraFit is a command-line and Jupyter-notebook tool for quick data-fitting based on the regula...3 versions - Latest release: over 1 year ago - 11 stars on GitHub
erroranalysis 0.3.12
**Responsible AI Toolbox**: `erroranalysis` Responsible AI is an approach to assessing, developi...8 versions - Latest release: over 1 year ago - 886 stars on GitHub
r-modeltime 1.2.4
Modeltime unlocks time series forecast models and machine learning in one framework12 versions - Latest release: over 1 year ago - 440 stars on GitHub
activitysim 1.1.3
An Open Platform for Activity-Based Travel Modeling5 versions - Latest release: over 1 year ago - 152 stars on GitHub
cleanlab 2.1.0
cleanlab is the data-centric ML ops package for machine learning with noisy labels. cleanlab clea...5 versions - Latest release: over 1 year ago - 1 dependent package - 1 dependent repositories - 5,564 stars on GitHub
r-catboost 1.1.1
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking,...49 versions - Latest release: over 1 year ago - 7,013 stars on GitHub
Top 5.3% on conda-forge.org
61 versions - Latest release: over 1 year ago - 9 dependent packages - 32 dependent repositories - 7,012 stars on GitHub
catboost 1.1.1
General purpose gradient boosting on decision trees library with categorical features support out...61 versions - Latest release: over 1 year ago - 9 dependent packages - 32 dependent repositories - 7,012 stars on GitHub
flaml 1.0.14
A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.31 versions - Latest release: over 1 year ago - 1 dependent package - 1 dependent repositories - 2,337 stars on GitHub
dagster_slack 0.6.5
An orchestration platform for the development, production, and observation of data assets.2 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster-pandera 1.0.17
An orchestration platform for the development, production, and observation of data assets.34 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster_graphql 0.6.7
An orchestration platform for the development, production, and observation of data assets.5 versions - Latest release: over 4 years ago - 2 dependent packages - 6,905 stars on GitHub
dagster-celery-k8s 1.0.17
Dagster lets you define pipelines in terms of the data flow between reusable, logical components,...98 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-mlflow 1.0.17
An orchestration platform for the development, production, and observation of data assets.70 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
Top 7.6% on conda-forge.org
119 versions - Latest release: over 1 year ago - 60 dependent packages - 2 dependent repositories - 6,905 stars on GitHub
dagster 1.0.17
Dagster lets you define pipelines in terms of the data flow between reusable, logical components,...119 versions - Latest release: over 1 year ago - 60 dependent packages - 2 dependent repositories - 6,905 stars on GitHub
dagster_aws 0.6.6
An orchestration platform for the development, production, and observation of data assets.2 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster-aws 1.0.17
An orchestration platform for the development, production, and observation of data assets.55 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-pyspark 1.0.17
An orchestration platform for the development, production, and observation of data assets.112 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-duckdb 1.0.17
An orchestration platform for the development, production, and observation of data assets.5 versions - Latest release: over 1 year ago - 2 dependent packages - 6,905 stars on GitHub
dagster-ssh 1.0.17
An orchestration platform for the development, production, and observation of data assets.114 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster_postgres 0.6.5
An orchestration platform for the development, production, and observation of data assets.2 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster_twilio 0.6.5
An orchestration platform for the development, production, and observation of data assets.2 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster_ge 0.6.7
An orchestration platform for the development, production, and observation of data assets.5 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster-bash 0.7.16
An orchestration platform for the development, production, and observation of data assets.11 versions - Latest release: almost 4 years ago - 6,905 stars on GitHub
dagster-fivetran 1.0.17
An orchestration platform for the development, production, and observation of data assets.56 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-prometheus 1.0.17
An orchestration platform for the development, production, and observation of data assets.114 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-spark 1.0.17
An orchestration platform for the development, production, and observation of data assets.114 versions - Latest release: over 1 year ago - 1 dependent package - 6,905 stars on GitHub
dagster-duckdb-pandas 1.0.17
An orchestration platform for the development, production, and observation of data assets.5 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-pandas 1.0.17
An orchestration platform for the development, production, and observation of data assets.114 versions - Latest release: over 1 year ago - 4 dependent packages - 6,905 stars on GitHub
dagster-airbyte 1.0.17
An orchestration platform for the development, production, and observation of data assets.47 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-managed-elements 1.0.17
An orchestration platform for the development, production, and observation of data assets.3 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster_ssh 0.6.5
An orchestration platform for the development, production, and observation of data assets.2 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster-duckdb-pyspark 1.0.17
An orchestration platform for the development, production, and observation of data assets.5 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-census 1.0.17
An orchestration platform for the development, production, and observation of data assets.12 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-dbt 1.0.17
An orchestration platform for the development, production, and observation of data assets.106 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster_snowflake 0.6.5
An orchestration platform for the development, production, and observation of data assets.2 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster_datadog 0.6.5
An orchestration platform for the development, production, and observation of data assets.1 version - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster_pandas 0.6.7
An orchestration platform for the development, production, and observation of data assets.5 versions - Latest release: over 4 years ago - 1 dependent package - 6,905 stars on GitHub
dagster_cron 0.6.5
An orchestration platform for the development, production, and observation of data assets.2 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster-snowflake-pandas 1.0.17
An orchestration platform for the development, production, and observation of data assets.12 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
kfp-pipeline-spec 0.1.16
Machine Learning Pipelines for Kubeflow6 versions - Latest release: almost 2 years ago - 1 dependent package - 2 dependent repositories - 3,137 stars on GitHub
kfp 1.8.14
Kubeflow is a machine learning (ML) toolkit that is dedicated to making deployments of ML workflo...58 versions - Latest release: over 1 year ago - 4 dependent packages - 2 dependent repositories - 3,137 stars on GitHub
kfp-server-api 1.8.4
Generated python client for the KF Pipelines server API20 versions - Latest release: almost 2 years ago - 1 dependent package - 2 dependent repositories - 3,137 stars on GitHub
datacompy 0.8.3
Pandas and Spark DataFrame comparison for humans9 versions - Latest release: over 1 year ago - 1 dependent repositories - 269 stars on GitHub
r-collapse 1.8.9 💰
Advanced and Fast Data Transformation in R5 versions - Latest release: over 1 year ago - 2 dependent packages - 451 stars on GitHub
r-janitor 2.1.0
simple tools for data cleaning in R8 versions - Latest release: over 3 years ago - 8 dependent packages - 8 dependent repositories - 1,224 stars on GitHub
Top 2.6% on conda-forge.org
17 versions - Latest release: over 1 year ago - 44 dependent packages - 259 dependent repositories - 6,139 stars on GitHub
folium 0.13.0
Python Data. Leaflet.js Maps.17 versions - Latest release: over 1 year ago - 44 dependent packages - 259 dependent repositories - 6,139 stars on GitHub
Related Keywords
python
265
machine-learning
148
mlops
88
data-engineering
75
analytics
70
workflow
70
etl
67
orchestration
65
data-integration
64
data-pipelines
64
data-orchestrator
63
scheduler
63
metadata
62
workflow-automation
62
dagster
61
hacktoberfest
44
data-analysis
40
pandas
36
visualization
35
deep-learning
35
data-visualization
29
ai
27
automl
27
r
27
scikit-learn
24
dataframe
24
reproducibility
24
data
22
hyperparameter-optimization
22
model-selection
21
developer-tools
21
pytorch
21
git
20
jupyter
20
statistics
20
collaboration
20
data-version-control
19
distributed
19
time-series
18
automation
18
spark
17
tensorflow
16
data-mining
15
feature-engineering
15
machinelearning
15
random-forest
15
jupyter-notebook
15
natural-language-processing
13
java
13
reinforcement-learning
12
optimization
12
matplotlib
12
nlp
11
python3
11
r-package
11
parallel
11
pipeline
11
automated-machine-learning
11
hyperparameter-search
11
tabular-data
11
rllib
10
forecasting
10
serving
10
ray
10
gradient-boosting
10
bigdata
10
deployment
10
exploratory-data-analysis
10
ml
10
pyarrow
10
hdf5
10
memory-mapped-file
10
datascience
9
llm-serving
9
sql
9
numpy
8
time-series-analysis
8
big-data
8
classification
8
extensible
8
flyte
8
notebook
8
workflows
8
flyte-tasks
8
rstats
8
pypi
8
sdk
8
plotly
7
aiml
7
alzheimer
7
plotly-dash
7
ag066833
7
adsp
7
alzheimers
7
regression
7
nia
7
parameter-tuning
7
artificial-intelligence
7
plotting
7
anomaly-detection
7