Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

conda-forge.org "data-science" keyword

Top 9.2% on conda-forge.org
scikit-plot 0.3.7
An intuitive library to add plotting functionality to scikit-learn objects.
6 versions - Latest release: over 5 years ago - 7 dependent packages - 13 dependent repositories - 2,295 stars on GitHub
bowtie-py 0.11.0
Bowtie is a library for writing dashboards in Python. No need to know web frameworks or JavaScrip...
7 versions - Latest release: over 5 years ago - 756 stars on GitHub
pygam 0.8.0
pyGAM is a library for training generalized additive models in Python. GAMs are powerful and inte...
3 versions - Latest release: over 5 years ago - 1 dependent package - 9 dependent repositories - 770 stars on GitHub
corral-pipeline 0.3
Corral will solve your pipeline needs by merging a database full connection interface with a MVC ...
1 version - Latest release: about 5 years ago - 5 stars on GitHub
vaex-distributed 0.3.0
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of ...
3 versions - Latest release: about 5 years ago - 1 dependent package - 7,837 stars on GitHub
nteract_on_jupyter 2.1.3 💰
📘 The interactive computing suite for you! ✨
2 versions - Latest release: almost 5 years ago - 26 dependent repositories - 5,983 stars on GitHub
handout 1.1.1
Turn Python scripts into handouts with Markdown and figures
1 version - Latest release: over 4 years ago - 1,990 stars on GitHub
vaex-ui 0.3.0
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of ...
5 versions - Latest release: over 4 years ago - 1 dependent package - 7,837 stars on GitHub
scikit-time 0.1
A unified framework for machine learning with time series
1 version - Latest release: over 4 years ago - 39 stars on GitHub
dagster_pagerduty 0.6.4
An orchestration platform for the development, production, and observation of data assets.
1 version - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster_postgres 0.6.5
An orchestration platform for the development, production, and observation of data assets.
2 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster_slack 0.6.5
An orchestration platform for the development, production, and observation of data assets.
2 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster_twilio 0.6.5
An orchestration platform for the development, production, and observation of data assets.
2 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster_papertrail 0.6.5
An orchestration platform for the development, production, and observation of data assets.
2 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster_dbt 0.6.5
An orchestration platform for the development, production, and observation of data assets.
2 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster_cron 0.6.5
An orchestration platform for the development, production, and observation of data assets.
2 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster_snowflake 0.6.5
An orchestration platform for the development, production, and observation of data assets.
2 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster_datadog 0.6.5
An orchestration platform for the development, production, and observation of data assets.
1 version - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster_bash 0.6.5
An orchestration platform for the development, production, and observation of data assets.
2 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster_spark 0.6.5
An orchestration platform for the development, production, and observation of data assets.
2 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster_gcp 0.6.5
An orchestration platform for the development, production, and observation of data assets.
1 version - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster_ssh 0.6.5
An orchestration platform for the development, production, and observation of data assets.
2 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster_dask 0.6.5
An orchestration platform for the development, production, and observation of data assets.
1 version - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster_aws 0.6.6
An orchestration platform for the development, production, and observation of data assets.
2 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
bookstore 2.5.1
bookstore provides tooling and workflow recommendations for storing, scheduling, and publishing n...
2 versions - Latest release: over 4 years ago - 187 stars on GitHub
boltzmannclean 0.1.2
Fills missing values in a pandas DataFrame using a Restricted Boltzmann Machine.
1 version - Latest release: over 4 years ago - 23 stars on GitHub
dagster_pandas 0.6.7
An orchestration platform for the development, production, and observation of data assets.
5 versions - Latest release: over 4 years ago - 1 dependent package - 6,905 stars on GitHub
dagster_graphql 0.6.7
An orchestration platform for the development, production, and observation of data assets.
5 versions - Latest release: over 4 years ago - 2 dependent packages - 6,905 stars on GitHub
dagster_ge 0.6.7
An orchestration platform for the development, production, and observation of data assets.
5 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
pandas_schema 0.3.5
A validation library for Pandas data frames using user-friendly schemas
1 version - Latest release: about 4 years ago - 1 dependent repositories - 180 stars on GitHub
r-naivebayes 0.9.7
High performance implementation of the Naive Bayes algorithm in R
2 versions - Latest release: about 4 years ago - 2 dependent repositories - 28 stars on GitHub
gspread-pandas 2.2.3
A package to easily open an instance of a Google spreadsheet and interact with worksheets through...
3 versions - Latest release: about 4 years ago - 357 stars on GitHub
pyglmnet 1.1
Python implementation of elastic-net regularized generalized linear models
1 version - Latest release: about 4 years ago - 258 stars on GitHub
shogun-cpp 6.1.4 💰
The Shogun Machine learning toolbox offers a wide range of efficient and unified Machine Learning...
6 versions - Latest release: about 4 years ago - 1 dependent package - 1 dependent repositories - 2,933 stars on GitHub
r-uptasticsearch 0.4.0
An Elasticsearch client tailored to data science workflows.
1 version - Latest release: about 4 years ago - 47 stars on GitHub
graspy 0.2
A graph, or network, provides a mathematically intuitive representation of data with some sort of...
1 version - Latest release: about 4 years ago - 1 dependent package - 291 stars on GitHub
vaex-arrow 0.5.1
Arrow support for vaex (out of core dataframes)
12 versions - Latest release: almost 4 years ago - 1 dependent package - 7,837 stars on GitHub
dagster-bash 0.7.16
An orchestration platform for the development, production, and observation of data assets.
11 versions - Latest release: almost 4 years ago - 6,905 stars on GitHub
geomatics 0.10.1
A python tool for time series of multidimensional scientific data
8 versions - Latest release: almost 4 years ago - 1 stars on GitHub
dash_cytoscape 0.2.0 💰
Interactive network visualization in Python and Dash, powered by Cytoscape.js
2 versions - Latest release: almost 4 years ago - 3 dependent repositories - 495 stars on GitHub
artificial-adversary 1.1.1
🗣️ Tool to generate adversarial text examples and test machine learning models against them
2 versions - Latest release: almost 4 years ago - 377 stars on GitHub
evaml-core 0.12.2
An open source python library for automated feature engineering
1 version - Latest release: almost 4 years ago - 6,558 stars on GitHub
kneed 0.7.0
Knee point detection in Python :chart_with_upwards_trend:
8 versions - Latest release: over 3 years ago - 1 dependent package - 9 dependent repositories - 570 stars on GitHub
_dvc-azure 1.9.0
Data Version Control or DVC is an open-source tool for data science and machine learning projects.
1 version - Latest release: over 3 years ago - 1 dependent package - 11,242 stars on GitHub
_dvc-gdrive 1.9.0
Data Version Control or DVC is an open-source tool for data science and machine learning projects.
1 version - Latest release: over 3 years ago - 1 dependent package - 11,242 stars on GitHub
_dvc-gs 1.9.0
Data Version Control or DVC is an open-source tool for data science and machine learning projects.
1 version - Latest release: over 3 years ago - 1 dependent package - 11,242 stars on GitHub
_dvc-hdfs 1.9.0
Data Version Control or DVC is an open-source tool for data science and machine learning projects.
1 version - Latest release: over 3 years ago - 1 dependent package - 11,242 stars on GitHub
_dvc-oss 1.9.0
Data Version Control or DVC is an open-source tool for data science and machine learning projects.
1 version - Latest release: over 3 years ago - 1 dependent package - 11,242 stars on GitHub
_dvc-s3 1.9.0
Data Version Control or DVC is an open-source tool for data science and machine learning projects.
1 version - Latest release: over 3 years ago - 1 dependent package - 11,242 stars on GitHub
_dvc-ssh 1.9.0
Data Version Control or DVC is an open-source tool for data science and machine learning projects.
1 version - Latest release: over 3 years ago - 1 dependent package - 11,242 stars on GitHub
_dvc 1.9.0
Data Version Control or DVC is an open-source tool for data science and machine learning projects.
1 version - Latest release: over 3 years ago - 8 dependent packages - 11,235 stars on GitHub
lifetimes 0.11.3 💰
Lifetime value in Python
2 versions - Latest release: over 3 years ago - 1,361 stars on GitHub
chartify 3.0.3
Python library that makes it easy for data scientists to create charts.
10 versions - Latest release: over 3 years ago - 3,297 stars on GitHub
pyclustering 0.10.1
pyclustering is a Python, C++ data mining library.
4 versions - Latest release: over 3 years ago - 1 dependent package - 3 dependent repositories - 1,052 stars on GitHub
deon 0.3.0
deon is a command line tool that allows you to easily add an ethics checklist to your data scienc...
3 versions - Latest release: over 3 years ago - 250 stars on GitHub
r-dataexplorer 0.8.2
Automate Data Exploration and Treatment
2 versions - Latest release: over 3 years ago - 467 stars on GitHub
r-janitor 2.1.0
simple tools for data cleaning in R
8 versions - Latest release: over 3 years ago - 8 dependent packages - 8 dependent repositories - 1,224 stars on GitHub
matrixprofile 1.1.10
A Python 3 library making time series data mining tasks, utilizing matrix profile algorithms, acc...
1 version - Latest release: over 3 years ago - 2 dependent packages - 1 dependent repositories - 305 stars on GitHub
r-breakdown 0.2.1 💰
Model Agnostics breakDown plots
5 versions - Latest release: over 3 years ago - 1 dependent package - 99 stars on GitHub
pixiedust 1.1.19
Python Helper library for Jupyter Notebooks
3 versions - Latest release: over 3 years ago - 1 dependent repositories - 1,029 stars on GitHub
Top 9.4% on conda-forge.org
dvc-base 1.11.16
Data Version Control or DVC is an open-source tool for data science and machine learning projects.
20 versions - Latest release: about 3 years ago - 9 dependent packages - 1 dependent repositories - 11,242 stars on GitHub
Top 7.3% on conda-forge.org
tpot 0.11.7
Consider TPOT your Data Science Assistant. TPOT is a Python Automated Machine Learning tool that ...
19 versions - Latest release: about 3 years ago - 7 dependent packages - 5 dependent repositories - 8,986 stars on GitHub
tpot-dask 0.11.7
Consider TPOT your Data Science Assistant. TPOT is a Python Automated Machine Learning tool that ...
1 version - Latest release: about 3 years ago - 1 dependent package - 8,986 stars on GitHub
tpot-imblearn 0.11.7
Consider TPOT your Data Science Assistant. TPOT is a Python Automated Machine Learning tool that ...
1 version - Latest release: about 3 years ago - 1 dependent package - 8,986 stars on GitHub
tpot-mdr 0.11.7
Consider TPOT your Data Science Assistant. TPOT is a Python Automated Machine Learning tool that ...
1 version - Latest release: about 3 years ago - 1 dependent package - 8,986 stars on GitHub
tpot-skrebate 0.11.7
Consider TPOT your Data Science Assistant. TPOT is a Python Automated Machine Learning tool that ...
1 version - Latest release: about 3 years ago - 1 dependent package - 8,986 stars on GitHub
tpot-torch 0.11.7
Consider TPOT your Data Science Assistant. TPOT is a Python Automated Machine Learning tool that ...
1 version - Latest release: about 3 years ago - 1 dependent package - 8,986 stars on GitHub
tpot-full 0.11.7
Consider TPOT your Data Science Assistant. TPOT is a Python Automated Machine Learning tool that ...
1 version - Latest release: about 3 years ago - 8,986 stars on GitHub
ray-autoscaler 1.1.0
Ray is a fast and simple framework for building and running distributed applications.
2 versions - Latest release: about 3 years ago - 1 dependent package - 24,669 stars on GitHub
pysparkling 0.6.1
A pure Python implementation of Apache Spark's RDD and DStream interfaces.
1 version - Latest release: about 3 years ago - 256 stars on GitHub
scitime 0.1.1
Training time estimation for scikit-learn algorithms.
4 versions - Latest release: about 3 years ago - 120 stars on GitHub
baikal 0.4.2
baikal is a graph-based, functional API for building complex machine learning pipelines of object...
1 version - Latest release: about 3 years ago - 594 stars on GitHub
tscv 0.1.2
This repository is a scikit-learn extension for time series cross-validation. It introduces gaps ...
1 version - Latest release: about 3 years ago - 229 stars on GitHub
leafmaptools 0.0.2 💰
A Python package for building a tool widgets infrastructure with ipyleaflet and ipywidgets
2 versions - Latest release: about 3 years ago - 1 dependent repositories - 9 stars on GitHub
r-loose.rock 1.1.0
An R :package: that contains a wide set of useful functions for data science and survival analysis
2 versions - Latest release: almost 3 years ago - 2 stars on GitHub
Top 6.4% on conda-forge.org
boltons 21.0.0
🔩 Like builtins, but boltons. 250+ constructs, recipes, and snippets which extend (and rely on no...
17 versions - Latest release: almost 3 years ago - 21 dependent packages - 15 dependent repositories - 6,089 stars on GitHub
genestboost 0.3.1
genestboost is an ML boosting library that separates the modeling algorithm from the boosting alg...
3 versions - Latest release: almost 3 years ago - 2 stars on GitHub
splink 1.0.6
Fast, accurate and scalable probabilistic data linkage using your choice of SQL backend
16 versions - Latest release: almost 3 years ago - 571 stars on GitHub
dotnet-interactive 1.0.2309030
.NET Interactive combines the power of .NET with many other languages to create notebooks, REPLs,...
6 versions - Latest release: almost 3 years ago - 2,292 stars on GitHub
dagster-cron 0.11.15
An orchestration platform for the development, production, and observation of data assets.
44 versions - Latest release: almost 3 years ago - 6,905 stars on GitHub
evidently 0.1.23.dev0
Evaluate and monitor ML models from validation to production. Join our Discord: https://discord.c...
2 versions - Latest release: over 2 years ago - 3,246 stars on GitHub
dash-table 5.0.0 💰
OBSOLETE: now part of https://github.com/plotly/dash
31 versions - Latest release: over 2 years ago - 12 dependent packages - 29 dependent repositories - 422 stars on GitHub
r-vtree 5.4.6
An R package for calculating and drawing variable trees
3 versions - Latest release: over 2 years ago - 71 stars on GitHub
r-fivethirtyeight 0.6.2
R package of data and code behind the stories and interactives at FiveThirtyEight
4 versions - Latest release: over 2 years ago - 443 stars on GitHub
psyplot-gui 1.4.0
This package provides a graphical user interface to interact with the psyplot framework.
10 versions - Latest release: over 2 years ago - 2 dependent packages - 3 dependent repositories - 7 stars on GitHub
psy-view 0.2.0
This package provides a graphical user interface to quickly visualize the contents of a netCDF file
3 versions - Latest release: over 2 years ago - 2 dependent repositories - 9 stars on GitHub
aim 2.7.4
Aim 💫 — easy-to-use and performant open-source ML experiment tracker.
3 versions - Latest release: over 2 years ago - 3,276 stars on GitHub
koalas 1.8.2
Koalas: pandas API on Apache Spark
42 versions - Latest release: over 2 years ago - 1 dependent package - 3,256 stars on GitHub
openrefine 3.5.0 💰
OpenRefine is a free, open source power tool for working with messy data and improving it
3 versions - Latest release: over 2 years ago - 9,377 stars on GitHub
fastds 0.6.0
fds is a tool for Data Scientists made by DAGsHub to version control data and code at once. At a...
12 versions - Latest release: over 2 years ago - 365 stars on GitHub
deepgraph 0.2.3
DeepGraph is a scalable, general-purpose data analysis package. It implements a network represent...
3 versions - Latest release: over 2 years ago - 272 stars on GitHub
mpl_sample_data 3.4.3 💰
matplotlib is a python 2D plotting library which produces publication quality figures in a variet...
29 versions - Latest release: over 2 years ago - 17,056 stars on GitHub
psy-reg 1.4.0
This psyplot plugin can be used to make fits to your data and visualize them
5 versions - Latest release: over 2 years ago - 1 dependent package - 5 dependent repositories - 1 stars on GitHub
r-rio 0.5.29
A Swiss-Army Knife for Data I/O
5 versions - Latest release: over 2 years ago - 6 dependent packages - 6 dependent repositories - 548 stars on GitHub
r-datacomparer 0.1.4
dataCompareR is an R package that allows users to compare two datasets and view a report on the s...
2 versions - Latest release: over 2 years ago - 68 stars on GitHub
visions 0.7.5
Type System for Data Analysis in Python
12 versions - Latest release: over 2 years ago - 2 dependent packages - 13 dependent repositories - 174 stars on GitHub
susi 1.2.2
SuSi: Python package for unsupervised, supervised and semi-supervised self-organizing maps (SOM)
4 versions - Latest release: over 2 years ago - 83 stars on GitHub
tsfresh 0.19.0
Automatic extraction of relevant features from time series:
11 versions - Latest release: over 2 years ago - 2 dependent packages - 5 dependent repositories - 7,166 stars on GitHub
creme 0.6.1 💰
🌊 Online machine learning in Python
7 versions - Latest release: over 2 years ago - 4,146 stars on GitHub
vaex-server 0.8.1
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of ...
12 versions - Latest release: over 2 years ago - 3 dependent packages - 1 dependent repositories - 7,837 stars on GitHub