Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

conda-forge.org "data-science" keyword

dagster-celery-docker 1.0.17
An orchestration platform for the development, production, and observation of data assets.
98 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-github 1.0.17
An orchestration platform for the development, production, and observation of data assets.
114 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster_dbt 0.6.5
An orchestration platform for the development, production, and observation of data assets.
2 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster-gcp 1.0.17
An orchestration platform for the development, production, and observation of data assets.
112 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-postgres 1.0.17
An orchestration platform for the development, production, and observation of data assets.
114 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-shell 1.0.17
An orchestration platform for the development, production, and observation of data assets.
103 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster_bash 0.6.5
An orchestration platform for the development, production, and observation of data assets.
2 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
woodwork 0.20.0
Woodwork is a Python library that provides robust methods for managing and communicating data typ...
44 versions - Latest release: over 1 year ago - 5 dependent packages - 112 stars on GitHub
ray-all 2.0.1
Ray is a fast and simple framework for building and running distributed applications. It is split...
21 versions - Latest release: over 1 year ago - 3 dependent packages - 1 dependent repositories - 28,849 stars on GitHub
ray-rllib 2.0.1
Ray is a fast and simple framework for building and running distributed applications. It is split...
21 versions - Latest release: over 1 year ago - 1 dependent package - 1 dependent repositories - 28,849 stars on GitHub
ray-data 2.0.1
Ray is a fast and simple framework for building and running distributed applications. It is split...
10 versions - Latest release: over 1 year ago - 1 dependent package - 1 dependent repositories - 28,849 stars on GitHub
ray-autoscaler 1.1.0
Ray is a fast and simple framework for building and running distributed applications.
2 versions - Latest release: about 3 years ago - 1 dependent package - 24,669 stars on GitHub
modin-all 0.17.0
Modin: Scale your Pandas workflows by changing a single line of code
22 versions - Latest release: over 1 year ago - 8,468 stars on GitHub
modin 0.17.0
Modin: Scale your Pandas workflows by changing a single line of code
31 versions - Latest release: over 1 year ago - 3 dependent packages - 2 dependent repositories - 8,468 stars on GitHub
modin-hdk 0.17.0
Modin: Scale your Pandas workflows by changing a single line of code
4 versions - Latest release: over 1 year ago - 8,468 stars on GitHub
modin-ray 0.17.0
Modin: Scale your Pandas workflows by changing a single line of code
22 versions - Latest release: over 1 year ago - 4 dependent packages - 8,468 stars on GitHub
modin-core 0.17.0
Modin: Scale your Pandas workflows by changing a single line of code
22 versions - Latest release: over 1 year ago - 9 dependent packages - 1 dependent repositories - 8,468 stars on GitHub
r-naivebayes 0.9.7
High performance implementation of the Naive Bayes algorithm in R
2 versions - Latest release: about 4 years ago - 2 dependent repositories - 28 stars on GitHub
ml-research 0.4a2
A Python library with the implementation of algorithms for all papers I have been involved with.
2 versions - Latest release: over 1 year ago - 3 stars on GitHub
_dvc-s3 1.9.0
Data Version Control or DVC is an open-source tool for data science and machine learning projects.
1 version - Latest release: over 3 years ago - 1 dependent package - 11,242 stars on GitHub
_dvc-oss 1.9.0
Data Version Control or DVC is an open-source tool for data science and machine learning projects.
1 version - Latest release: over 3 years ago - 1 dependent package - 11,242 stars on GitHub
_dvc-gs 1.9.0
Data Version Control or DVC is an open-source tool for data science and machine learning projects.
1 version - Latest release: over 3 years ago - 1 dependent package - 11,242 stars on GitHub
Top 9.4% on conda-forge.org
dvc-base 1.11.16
Data Version Control or DVC is an open-source tool for data science and machine learning projects.
20 versions - Latest release: over 3 years ago - 9 dependent packages - 1 dependent repositories - 11,242 stars on GitHub
Top 5.1% on conda-forge.org
dvc 2.34.2
Data Version Control or DVC is an open-source tool for data science and machine learning projects.
212 versions - Latest release: over 1 year ago - 10 dependent packages - 26 dependent repositories - 11,242 stars on GitHub
dvc-gdrive 2.19.0
Data Version Control or DVC is an open-source tool for data science and machine learning projects.
67 versions - Latest release: almost 2 years ago - 1 dependent package - 2 dependent repositories - 11,242 stars on GitHub
dvc-s3 2.21.0
Data Version Control or DVC is an open-source tool for data science and machine learning projects.
70 versions - Latest release: over 1 year ago - 1 dependent package - 12 dependent repositories - 11,242 stars on GitHub
lifelines 0.27.4 💰
Survival analysis in Python
58 versions - Latest release: over 1 year ago - 4 dependent packages - 9 dependent repositories - 2,055 stars on GitHub
stumpy 1.11.1
STUMPY is a powerful and scalable Python library that computes something called a matrix profile,...
25 versions - Latest release: about 2 years ago - 3 dependent packages - 1 dependent repositories - 2,618 stars on GitHub
pytorch-forecasting 0.10.2
PyTorch Forecasting is a timeseries forecasting package for PyTorch build on PyTorch Lightning. I...
23 versions - Latest release: about 2 years ago - 1 dependent repositories - 2,683 stars on GitHub
metaflow 2.7.14
:rocket: Build and manage real-life data science projects with ease!
58 versions - Latest release: over 1 year ago - 1 dependent repositories - 6,497 stars on GitHub
Top 8.4% on conda-forge.org
scanpy 1.9.1
Scanpy is a scalable toolkit for analyzing single-cell gene expression data built jointly with an...
5 versions - Latest release: about 2 years ago - 6 dependent packages - 33 dependent repositories - 1,728 stars on GitHub
r-astsa 1.16
R package to accompany Time Series Analysis and Its Applications: With R Examples -and- Time Seri...
4 versions - Latest release: over 1 year ago - 1 dependent package - 1 dependent repositories - 97 stars on GitHub
Top 7.3% on conda-forge.org
tpot 0.11.7
Consider TPOT your Data Science Assistant. TPOT is a Python Automated Machine Learning tool that ...
19 versions - Latest release: about 3 years ago - 7 dependent packages - 5 dependent repositories - 8,986 stars on GitHub
pysparkling 0.6.1
A pure Python implementation of Apache Spark's RDD and DStream interfaces.
1 version - Latest release: about 3 years ago - 256 stars on GitHub
tscv 0.1.2
This repository is a scikit-learn extension for time series cross-validation. It introduces gaps ...
1 version - Latest release: about 3 years ago - 229 stars on GitHub
r-uptasticsearch 0.4.0
An Elasticsearch client tailored to data science workflows.
1 version - Latest release: about 4 years ago - 47 stars on GitHub
u8darts-all 0.22.0
A python library for user-friendly forecasting and anomaly detection on time series.
18 versions - Latest release: over 1 year ago - 5,568 stars on GitHub
u8darts-torch 0.22.0
A python library for user-friendly forecasting and anomaly detection on time series.
15 versions - Latest release: over 1 year ago - 5,568 stars on GitHub
u8darts 0.22.0
A python library for user-friendly forecasting and anomaly detection on time series.
18 versions - Latest release: over 1 year ago - 2 dependent packages - 1 dependent repositories - 5,568 stars on GitHub
r-mlr3data 0.6.1 💰
Data sets used in the book, gallery, or in examples of mlr3.
6 versions - Latest release: almost 2 years ago - 1 dependent package - 2 stars on GitHub
handout 1.1.1
Turn Python scripts into handouts with Markdown and figures
1 version - Latest release: over 4 years ago - 1,990 stars on GitHub
Top 1.0% on conda-forge.org
ipython 8.6.0 💰
IPython provides a rich architecture for interactive computing with a powerful interactive shell,...
72 versions - Latest release: over 1 year ago - 306 dependent packages - 3,810 dependent repositories - 15,737 stars on GitHub
fastds 0.6.0
fds is a tool for Data Scientists made by DAGsHub to version control data and code at once. At a...
12 versions - Latest release: over 2 years ago - 365 stars on GitHub
_dvc 1.9.0
Data Version Control or DVC is an open-source tool for data science and machine learning projects.
1 version - Latest release: over 3 years ago - 8 dependent packages - 11,235 stars on GitHub
_dvc-azure 1.9.0
Data Version Control or DVC is an open-source tool for data science and machine learning projects.
1 version - Latest release: over 3 years ago - 1 dependent package - 11,242 stars on GitHub
_dvc-hdfs 1.9.0
Data Version Control or DVC is an open-source tool for data science and machine learning projects.
1 version - Latest release: over 3 years ago - 1 dependent package - 11,242 stars on GitHub
dvc-webdav 2.19.0
Data Version Control or DVC is an open-source tool for data science and machine learning projects.
24 versions - Latest release: almost 2 years ago - 11,242 stars on GitHub
_dvc-gdrive 1.9.0
Data Version Control or DVC is an open-source tool for data science and machine learning projects.
1 version - Latest release: over 3 years ago - 1 dependent package - 11,242 stars on GitHub
dvc-azure 2.20.4
Data Version Control or DVC is an open-source tool for data science and machine learning projects.
71 versions - Latest release: over 1 year ago - 1 dependent package - 1 dependent repositories - 11,242 stars on GitHub
dvc-webhdfs 2.19.0
Data Version Control or DVC is an open-source tool for data science and machine learning projects.
64 versions - Latest release: almost 2 years ago - 1 dependent repositories - 11,242 stars on GitHub
dvc-gs 2.20.0
Data Version Control or DVC is an open-source tool for data science and machine learning projects.
69 versions - Latest release: over 1 year ago - 1 dependent package - 1 dependent repositories - 11,242 stars on GitHub
dotnet-interactive 1.0.2309030
.NET Interactive combines the power of .NET with many other languages to create notebooks, REPLs,...
6 versions - Latest release: almost 3 years ago - 2,292 stars on GitHub
r-tarchetypes 0.7.2
Archetypes for targets and pipelines
14 versions - Latest release: over 1 year ago - 1 dependent repositories - 89 stars on GitHub
splink 1.0.6
Fast, accurate and scalable probabilistic data linkage using your choice of SQL backend
16 versions - Latest release: almost 3 years ago - 571 stars on GitHub
nlpaug 1.1.11 💰
This python library helps you with augmenting NLP for your machine learning projects. `Augmenter...
7 versions - Latest release: almost 2 years ago - 3,846 stars on GitHub
xeofs 0.7.0
Collection of EOF analysis and related variants for climate science
3 versions - Latest release: over 1 year ago - 34 stars on GitHub
mvlearn 0.5.0
mvlearn aims to serve as a community-driven open-source software package that offers reference im...
5 versions - Latest release: about 2 years ago - 169 stars on GitHub
turbodbc 4.5.5
Turbodbc is a Python module to access relational databases via the Open Database Connectivity (OD...
27 versions - Latest release: almost 2 years ago - 2 dependent packages - 2 dependent repositories - 566 stars on GitHub
deon 0.3.0
deon is a command line tool that allows you to easily add an ethics checklist to your data scienc...
3 versions - Latest release: over 3 years ago - 250 stars on GitHub
r-vtree 5.4.6
An R package for calculating and drawing variable trees
3 versions - Latest release: over 2 years ago - 71 stars on GitHub
jupyter_pivottablejs 0.9.0
Drag and drop Pivot Tables and Charts for Jupyter/IPython Notebook
2 versions - Latest release: over 1 year ago - 1 dependent repositories - 541 stars on GitHub
koalas 1.8.2
Koalas: pandas API on Apache Spark
42 versions - Latest release: over 2 years ago - 1 dependent package - 3,256 stars on GitHub
tsflex 0.3.0
Flexible time series feature extraction & processing
15 versions - Latest release: over 1 year ago - 240 stars on GitHub
Top 9.0% on conda-forge.org
r-tidyverse 1.3.2
Easily install and load packages from the tidyverse
5 versions - Latest release: almost 2 years ago - 4 dependent packages - 173 dependent repositories - 1,456 stars on GitHub
scikit-mobility 1.3.1
scikit-mobility is a library for human mobility analysis in Python. The library allows to: (i) re...
4 versions - Latest release: almost 2 years ago - 2 dependent repositories - 598 stars on GitHub
retriever 3.1.0
This module analyzes jpeg/jpeg2000/png/gif image header and return image size.
8 versions - Latest release: about 2 years ago - 1 dependent repositories - 279 stars on GitHub
lifetimes 0.11.3 💰
Lifetime value in Python
2 versions - Latest release: over 3 years ago - 1,361 stars on GitHub
Top 6.4% on conda-forge.org
boltons 21.0.0
🔩 Like builtins, but boltons. 250+ constructs, recipes, and snippets which extend (and rely on no...
17 versions - Latest release: about 3 years ago - 21 dependent packages - 15 dependent repositories - 6,089 stars on GitHub
darr 0.5.4
Darr is a Python science library for disk-based NumPy arrays that persist in a format that is sim...
11 versions - Latest release: almost 2 years ago - 20 stars on GitHub
nipype 1.8.5
Workflows and interfaces for neuroimaging packages
45 versions - Latest release: over 1 year ago - 1 dependent package - 5 dependent repositories - 661 stars on GitHub
pygam 0.8.0
pyGAM is a library for training generalized additive models in Python. GAMs are powerful and inte...
3 versions - Latest release: over 5 years ago - 1 dependent package - 9 dependent repositories - 770 stars on GitHub
baikal 0.4.2
baikal is a graph-based, functional API for building complex machine learning pipelines of object...
1 version - Latest release: about 3 years ago - 594 stars on GitHub
feast 0.26.0
Feature Store for Machine Learning
1 version - Latest release: over 1 year ago - 4,088 stars on GitHub
skweak 0.3.3
skweak: A software toolkit for weak supervision applied to NLP tasks
4 versions - Latest release: over 1 year ago - 870 stars on GitHub
combo 0.1.3 💰
(AAAI' 20) A Python Toolbox for Machine Learning Model Combination
1 version - Latest release: almost 2 years ago - 2 dependent repositories - 611 stars on GitHub
allennlp-all 2.10.0
An Apache 2.0 NLP research library, built on PyTorch, for developing state-of-the-art deep learni...
4 versions - Latest release: almost 2 years ago - 1 dependent package - 11,429 stars on GitHub
allennlp-checklist 2.10.0
An Apache 2.0 NLP research library, built on PyTorch, for developing state-of-the-art deep learni...
4 versions - Latest release: almost 2 years ago - 1 dependent package - 11,429 stars on GitHub
allennlp 2.10.0
An Apache 2.0 NLP research library, built on PyTorch, for developing state-of-the-art deep learni...
16 versions - Latest release: almost 2 years ago - 6 dependent packages - 11,429 stars on GitHub
skrebate 0.5
These algorithms excel at identifying features that are predictive of the outcome in supervised l...
5 versions - Latest release: over 1 year ago - 2 dependent packages - 1 dependent repositories - 376 stars on GitHub
r-drake 7.13.4
An R-focused pipeline toolkit for reproducibility and high-performance computing
21 versions - Latest release: almost 2 years ago - 1,321 stars on GitHub
Top 2.5% on conda-forge.org
dash 2.7.0 💰
Data Apps & Dashboards for Python. No JavaScript Required.
87 versions - Latest release: over 1 year ago - 37 dependent packages - 108 dependent repositories - 18,331 stars on GitHub
tsfresh 0.19.0
Automatic extraction of relevant features from time series:
11 versions - Latest release: over 2 years ago - 2 dependent packages - 5 dependent repositories - 7,166 stars on GitHub
pycwt 0.3.0a22
A Python module for continuous wavelet spectral analysis. It includes a collection of routines fo...
1 version - Latest release: over 1 year ago - 3 dependent repositories - 212 stars on GitHub
mapie 0.5.0
A scikit-learn-compatible module for estimating prediction intervals.
8 versions - Latest release: over 1 year ago - 1 dependent repositories - 689 stars on GitHub
Top 4.7% on conda-forge.org
imbalanced-learn 0.9.1
imbalanced-learn is a python package offering a number of re-sampling techniques commonly used in...
15 versions - Latest release: about 2 years ago - 8 dependent packages - 118 dependent repositories - 6,276 stars on GitHub
Top 4.9% on conda-forge.org
prefect 2.6.7
Prefect is a workflow management system, designed for modern infrastructure and powered by the op...
133 versions - Latest release: over 1 year ago - 8 dependent packages - 41 dependent repositories - 11,520 stars on GitHub
chartify 3.0.3
Python library that makes it easy for data scientists to create charts.
10 versions - Latest release: over 3 years ago - 3,297 stars on GitHub
snorkel 0.9.9
Snorkel is a system for programmatically building and managing training datasets to rapidly and f...
10 versions - Latest release: almost 2 years ago - 1 dependent package - 4 dependent repositories - 5,445 stars on GitHub
rubicon-ml 0.4.0
rubicon-ml is a machine learning solution designed to help standardize the model development life...
33 versions - Latest release: over 1 year ago - 2 dependent repositories - 99 stars on GitHub
miceforest 5.6.2
Multiple Imputation iteratively 'fills in' missing values in a dataset by modeling each variable ...
8 versions - Latest release: almost 2 years ago - 229 stars on GitHub
pdpipe 0.3.2
Ever written a preprocessing pipeline for pandas dataframes and had trouble serializing it for la...
20 versions - Latest release: over 1 year ago - 703 stars on GitHub
r-targets 0.14.0
Function-oriented Make-like declarative workflows for R
18 versions - Latest release: over 1 year ago - 2 dependent packages - 1 dependent repositories - 845 stars on GitHub
mljar-mercury 0.5.1
Build Web Apps in Jupyter Notebook with Python only
3 versions - Latest release: over 2 years ago - 2,513 stars on GitHub
Top 7.3% on conda-forge.org
ray-tune 2.0.1
Ray is a fast and simple framework for building and running distributed applications. It is split...
21 versions - Latest release: over 1 year ago - 4 dependent packages - 6 dependent repositories - 28,849 stars on GitHub
jupyterlab_templates 0.3.2
Jupyter notebook templates
5 versions - Latest release: over 1 year ago - 314 stars on GitHub
mljar-supervised 0.11.3
The mljar-supervised is an Automated Machine Learning Python package that works with tabular data...
5 versions - Latest release: almost 2 years ago - 2,520 stars on GitHub
Top 10.0% on conda-forge.org
doit 0.36.0 💰
`doit` is a task management & automation tool. `doit` comes from the idea of bringing the power ...
10 versions - Latest release: about 2 years ago - 6 dependent packages - 27 dependent repositories - 1,551 stars on GitHub
mage-ai 0.7.5
🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and t...
22 versions - Latest release: over 1 year ago - 3,631 stars on GitHub
great-expectations 0.15.32
Great Expectations helps teams save time and promote analytic integrity by offering a unique appr...
144 versions - Latest release: over 1 year ago - 2 dependent packages - 1 dependent repositories - 8,121 stars on GitHub
superset 2.0.0
Apache Superset is a Data Visualization and Data Exploration Platform
12 versions - Latest release: almost 2 years ago - 58,575 stars on GitHub