Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

conda-forge.org "data-science" keyword

openrefine 3.5.0 💰
OpenRefine is a free, open source power tool for working with messy data and improving it
3 versions - Latest release: over 2 years ago - 9,377 stars on GitHub
tpot-dask 0.11.7
Consider TPOT your Data Science Assistant. TPOT is a Python Automated Machine Learning tool that ...
1 version - Latest release: about 3 years ago - 1 dependent package - 8,986 stars on GitHub
Top 7.3% on conda-forge.org
tpot 0.11.7
Consider TPOT your Data Science Assistant. TPOT is a Python Automated Machine Learning tool that ...
19 versions - Latest release: about 3 years ago - 7 dependent packages - 5 dependent repositories - 8,986 stars on GitHub
tpot-full 0.11.7
Consider TPOT your Data Science Assistant. TPOT is a Python Automated Machine Learning tool that ...
1 version - Latest release: about 3 years ago - 8,986 stars on GitHub
pyprocessmacro 1.0.12
A Python library for moderation, mediation and conditional process analysis.
3 versions - Latest release: about 2 years ago - 67 stars on GitHub
ghostpii 1.0.11
This repository contains the Python library for interacting with Capnion's private computation AP...
5 versions - Latest release: over 1 year ago - 21 stars on GitHub
girder-client 3.1.15
A data management platform for the web, developed by Kitware
16 versions - Latest release: almost 2 years ago - 2 dependent packages - 402 stars on GitHub
dataprep 0.4.5
Open-source low code data preparation library in python. Collect, clean and visualization your da...
8 versions - Latest release: almost 2 years ago - 4 dependent repositories - 1,571 stars on GitHub
aim 2.7.4
Aim 💫 — easy-to-use and performant open-source ML experiment tracker.
3 versions - Latest release: over 2 years ago - 3,276 stars on GitHub
rpy2 3.5.6
Interface to use R from Python
16 versions - Latest release: over 1 year ago - 4 dependent packages - 73 dependent repositories - 376 stars on GitHub
nlpaug 1.1.11 💰
This python library helps you with augmenting NLP for your machine learning projects. `Augmenter...
7 versions - Latest release: almost 2 years ago - 3,846 stars on GitHub
sweetviz 2.1.4
Visualize and compare datasets, target values and associations, with one line of code.
10 versions - Latest release: almost 2 years ago - 2 dependent repositories - 2,352 stars on GitHub
matrixprofile 1.1.10
A Python 3 library making time series data mining tasks, utilizing matrix profile algorithms, acc...
1 version - Latest release: over 3 years ago - 2 dependent packages - 1 dependent repositories - 305 stars on GitHub
aimodelshare 0.0.144
23 versions - Latest release: over 1 year ago - 36 stars on GitHub
d2l 0.17.6
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 400 u...
2 versions - Latest release: over 1 year ago - 16,875 stars on GitHub
pyglmnet 1.1
Python implementation of elastic-net regularized generalized linear models
1 version - Latest release: about 4 years ago - 258 stars on GitHub
deepgraph 0.2.3
DeepGraph is a scalable, general-purpose data analysis package. It implements a network represent...
3 versions - Latest release: over 2 years ago - 272 stars on GitHub
eli5 0.13.0
ELI5 is a Python package which helps to debug machine learning classifiers and explain their pred...
11 versions - Latest release: about 2 years ago - 1 dependent package - 14 dependent repositories - 2,652 stars on GitHub
dirty_cat 0.3.0
Machine learning on dirty tabular data
3 versions - Latest release: over 1 year ago - 682 stars on GitHub
pyrolite 0.3.2
A set of tools for getting the most from your geochemical data.
2 versions - Latest release: almost 2 years ago - 1 dependent repositories - 100 stars on GitHub
traffic 2.8.0
A toolbox for processing and analysing air traffic data
7 versions - Latest release: almost 2 years ago - 1 dependent repositories - 273 stars on GitHub
kneed 0.7.0
Knee point detection in Python :chart_with_upwards_trend:
8 versions - Latest release: almost 4 years ago - 1 dependent package - 9 dependent repositories - 570 stars on GitHub
dagster-mysql 1.0.17
An orchestration platform for the development, production, and observation of data assets.
82 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
Top 1.6% on conda-forge.org
seaborn 0.12.1
Seaborn is a Python visualization library based on matplotlib. It provides a high-level interface...
11 versions - Latest release: over 1 year ago - 181 dependent packages - 3,503 dependent repositories - 10,487 stars on GitHub
mprod-package 0.0.4a1
The mprod package provides python implementation for applying tensor-tensor (tubal) products. In ...
3 versions - Latest release: almost 2 years ago - 9 stars on GitHub
dagster-snowflake 1.0.17
An orchestration platform for the development, production, and observation of data assets.
108 versions - Latest release: over 1 year ago - 1 dependent package - 6,905 stars on GitHub
dagster_papertrail 0.6.5
An orchestration platform for the development, production, and observation of data assets.
2 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster-papertrail 1.0.17
An orchestration platform for the development, production, and observation of data assets.
114 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
mpl_sample_data 3.4.3 💰
matplotlib is a python 2D plotting library which produces publication quality figures in a variet...
29 versions - Latest release: over 2 years ago - 17,056 stars on GitHub
Top 0.8% on conda-forge.org
matplotlib 3.6.2 💰
matplotlib is a python 2D plotting library which produces publication quality figures in a variet...
30 versions - Latest release: over 1 year ago - 699 dependent packages - 10,108 dependent repositories - 18,105 stars on GitHub
pycompare 1.5.4
Python module for generating Bland-Altman plots
8 versions - Latest release: about 2 years ago - 26 stars on GitHub
dagster-cron 0.11.15
An orchestration platform for the development, production, and observation of data assets.
44 versions - Latest release: almost 3 years ago - 6,905 stars on GitHub
dagster-graphql 1.0.17
An orchestration platform for the development, production, and observation of data assets.
114 versions - Latest release: over 1 year ago - 6 dependent packages - 6,905 stars on GitHub
_dvc-ssh 1.9.0
Data Version Control or DVC is an open-source tool for data science and machine learning projects.
1 version - Latest release: over 3 years ago - 1 dependent package - 11,242 stars on GitHub
Top 0.1% on conda-forge.org
scikit-learn 1.1.3 💰
scikit-learn: machine learning in Python
31 versions - Latest release: over 1 year ago - 647 dependent packages - 5,166 dependent repositories - 53,452 stars on GitHub
sktime-all-extras 0.12.1 💰
A unified framework for machine learning with time series
13 versions - Latest release: almost 2 years ago - 2 dependent repositories - 6,633 stars on GitHub
Top 9.3% on conda-forge.org
sktime 0.12.1 💰
A unified framework for machine learning with time series
18 versions - Latest release: over 1 year ago - 5 dependent packages - 3 dependent repositories - 6,633 stars on GitHub
dagster-datadog 1.0.17
An orchestration platform for the development, production, and observation of data assets.
114 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster_dask 0.6.5
An orchestration platform for the development, production, and observation of data assets.
1 version - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster-slack 1.0.17
An orchestration platform for the development, production, and observation of data assets.
114 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-twilio 1.0.17
An orchestration platform for the development, production, and observation of data assets.
114 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-docker 1.0.17
An orchestration platform for the development, production, and observation of data assets.
86 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
typedframe 0.7.0
Typed wrappers over pandas DataFrames with schema validation
1 version - Latest release: almost 2 years ago - 65 stars on GitHub
Top 10.0% on conda-forge.org
pyod 1.0.6 💰
A Comprehensive and Scalable Python Library for Outlier Detection (Anomaly Detection)
27 versions - Latest release: over 1 year ago - 3 dependent packages - 4 dependent repositories - 6,847 stars on GitHub
Top 1.1% on conda-forge.org
keras 2.10.0
Deep Learning for humans
22 versions - Latest release: over 1 year ago - 29 dependent packages - 327 dependent repositories - 57,664 stars on GitHub
foundry_ml 0.5.0
Simplifying the discovery and usage of machine-learning ready datasets in materials science and c...
4 versions - Latest release: over 1 year ago - 52 stars on GitHub
verticapy 0.11.0
VerticaPy is a Python library that exposes sci-kit like functionality to conduct data science pro...
17 versions - Latest release: over 1 year ago - 124 stars on GitHub
klib 1.0.6 💰
Easy to use Python library of customized functions for cleaning and analyzing data.
30 versions - Latest release: over 1 year ago - 356 stars on GitHub
Top 0.9% on conda-forge.org
matplotlib-base 3.6.2 💰
matplotlib is a python 2D plotting library which produces publication quality figures in a variet...
29 versions - Latest release: over 1 year ago - 1,213 dependent packages - 2,191 dependent repositories - 18,105 stars on GitHub
ptitprince 0.2.6
python version of raincloud
2 versions - Latest release: over 1 year ago - 1 dependent repositories - 169 stars on GitHub
r-datacomparer 0.1.4
dataCompareR is an R package that allows users to compare two datasets and view a report on the s...
2 versions - Latest release: over 2 years ago - 68 stars on GitHub
Top 1.7% on conda-forge.org
statsmodels 0.13.5
Statsmodels: statistical modeling and econometrics in Python
16 versions - Latest release: over 1 year ago - 130 dependent packages - 979 dependent repositories - 8,306 stars on GitHub
psy-view 0.2.0
This package provides a graphical user interface to quickly visualize the contents of a netCDF file
3 versions - Latest release: over 2 years ago - 2 dependent repositories - 9 stars on GitHub
spectrafit 0.12.5
SpectraFit is a command-line and Jupyter-notebook tool for quick data-fitting based on the regula...
8 versions - Latest release: over 1 year ago - 2 dependent packages - 11 stars on GitHub
spectrafit-all 0.12.5
SpectraFit is a command-line and Jupyter-notebook tool for quick data-fitting based on the regula...
3 versions - Latest release: over 1 year ago - 11 stars on GitHub
erroranalysis 0.3.12
**Responsible AI Toolbox**: `erroranalysis` Responsible AI is an approach to assessing, developi...
8 versions - Latest release: over 1 year ago - 886 stars on GitHub
r-modeltime 1.2.4
Modeltime unlocks time series forecast models and machine learning in one framework
12 versions - Latest release: over 1 year ago - 440 stars on GitHub
activitysim 1.1.3
An Open Platform for Activity-Based Travel Modeling
5 versions - Latest release: over 1 year ago - 152 stars on GitHub
cleanlab 2.1.0
cleanlab is the data-centric ML ops package for machine learning with noisy labels. cleanlab clea...
5 versions - Latest release: over 1 year ago - 1 dependent package - 1 dependent repositories - 5,564 stars on GitHub
r-catboost 1.1.1
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking,...
49 versions - Latest release: over 1 year ago - 7,013 stars on GitHub
Top 5.3% on conda-forge.org
catboost 1.1.1
General purpose gradient boosting on decision trees library with categorical features support out...
61 versions - Latest release: over 1 year ago - 9 dependent packages - 32 dependent repositories - 7,012 stars on GitHub
flaml 1.0.14
A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.
31 versions - Latest release: over 1 year ago - 1 dependent package - 1 dependent repositories - 2,337 stars on GitHub
dagster_slack 0.6.5
An orchestration platform for the development, production, and observation of data assets.
2 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster-pandera 1.0.17
An orchestration platform for the development, production, and observation of data assets.
34 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster_graphql 0.6.7
An orchestration platform for the development, production, and observation of data assets.
5 versions - Latest release: over 4 years ago - 2 dependent packages - 6,905 stars on GitHub
dagster-celery-k8s 1.0.17
Dagster lets you define pipelines in terms of the data flow between reusable, logical components,...
98 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-mlflow 1.0.17
An orchestration platform for the development, production, and observation of data assets.
70 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
Top 7.6% on conda-forge.org
dagster 1.0.17
Dagster lets you define pipelines in terms of the data flow between reusable, logical components,...
119 versions - Latest release: over 1 year ago - 60 dependent packages - 2 dependent repositories - 6,905 stars on GitHub
dagster_aws 0.6.6
An orchestration platform for the development, production, and observation of data assets.
2 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster-aws 1.0.17
An orchestration platform for the development, production, and observation of data assets.
55 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-pyspark 1.0.17
An orchestration platform for the development, production, and observation of data assets.
112 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-duckdb 1.0.17
An orchestration platform for the development, production, and observation of data assets.
5 versions - Latest release: over 1 year ago - 2 dependent packages - 6,905 stars on GitHub
dagster-ssh 1.0.17
An orchestration platform for the development, production, and observation of data assets.
114 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster_postgres 0.6.5
An orchestration platform for the development, production, and observation of data assets.
2 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster_twilio 0.6.5
An orchestration platform for the development, production, and observation of data assets.
2 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster_ge 0.6.7
An orchestration platform for the development, production, and observation of data assets.
5 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster-bash 0.7.16
An orchestration platform for the development, production, and observation of data assets.
11 versions - Latest release: almost 4 years ago - 6,905 stars on GitHub
dagster-fivetran 1.0.17
An orchestration platform for the development, production, and observation of data assets.
56 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-prometheus 1.0.17
An orchestration platform for the development, production, and observation of data assets.
114 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-spark 1.0.17
An orchestration platform for the development, production, and observation of data assets.
114 versions - Latest release: over 1 year ago - 1 dependent package - 6,905 stars on GitHub
dagster-duckdb-pandas 1.0.17
An orchestration platform for the development, production, and observation of data assets.
5 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-pandas 1.0.17
An orchestration platform for the development, production, and observation of data assets.
114 versions - Latest release: over 1 year ago - 4 dependent packages - 6,905 stars on GitHub
dagster-airbyte 1.0.17
An orchestration platform for the development, production, and observation of data assets.
47 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-managed-elements 1.0.17
An orchestration platform for the development, production, and observation of data assets.
3 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster_ssh 0.6.5
An orchestration platform for the development, production, and observation of data assets.
2 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster-duckdb-pyspark 1.0.17
An orchestration platform for the development, production, and observation of data assets.
5 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-census 1.0.17
An orchestration platform for the development, production, and observation of data assets.
12 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-dbt 1.0.17
An orchestration platform for the development, production, and observation of data assets.
106 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster_snowflake 0.6.5
An orchestration platform for the development, production, and observation of data assets.
2 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster_datadog 0.6.5
An orchestration platform for the development, production, and observation of data assets.
1 version - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster_pandas 0.6.7
An orchestration platform for the development, production, and observation of data assets.
5 versions - Latest release: over 4 years ago - 1 dependent package - 6,905 stars on GitHub
dagster_cron 0.6.5
An orchestration platform for the development, production, and observation of data assets.
2 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster-snowflake-pandas 1.0.17
An orchestration platform for the development, production, and observation of data assets.
12 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
kfp-pipeline-spec 0.1.16
Machine Learning Pipelines for Kubeflow
6 versions - Latest release: almost 2 years ago - 1 dependent package - 2 dependent repositories - 3,137 stars on GitHub
kfp 1.8.14
Kubeflow is a machine learning (ML) toolkit that is dedicated to making deployments of ML workflo...
58 versions - Latest release: over 1 year ago - 4 dependent packages - 2 dependent repositories - 3,137 stars on GitHub
kfp-server-api 1.8.4
Generated python client for the KF Pipelines server API
20 versions - Latest release: almost 2 years ago - 1 dependent package - 2 dependent repositories - 3,137 stars on GitHub
datacompy 0.8.3
Pandas and Spark DataFrame comparison for humans
9 versions - Latest release: over 1 year ago - 1 dependent repositories - 269 stars on GitHub
r-collapse 1.8.9 💰
Advanced and Fast Data Transformation in R
5 versions - Latest release: over 1 year ago - 2 dependent packages - 451 stars on GitHub
r-janitor 2.1.0
simple tools for data cleaning in R
8 versions - Latest release: over 3 years ago - 8 dependent packages - 8 dependent repositories - 1,224 stars on GitHub
Top 2.6% on conda-forge.org
folium 0.13.0
Python Data. Leaflet.js Maps.
17 versions - Latest release: over 1 year ago - 44 dependent packages - 259 dependent repositories - 6,139 stars on GitHub