Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata for many open source software ecosystems and registries.

conda-forge.org "data-science" keyword

Top 0.1% on conda-forge.org
pandas 1.5.1 💰
Flexible and powerful data analysis / manipulation library for Python, providing labeled data str...
56 versions - Latest release: over 1 year ago - 1,663 dependent packages - 11,366 dependent repositories - 37,320 stars on GitHub
Top 0.9% on conda-forge.org
matplotlib-base 3.6.2 💰
matplotlib is a python 2D plotting library which produces publication quality figures in a variet...
29 versions - Latest release: over 1 year ago - 1,213 dependent packages - 2,191 dependent repositories - 18,105 stars on GitHub
Top 0.8% on conda-forge.org
matplotlib 3.6.2 💰
matplotlib is a python 2D plotting library which produces publication quality figures in a variet...
30 versions - Latest release: over 1 year ago - 699 dependent packages - 10,108 dependent repositories - 18,105 stars on GitHub
Top 0.1% on conda-forge.org
scikit-learn 1.1.3 💰
scikit-learn: machine learning in Python
31 versions - Latest release: over 1 year ago - 647 dependent packages - 5,166 dependent repositories - 53,452 stars on GitHub
Top 1.0% on conda-forge.org
ipython 8.6.0 💰
IPython provides a rich architecture for interactive computing with a powerful interactive shell,...
72 versions - Latest release: over 1 year ago - 306 dependent packages - 3,810 dependent repositories - 15,737 stars on GitHub
Top 1.6% on conda-forge.org
seaborn 0.12.1
Seaborn is a Python visualization library based on matplotlib. It provides a high-level interface...
11 versions - Latest release: over 1 year ago - 181 dependent packages - 3,503 dependent repositories - 10,487 stars on GitHub
Top 1.7% on conda-forge.org
statsmodels 0.13.5
Statsmodels: statistical modeling and econometrics in Python
16 versions - Latest release: over 1 year ago - 130 dependent packages - 979 dependent repositories - 8,306 stars on GitHub
Top 1.6% on conda-forge.org
spacy 3.4.3
spaCy is a library for advanced natural language processing in Python and Cython.
68 versions - Latest release: over 1 year ago - 92 dependent packages - 174 dependent repositories - 25,557 stars on GitHub
Top 7.6% on conda-forge.org
dagster 1.0.17
Dagster lets you define pipelines in terms of the data flow between reusable, logical components,...
119 versions - Latest release: over 1 year ago - 60 dependent packages - 2 dependent repositories - 6,905 stars on GitHub
Top 2.6% on conda-forge.org
folium 0.13.0
Python Data. Leaflet.js Maps.
17 versions - Latest release: over 1 year ago - 44 dependent packages - 259 dependent repositories - 6,139 stars on GitHub
Top 2.5% on conda-forge.org
dash 2.7.0 💰
Data Apps & Dashboards for Python. No JavaScript Required.
87 versions - Latest release: over 1 year ago - 37 dependent packages - 108 dependent repositories - 18,331 stars on GitHub
Top 1.1% on conda-forge.org
keras 2.10.0
Deep Learning for humans
22 versions - Latest release: over 1 year ago - 29 dependent packages - 327 dependent repositories - 57,664 stars on GitHub
Top 5.1% on conda-forge.org
ray-core 2.0.1
Ray is a fast and simple framework for building and running distributed applications. It is split...
21 versions - Latest release: over 1 year ago - 21 dependent packages - 5 dependent repositories - 28,849 stars on GitHub
Top 6.4% on conda-forge.org
boltons 21.0.0
🔩 Like builtins, but boltons. 250+ constructs, recipes, and snippets which extend (and rely on no...
17 versions - Latest release: almost 3 years ago - 21 dependent packages - 15 dependent repositories - 6,089 stars on GitHub
Top 2.7% on conda-forge.org
gensim 4.2.0 💰
Gensim is a Python library for topic modelling, document indexing and similarity retrieval with l...
18 versions - Latest release: almost 2 years ago - 17 dependent packages - 105 dependent repositories - 14,085 stars on GitHub
Top 8.8% on conda-forge.org
orange3 3.33.0 💰
Open source data visualization and data analysis for novice and expert. Interactive workflows wit...
54 versions - Latest release: over 1 year ago - 15 dependent packages - 2 dependent repositories - 4,009 stars on GitHub
dash-table 5.0.0 💰
OBSOLETE: now part of https://github.com/plotly/dash
31 versions - Latest release: over 2 years ago - 12 dependent packages - 29 dependent repositories - 422 stars on GitHub
Top 6.1% on conda-forge.org
ray-default 2.0.1
Ray is a fast and simple framework for building and running distributed applications. It is split...
15 versions - Latest release: over 1 year ago - 11 dependent packages - 4 dependent repositories - 28,849 stars on GitHub
anndata 0.8.0
AnnData provides a scalable way of keeping track of data and learned annotations. It was initiall...
10 versions - Latest release: about 2 years ago - 10 dependent packages - 17 dependent repositories - 358 stars on GitHub
vaex-core 4.14.0
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of ...
54 versions - Latest release: over 1 year ago - 10 dependent packages - 1 dependent repository - 7,837 stars on GitHub
Top 5.1% on conda-forge.org
dvc 2.34.2
Data Version Control or DVC is an open-source tool for data science and machine learning projects.
212 versions - Latest release: over 1 year ago - 10 dependent packages - 26 dependent repositories - 11,242 stars on GitHub
Top 9.4% on conda-forge.org
dvc-base 1.11.16
Data Version Control or DVC is an open-source tool for data science and machine learning projects.
20 versions - Latest release: about 3 years ago - 9 dependent packages - 1 dependent repository - 11,242 stars on GitHub
modin-core 0.17.0
Modin: Scale your Pandas workflows by changing a single line of code
22 versions - Latest release: over 1 year ago - 9 dependent packages - 1 dependent repository - 8,468 stars on GitHub
Top 8.2% on conda-forge.org
torchmetrics 0.10.3
Torchmetrics is a metrics API created for easy metric development and usage in both PyTorch and [...
26 versions - Latest release: over 1 year ago - 9 dependent packages - 50 dependent repositories - 1,314 stars on GitHub
Top 5.3% on conda-forge.org
catboost 1.1.1
General purpose gradient boosting on decision trees library with categorical features support out...
61 versions - Latest release: over 1 year ago - 9 dependent packages - 32 dependent repositories - 7,012 stars on GitHub
Top 4.7% on conda-forge.org
imbalanced-learn 0.9.1
imbalanced-learn is a python package offering a number of re-sampling techniques commonly used in...
15 versions - Latest release: almost 2 years ago - 8 dependent packages - 118 dependent repositories - 6,276 stars on GitHub
Top 8.6% on conda-forge.org
tiledb 2.12.2
TileDB is an efficient multi-dimensional array management system which introduces a novel on-disk...
88 versions - Latest release: over 1 year ago - 8 dependent packages - 155 dependent repositories - 1,481 stars on GitHub
Top 5.8% on conda-forge.org
wandb 0.13.5
🔥 A tool for visualizing and tracking your machine learning experiments. This repo contains the C...
48 versions - Latest release: over 1 year ago - 8 dependent packages - 86 dependent repositories - 5,671 stars on GitHub
r-janitor 2.1.0
simple tools for data cleaning in R
8 versions - Latest release: over 3 years ago - 8 dependent packages - 8 dependent repositories - 1,224 stars on GitHub
flytekit 1.1.0
Flytekit Python is the Python Library for easily authoring, testing, deploying, and interacting w...
4 versions - Latest release: almost 2 years ago - 8 dependent packages - 123 stars on GitHub
_dvc 1.9.0
Data Version Control or DVC is an open-source tool for data science and machine learning projects.
1 version - Latest release: over 3 years ago - 8 dependent packages - 11,235 stars on GitHub
Top 4.9% on conda-forge.org
prefect 2.6.7
Prefect is a workflow management system, designed for modern infrastructure and powered by the op...
133 versions - Latest release: over 1 year ago - 8 dependent packages - 41 dependent repositories - 11,520 stars on GitHub
python-crfsuite 0.9.8
A python binding for crfsuite
7 versions - Latest release: about 2 years ago - 8 dependent packages - 2 dependent repositories - 749 stars on GitHub
Top 8.1% on conda-forge.org
featuretools 1.18.0
Featuretools is a framework to perform automated feature engineering. It excels at transforming t...
63 versions - Latest release: over 1 year ago - 7 dependent packages - 5 dependent repositories - 6,558 stars on GitHub
psyplot 1.4.3
psyplot is a cross-platform open source Python project that mainly combines the plotting utiliti...
10 versions - Latest release: over 1 year ago - 7 dependent packages - 10 dependent repositories - 65 stars on GitHub
Top 7.3% on conda-forge.org
tpot 0.11.7
Consider TPOT your Data Science Assistant. TPOT is a Python Automated Machine Learning tool that ...
19 versions - Latest release: about 3 years ago - 7 dependent packages - 5 dependent repositories - 8,986 stars on GitHub
Top 9.2% on conda-forge.org
scikit-plot 0.3.7
An intuitive library to add plotting functionality to scikit-learn objects.
6 versions - Latest release: over 5 years ago - 7 dependent packages - 13 dependent repositories - 2,295 stars on GitHub
dagster-graphql 1.0.17
An orchestration platform for the development, production, and observation of data assets.
114 versions - Latest release: over 1 year ago - 6 dependent packages - 6,905 stars on GitHub
allennlp 2.10.0
An Apache 2.0 NLP research library, built on PyTorch, for developing state-of-the-art deep learni...
16 versions - Latest release: almost 2 years ago - 6 dependent packages - 11,429 stars on GitHub
Top 6.4% on conda-forge.org
tensorflow-probability 0.18.0
TensorFlow Probability is a library for probabilistic reasoning and statistical analysis in Tenso...
13 versions - Latest release: over 1 year ago - 6 dependent packages - 35 dependent repositories - 3,875 stars on GitHub
Top 8.4% on conda-forge.org
scanpy 1.9.1
Scanpy is a scalable toolkit for analyzing single-cell gene expression data built jointly with an...
5 versions - Latest release: about 2 years ago - 6 dependent packages - 33 dependent repositories - 1,728 stars on GitHub
Top 5.0% on conda-forge.org
pandas-profiling 3.4.0
Create HTML profiling reports from pandas DataFrame objects
23 versions - Latest release: over 1 year ago - 6 dependent packages - 69 dependent repositories - 10,373 stars on GitHub
r-rio 0.5.29
A Swiss-Army Knife for Data I/O
5 versions - Latest release: over 2 years ago - 6 dependent packages - 6 dependent repositories - 548 stars on GitHub
Top 10.0% on conda-forge.org
doit 0.36.0 💰
`doit` is a task management & automation tool. `doit` comes from the idea of bringing the power ...
10 versions - Latest release: about 2 years ago - 6 dependent packages - 27 dependent repositories - 1,551 stars on GitHub
woodwork 0.20.0
Woodwork is a Python library that provides robust methods for managing and communicating data typ...
44 versions - Latest release: over 1 year ago - 5 dependent packages - 112 stars on GitHub
Top 9.3% on conda-forge.org
sktime 0.12.1 💰
A unified framework for machine learning with time series
18 versions - Latest release: over 1 year ago - 5 dependent packages - 3 dependent repositories - 6,633 stars on GitHub
modin-dask 0.17.0
Modin: Scale your Pandas workflows by changing a single line of code
22 versions - Latest release: over 1 year ago - 5 dependent packages - 1 dependent repository - 8,468 stars on GitHub
Top 8.0% on conda-forge.org
geemap 0.17.2 💰
A Python package for interactive mapping with Google Earth Engine, ipyleaflet, and folium.
99 versions - Latest release: over 1 year ago - 4 dependent packages - 27 dependent repositories - 2,792 stars on GitHub
dagster-pandas 1.0.17
An orchestration platform for the development, production, and observation of data assets.
114 versions - Latest release: over 1 year ago - 4 dependent packages - 6,905 stars on GitHub
psy-simple 1.4.1
This psyplot plugin provides plot methods for simple visualization tasks like 2D plots, line plot...
7 versions - Latest release: about 2 years ago - 4 dependent packages - 4 dependent repositories - 1 star on GitHub
Top 9.0% on conda-forge.org
r-tidyverse 1.3.2
Easily install and load packages from the tidyverse
5 versions - Latest release: almost 2 years ago - 4 dependent packages - 173 dependent repositories - 1,456 stars on GitHub
kfp 1.8.14
Kubeflow is a machine learning (ML) toolkit that is dedicated to making deployments of ML workflo...
58 versions - Latest release: over 1 year ago - 4 dependent packages - 2 dependent repositories - 3,137 stars on GitHub
modin-ray 0.17.0
Modin: Scale your Pandas workflows by changing a single line of code
22 versions - Latest release: over 1 year ago - 4 dependent packages - 8,468 stars on GitHub
Top 5.3% on conda-forge.org
seaborn-base 0.12.1
Seaborn is a Python visualization library based on matplotlib. It provides a high-level interface...
6 versions - Latest release: over 1 year ago - 4 dependent packages - 134 dependent repositories - 10,487 stars on GitHub
Top 7.3% on conda-forge.org
ray-tune 2.0.1
Ray is a fast and simple framework for building and running distributed applications. It is split...
21 versions - Latest release: over 1 year ago - 4 dependent packages - 6 dependent repositories - 28,849 stars on GitHub
rpy2 3.5.6
Interface to use R from Python
16 versions - Latest release: over 1 year ago - 4 dependent packages - 73 dependent repositories - 376 stars on GitHub
lifelines 0.27.4 💰
Survival analysis in Python
58 versions - Latest release: over 1 year ago - 4 dependent packages - 9 dependent repositories - 2,055 stars on GitHub
leafmap 0.12.1 💰
A Python package for geospatial analysis and interactive mapping in a Jupyter environment
53 versions - Latest release: over 1 year ago - 4 dependent packages - 16 dependent repositories - 1,543 stars on GitHub
modin 0.17.0
Modin: Scale your Pandas workflows by changing a single line of code
31 versions - Latest release: over 1 year ago - 3 dependent packages - 2 dependent repositories - 8,468 stars on GitHub
ray-all 2.0.1
Ray is a fast and simple framework for building and running distributed applications. It is split...
21 versions - Latest release: over 1 year ago - 3 dependent packages - 1 dependent repository - 28,849 stars on GitHub
r-mlr 2.19.1 💰
Machine Learning in R
10 versions - Latest release: over 1 year ago - 3 dependent packages - 1 dependent repository - 1,598 stars on GitHub
Top 10.0% on conda-forge.org
pyod 1.0.6 💰
A Comprehensive and Scalable Python Library for Outlier Detection (Anomaly Detection)
27 versions - Latest release: over 1 year ago - 3 dependent packages - 4 dependent repositories - 6,847 stars on GitHub
vaex-viz 0.5.4
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of ...
16 versions - Latest release: over 1 year ago - 3 dependent packages - 1 dependent repository - 7,837 stars on GitHub
dtreeviz 1.4.0
A python library for decision tree visualization and model interpretation.
13 versions - Latest release: over 1 year ago - 3 dependent packages - 1 dependent repository - 2,458 stars on GitHub
vaex-server 0.8.1
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of ...
12 versions - Latest release: over 2 years ago - 3 dependent packages - 1 dependent repository - 7,837 stars on GitHub
stumpy 1.11.1
STUMPY is a powerful and scalable Python library that computes something called a matrix profile,...
25 versions - Latest release: about 2 years ago - 3 dependent packages - 1 dependent repository - 2,618 stars on GitHub
turbodbc 4.5.5
Turbodbc is a Python module to access relational databases via the Open Database Connectivity (OD...
27 versions - Latest release: almost 2 years ago - 2 dependent packages - 2 dependent repositories - 566 stars on GitHub
modin-omnisci 0.15.3
Modin: Scale your Pandas workflows by changing a single line of code
18 versions - Latest release: over 1 year ago - 2 dependent packages - 8,468 stars on GitHub
r-targets 0.14.0
Function-oriented Make-like declarative workflows for R
18 versions - Latest release: over 1 year ago - 2 dependent packages - 1 dependent repository - 845 stars on GitHub
psyplot-gui 1.4.0
This package provides a graphical user interface to interact with the psyplot framework.
10 versions - Latest release: over 2 years ago - 2 dependent packages - 3 dependent repositories - 7 stars on GitHub
spectrafit 0.12.5
SpectraFit is a command-line and Jupyter-notebook tool for quick data-fitting based on the regula...
8 versions - Latest release: over 1 year ago - 2 dependent packages - 11 stars on GitHub
skrebate 0.5
These algorithms excel at identifying features that are predictive of the outcome in supervised l...
5 versions - Latest release: over 1 year ago - 2 dependent packages - 1 dependent repository - 376 stars on GitHub
mlxtend 0.21.0 💰
A library of Python tools and extensions for data science and machine learning.
17 versions - Latest release: over 1 year ago - 2 dependent packages - 9 dependent repositories - 4,310 stars on GitHub
dtale 2.9.0
Visualizer for pandas data structures
114 versions - Latest release: over 1 year ago - 2 dependent packages - 2 dependent repositories - 3,944 stars on GitHub
vaex-astro 0.9.2
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of ...
14 versions - Latest release: over 1 year ago - 2 dependent packages - 1 dependent repository - 7,837 stars on GitHub
great-expectations 0.15.32
Great Expectations helps teams save time and promote analytic integrity by offering a unique appr...
144 versions - Latest release: over 1 year ago - 2 dependent packages - 1 dependent repository - 8,121 stars on GitHub
dagster-celery 1.0.17
An orchestration platform for the development, production, and observation of data assets.
114 versions - Latest release: over 1 year ago - 2 dependent packages - 6,905 stars on GitHub
redshift_connector 2.0.908
redshift_connector is the Amazon Redshift connector for Python. Easy integration with pandas and ...
26 versions - Latest release: almost 2 years ago - 2 dependent packages - 178 stars on GitHub
dagster-duckdb 1.0.17
An orchestration platform for the development, production, and observation of data assets.
5 versions - Latest release: over 1 year ago - 2 dependent packages - 6,905 stars on GitHub
vaex-hdf5 0.13.0
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of ...
23 versions - Latest release: over 1 year ago - 2 dependent packages - 1 dependent repository - 7,837 stars on GitHub
statsforecast 1.3.0
StatsForecast offers a collection of widely used univariate time series forecasting models, i...
18 versions - Latest release: over 1 year ago - 2 dependent packages - 1 dependent repository - 2,350 stars on GitHub
visions 0.7.5
Type System for Data Analysis in Python
12 versions - Latest release: over 2 years ago - 2 dependent packages - 13 dependent repositories - 174 stars on GitHub
tsfresh 0.19.0
Automatic extraction of relevant features from time series
11 versions - Latest release: over 2 years ago - 2 dependent packages - 5 dependent repositories - 7,166 stars on GitHub
girder-client 3.1.15
A data management platform for the web, developed by Kitware
16 versions - Latest release: over 1 year ago - 2 dependent packages - 402 stars on GitHub
dagster_graphql 0.6.7
An orchestration platform for the development, production, and observation of data assets.
5 versions - Latest release: over 4 years ago - 2 dependent packages - 6,905 stars on GitHub
matrixprofile 1.1.10
A Python 3 library making time series data mining tasks, utilizing matrix profile algorithms, acc...
1 version - Latest release: over 3 years ago - 2 dependent packages - 1 dependent repository - 305 stars on GitHub
r-collapse 1.8.9 💰
Advanced and Fast Data Transformation in R
5 versions - Latest release: over 1 year ago - 2 dependent packages - 451 stars on GitHub
psy-maps 1.4.2
This psyplot plugin uses the cartopy package to visualize geo-referenced data on a map
9 versions - Latest release: about 2 years ago - 2 dependent packages - 5 dependent repositories - 8 stars on GitHub
u8darts 0.22.0
A python library for user-friendly forecasting and anomaly detection on time series.
18 versions - Latest release: over 1 year ago - 2 dependent packages - 1 dependent repository - 5,568 stars on GitHub
_dvc-hdfs 1.9.0
Data Version Control or DVC is an open-source tool for data science and machine learning projects.
1 version - Latest release: over 3 years ago - 1 dependent package - 11,242 stars on GitHub
allennlp-all 2.10.0
An Apache 2.0 NLP research library, built on PyTorch, for developing state-of-the-art deep learni...
4 versions - Latest release: almost 2 years ago - 1 dependent package - 11,429 stars on GitHub
dvc-azure 2.20.4
Data Version Control or DVC is an open-source tool for data science and machine learning projects.
71 versions - Latest release: over 1 year ago - 1 dependent package - 1 dependent repository - 11,242 stars on GitHub
_dvc-gdrive 1.9.0
Data Version Control or DVC is an open-source tool for data science and machine learning projects.
1 version - Latest release: over 3 years ago - 1 dependent package - 11,242 stars on GitHub
snorkel 0.9.9
Snorkel is a system for programmatically building and managing training datasets to rapidly and f...
10 versions - Latest release: almost 2 years ago - 1 dependent package - 4 dependent repositories - 5,445 stars on GitHub
dagster_pandas 0.6.7
An orchestration platform for the development, production, and observation of data assets.
5 versions - Latest release: over 4 years ago - 1 dependent package - 6,905 stars on GitHub
nipype 1.8.5
Workflows and interfaces for neuroimaging packages
45 versions - Latest release: over 1 year ago - 1 dependent package - 5 dependent repositories - 661 stars on GitHub
dvc-gs 2.20.0
Data Version Control or DVC is an open-source tool for data science and machine learning projects.
69 versions - Latest release: over 1 year ago - 1 dependent package - 1 dependent repository - 11,242 stars on GitHub
cleanlab 2.1.0
cleanlab is the data-centric ML ops package for machine learning with noisy labels. cleanlab clea...
5 versions - Latest release: over 1 year ago - 1 dependent package - 1 dependent repository - 5,564 stars on GitHub
_dvc-ssh 1.9.0
Data Version Control or DVC is an open-source tool for data science and machine learning projects.
1 version - Latest release: over 3 years ago - 1 dependent package - 11,242 stars on GitHub
koalas 1.8.2
Koalas: pandas API on Apache Spark
42 versions - Latest release: over 2 years ago - 1 dependent package - 3,256 stars on GitHub
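
Each entry above ends with a statistics line whose fields are separated by " - ", and some fields (dependent repositories, for example) are absent for some packages. As a rough sketch of how such a line could be turned into structured data, the Python snippet below parses one line into a small record. The names PackageStats, LABELS and parse_stats_line are hypothetical helpers invented for this illustration; they are not part of the Ecosyste.ms API, and the snippet works only on the text format shown in this listing.

```python
import re
from dataclasses import dataclass
from typing import Optional


@dataclass
class PackageStats:
    """Hypothetical record for one statistics line of the listing."""
    versions: Optional[int] = None
    latest_release: Optional[str] = None
    dependent_packages: Optional[int] = None
    dependent_repositories: Optional[int] = None
    stars: Optional[int] = None


# Map each label seen in the listing (singular or plural) to a field name.
LABELS = {
    "version": "versions",
    "versions": "versions",
    "dependent package": "dependent_packages",
    "dependent packages": "dependent_packages",
    "dependent repository": "dependent_repositories",
    "dependent repositories": "dependent_repositories",
    "star on GitHub": "stars",
    "stars on GitHub": "stars",
}


def parse_stats_line(line: str) -> PackageStats:
    """Parse a line such as '56 versions - Latest release: ... - 37,320 stars on GitHub'."""
    stats = PackageStats()
    for field in line.strip().split(" - "):
        field = field.strip()
        if field.startswith("Latest release:"):
            # Keep the relative age as free text, e.g. "over 1 year ago".
            stats.latest_release = field.split(":", 1)[1].strip()
            continue
        match = re.fullmatch(r"([\d,]+) (.+)", field)
        if match and match.group(2) in LABELS:
            # Strip thousands separators before converting to int.
            count = int(match.group(1).replace(",", ""))
            setattr(stats, LABELS[match.group(2)], count)
    return stats


pandas_line = (
    "56 versions - Latest release: over 1 year ago - 1,663 dependent packages"
    " - 11,366 dependent repositories - 37,320 stars on GitHub"
)
print(parse_stats_line(pandas_line))
```

Fields missing from a line are left as None rather than 0, so "no dependent repositories reported" stays distinguishable from a reported count.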