Ecosyste.ms: Packages

An open API service providing package, version, and dependency metadata for many open source software ecosystems and registries.

conda-forge.org "data-science" keyword

Top 1.1% on conda-forge.org
keras 2.10.0
Deep Learning for humans
22 versions - Latest release: over 1 year ago - 29 dependent packages - 327 dependent repositories - 57,664 stars on GitHub
Top 0.1% on conda-forge.org
scikit-learn 1.1.3 💰
scikit-learn: machine learning in Python
31 versions - Latest release: over 1 year ago - 647 dependent packages - 5,166 dependent repositories - 53,452 stars on GitHub
superset 2.0.0
Apache Superset is a Data Visualization and Data Exploration Platform
12 versions - Latest release: almost 2 years ago - 51,076 stars on GitHub
Top 0.1% on conda-forge.org
pandas 1.5.1 💰
Flexible and powerful data analysis / manipulation library for Python, providing labeled data str...
56 versions - Latest release: over 1 year ago - 1,663 dependent packages - 11,366 dependent repositories - 37,320 stars on GitHub
apache-airflow-providers-microsoft-azure 4.0.0
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
16 versions - Latest release: almost 2 years ago - 33,967 stars on GitHub
apache-airflow-providers-samba 4.0.0
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
7 versions - Latest release: almost 2 years ago - 33,057 stars on GitHub
Top 5.1% on conda-forge.org
ray-core 2.0.1
Ray is a fast and simple framework for building and running distributed applications. It is split...
21 versions - Latest release: over 1 year ago - 21 dependent packages - 5 dependent repositories - 28,849 stars on GitHub
ray-rllib 2.0.1
Ray is a fast and simple framework for building and running distributed applications. It is split...
21 versions - Latest release: over 1 year ago - 1 dependent package - 1 dependent repository - 28,849 stars on GitHub
ray-serve 2.0.1
Ray is a fast and simple framework for building and running distributed applications. It is split...
21 versions - Latest release: over 1 year ago - 1 dependent package - 1 dependent repository - 28,849 stars on GitHub
ray-all 2.0.1
Ray is a fast and simple framework for building and running distributed applications. It is split...
21 versions - Latest release: over 1 year ago - 3 dependent packages - 1 dependent repository - 28,849 stars on GitHub
ray-data 2.0.1
Ray is a fast and simple framework for building and running distributed applications. It is split...
10 versions - Latest release: over 1 year ago - 1 dependent package - 1 dependent repository - 28,849 stars on GitHub
Top 7.3% on conda-forge.org
ray-tune 2.0.1
Ray is a fast and simple framework for building and running distributed applications. It is split...
21 versions - Latest release: over 1 year ago - 4 dependent packages - 6 dependent repositories - 28,849 stars on GitHub
Top 6.1% on conda-forge.org
ray-default 2.0.1
Ray is a fast and simple framework for building and running distributed applications. It is split...
15 versions - Latest release: over 1 year ago - 11 dependent packages - 4 dependent repositories - 28,849 stars on GitHub
ray-dashboard 2.0.1
Ray is a fast and simple framework for building and running distributed applications. It is split...
21 versions - Latest release: over 1 year ago - 1 dependent package - 2 dependent repositories - 26,719 stars on GitHub
ray-k8s 2.0.1
Ray is a fast and simple framework for building and running distributed applications. It is split...
19 versions - Latest release: over 1 year ago - 1 dependent package - 26,341 stars on GitHub
Top 1.6% on conda-forge.org
spacy 3.4.3
spaCy is a library for advanced natural language processing in Python and Cython.
68 versions - Latest release: over 1 year ago - 92 dependent packages - 174 dependent repositories - 25,557 stars on GitHub
ray-autoscaler 1.1.0
Ray is a fast and simple framework for building and running distributed applications.
2 versions - Latest release: about 3 years ago - 1 dependent package - 24,669 stars on GitHub
Top 2.5% on conda-forge.org
dash 2.7.0 💰
Data Apps & Dashboards for Python. No JavaScript Required.
87 versions - Latest release: over 1 year ago - 37 dependent packages - 108 dependent repositories - 18,331 stars on GitHub
Top 0.9% on conda-forge.org
matplotlib-base 3.6.2 💰
matplotlib is a Python 2D plotting library which produces publication quality figures in a variet...
29 versions - Latest release: over 1 year ago - 1,213 dependent packages - 2,191 dependent repositories - 18,105 stars on GitHub
Top 0.8% on conda-forge.org
matplotlib 3.6.2 💰
matplotlib is a Python 2D plotting library which produces publication quality figures in a variet...
30 versions - Latest release: over 1 year ago - 699 dependent packages - 10,108 dependent repositories - 18,105 stars on GitHub
mpl_sample_data 3.4.3 💰
matplotlib is a Python 2D plotting library which produces publication quality figures in a variet...
29 versions - Latest release: over 2 years ago - 17,056 stars on GitHub
d2l 0.17.6
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 400 u...
2 versions - Latest release: over 1 year ago - 16,875 stars on GitHub
Top 1.0% on conda-forge.org
ipython 8.6.0 💰
IPython provides a rich architecture for interactive computing with a powerful interactive shell,...
72 versions - Latest release: over 1 year ago - 306 dependent packages - 3,810 dependent repositories - 15,737 stars on GitHub
Top 2.7% on conda-forge.org
gensim 4.2.0 💰
Gensim is a Python library for topic modelling, document indexing and similarity retrieval with l...
18 versions - Latest release: almost 2 years ago - 17 dependent packages - 105 dependent repositories - 14,085 stars on GitHub
Top 4.9% on conda-forge.org
prefect 2.6.7
Prefect is a workflow management system, designed for modern infrastructure and powered by the op...
133 versions - Latest release: over 1 year ago - 8 dependent packages - 41 dependent repositories - 11,520 stars on GitHub
allennlp-all 2.10.0
An Apache 2.0 NLP research library, built on PyTorch, for developing state-of-the-art deep learni...
4 versions - Latest release: almost 2 years ago - 1 dependent package - 11,429 stars on GitHub
allennlp-checklist 2.10.0
An Apache 2.0 NLP research library, built on PyTorch, for developing state-of-the-art deep learni...
4 versions - Latest release: almost 2 years ago - 1 dependent package - 11,429 stars on GitHub
allennlp 2.10.0
An Apache 2.0 NLP research library, built on PyTorch, for developing state-of-the-art deep learni...
16 versions - Latest release: almost 2 years ago - 6 dependent packages - 11,429 stars on GitHub
dvc-webhdfs 2.19.0
Data Version Control or DVC is an open-source tool for data science and machine learning projects.
64 versions - Latest release: over 1 year ago - 1 dependent repository - 11,242 stars on GitHub
_dvc-ssh 1.9.0
Data Version Control or DVC is an open-source tool for data science and machine learning projects.
1 version - Latest release: over 3 years ago - 1 dependent package - 11,242 stars on GitHub
dvc-ssh 2.20.0
Data Version Control or DVC is an open-source tool for data science and machine learning projects.
67 versions - Latest release: over 1 year ago - 1 dependent package - 1 dependent repository - 11,242 stars on GitHub
Top 9.4% on conda-forge.org
dvc-base 1.11.16
Data Version Control or DVC is an open-source tool for data science and machine learning projects.
20 versions - Latest release: about 3 years ago - 9 dependent packages - 1 dependent repository - 11,242 stars on GitHub
_dvc-s3 1.9.0
Data Version Control or DVC is an open-source tool for data science and machine learning projects.
1 version - Latest release: over 3 years ago - 1 dependent package - 11,242 stars on GitHub
Top 5.1% on conda-forge.org
dvc 2.34.2
Data Version Control or DVC is an open-source tool for data science and machine learning projects.
212 versions - Latest release: over 1 year ago - 10 dependent packages - 26 dependent repositories - 11,242 stars on GitHub
dvc-webdav 2.19.0
Data Version Control or DVC is an open-source tool for data science and machine learning projects.
24 versions - Latest release: over 1 year ago - 11,242 stars on GitHub
_dvc-oss 1.9.0
Data Version Control or DVC is an open-source tool for data science and machine learning projects.
1 version - Latest release: over 3 years ago - 1 dependent package - 11,242 stars on GitHub
dvc-azure 2.20.4
Data Version Control or DVC is an open-source tool for data science and machine learning projects.
71 versions - Latest release: over 1 year ago - 1 dependent package - 1 dependent repository - 11,242 stars on GitHub
dvc-s3 2.21.0
Data Version Control or DVC is an open-source tool for data science and machine learning projects.
70 versions - Latest release: over 1 year ago - 1 dependent package - 12 dependent repositories - 11,242 stars on GitHub
_dvc-hdfs 1.9.0
Data Version Control or DVC is an open-source tool for data science and machine learning projects.
1 version - Latest release: over 3 years ago - 1 dependent package - 11,242 stars on GitHub
dvc-gs 2.20.0
Data Version Control or DVC is an open-source tool for data science and machine learning projects.
69 versions - Latest release: over 1 year ago - 1 dependent package - 1 dependent repository - 11,242 stars on GitHub
_dvc-gs 1.9.0
Data Version Control or DVC is an open-source tool for data science and machine learning projects.
1 version - Latest release: over 3 years ago - 1 dependent package - 11,242 stars on GitHub
_dvc-gdrive 1.9.0
Data Version Control or DVC is an open-source tool for data science and machine learning projects.
1 version - Latest release: over 3 years ago - 1 dependent package - 11,242 stars on GitHub
_dvc-azure 1.9.0
Data Version Control or DVC is an open-source tool for data science and machine learning projects.
1 version - Latest release: over 3 years ago - 1 dependent package - 11,242 stars on GitHub
dvc-gdrive 2.19.0
Data Version Control or DVC is an open-source tool for data science and machine learning projects.
67 versions - Latest release: over 1 year ago - 1 dependent package - 2 dependent repositories - 11,242 stars on GitHub
dvc-oss 2.19.0
Data Version Control or DVC is an open-source tool for data science and machine learning projects.
67 versions - Latest release: over 1 year ago - 1 dependent package - 1 dependent repository - 11,242 stars on GitHub
dvc-hdfs 2.19.0
Data Version Control or DVC is an open-source tool for data science and machine learning projects.
67 versions - Latest release: over 1 year ago - 1 dependent package - 1 dependent repository - 11,242 stars on GitHub
_dvc 1.9.0
Data Version Control or DVC is an open-source tool for data science and machine learning projects.
1 version - Latest release: over 3 years ago - 8 dependent packages - 11,235 stars on GitHub
Top 1.6% on conda-forge.org
seaborn 0.12.1
Seaborn is a Python visualization library based on matplotlib. It provides a high-level interface...
11 versions - Latest release: over 1 year ago - 181 dependent packages - 3,503 dependent repositories - 10,487 stars on GitHub
Top 5.3% on conda-forge.org
seaborn-base 0.12.1
Seaborn is a Python visualization library based on matplotlib. It provides a high-level interface...
6 versions - Latest release: over 1 year ago - 4 dependent packages - 134 dependent repositories - 10,487 stars on GitHub
Top 5.0% on conda-forge.org
pandas-profiling 3.4.0
Create HTML profiling reports from pandas DataFrame objects
23 versions - Latest release: over 1 year ago - 6 dependent packages - 69 dependent repositories - 10,373 stars on GitHub
openrefine 3.5.0 💰
OpenRefine is a free, open source power tool for working with messy data and improving it
3 versions - Latest release: over 2 years ago - 9,377 stars on GitHub
tpot-skrebate 0.11.7
Consider TPOT your Data Science Assistant. TPOT is a Python Automated Machine Learning tool that ...
1 version - Latest release: about 3 years ago - 1 dependent package - 8,986 stars on GitHub
tpot-full 0.11.7
Consider TPOT your Data Science Assistant. TPOT is a Python Automated Machine Learning tool that ...
1 version - Latest release: about 3 years ago - 8,986 stars on GitHub
tpot-mdr 0.11.7
Consider TPOT your Data Science Assistant. TPOT is a Python Automated Machine Learning tool that ...
1 version - Latest release: about 3 years ago - 1 dependent package - 8,986 stars on GitHub
tpot-torch 0.11.7
Consider TPOT your Data Science Assistant. TPOT is a Python Automated Machine Learning tool that ...
1 version - Latest release: about 3 years ago - 1 dependent package - 8,986 stars on GitHub
Top 7.3% on conda-forge.org
tpot 0.11.7
Consider TPOT your Data Science Assistant. TPOT is a Python Automated Machine Learning tool that ...
19 versions - Latest release: about 3 years ago - 7 dependent packages - 5 dependent repositories - 8,986 stars on GitHub
tpot-imblearn 0.11.7
Consider TPOT your Data Science Assistant. TPOT is a Python Automated Machine Learning tool that ...
1 version - Latest release: about 3 years ago - 1 dependent package - 8,986 stars on GitHub
tpot-dask 0.11.7
Consider TPOT your Data Science Assistant. TPOT is a Python Automated Machine Learning tool that ...
1 version - Latest release: about 3 years ago - 1 dependent package - 8,986 stars on GitHub
modin-omnisci 0.15.3
Modin: Scale your Pandas workflows by changing a single line of code
18 versions - Latest release: over 1 year ago - 2 dependent packages - 8,468 stars on GitHub
modin-core 0.17.0
Modin: Scale your Pandas workflows by changing a single line of code
22 versions - Latest release: over 1 year ago - 9 dependent packages - 1 dependent repository - 8,468 stars on GitHub
modin-all 0.17.0
Modin: Scale your Pandas workflows by changing a single line of code
22 versions - Latest release: over 1 year ago - 8,468 stars on GitHub
modin-dask 0.17.0
Modin: Scale your Pandas workflows by changing a single line of code
22 versions - Latest release: over 1 year ago - 5 dependent packages - 1 dependent repository - 8,468 stars on GitHub
modin-hdk 0.17.0
Modin: Scale your Pandas workflows by changing a single line of code
4 versions - Latest release: over 1 year ago - 8,468 stars on GitHub
modin 0.17.0
Modin: Scale your Pandas workflows by changing a single line of code
31 versions - Latest release: over 1 year ago - 3 dependent packages - 2 dependent repositories - 8,468 stars on GitHub
modin-ray 0.17.0
Modin: Scale your Pandas workflows by changing a single line of code
22 versions - Latest release: over 1 year ago - 4 dependent packages - 8,468 stars on GitHub
Top 1.7% on conda-forge.org
statsmodels 0.13.5
Statsmodels: statistical modeling and econometrics in Python
16 versions - Latest release: over 1 year ago - 130 dependent packages - 979 dependent repositories - 8,306 stars on GitHub
great-expectations 0.15.32
Great Expectations helps teams save time and promote analytic integrity by offering a unique appr...
144 versions - Latest release: over 1 year ago - 2 dependent packages - 1 dependent repository - 8,121 stars on GitHub
vaex-ui 0.3.0
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of ...
5 versions - Latest release: over 4 years ago - 1 dependent package - 7,837 stars on GitHub
vaex-jupyter 0.8.0
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of ...
15 versions - Latest release: almost 2 years ago - 1 dependent package - 1 dependent repository - 7,837 stars on GitHub
vaex-hdf5 0.13.0
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of ...
23 versions - Latest release: over 1 year ago - 2 dependent packages - 1 dependent repository - 7,837 stars on GitHub
vaex-ml 0.18.0
Wrappers for various machine learning libraries to make them integrate into vaex.
16 versions - Latest release: almost 2 years ago - 1 dependent package - 1 dependent repository - 7,837 stars on GitHub
vaex-viz 0.5.4
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of ...
16 versions - Latest release: over 1 year ago - 3 dependent packages - 1 dependent repository - 7,837 stars on GitHub
vaex-core 4.14.0
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of ...
54 versions - Latest release: over 1 year ago - 10 dependent packages - 1 dependent repository - 7,837 stars on GitHub
vaex-distributed 0.3.0
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of ...
3 versions - Latest release: about 5 years ago - 1 dependent package - 7,837 stars on GitHub
vaex-astro 0.9.2
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of ...
14 versions - Latest release: over 1 year ago - 2 dependent packages - 1 dependent repository - 7,837 stars on GitHub
vaex-server 0.8.1
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of ...
12 versions - Latest release: over 2 years ago - 3 dependent packages - 1 dependent repository - 7,837 stars on GitHub
vaex-arrow 0.5.1
Arrow support for vaex (out of core dataframes)
12 versions - Latest release: almost 4 years ago - 1 dependent package - 7,837 stars on GitHub
pycaret 2.3.10
An open-source, low-code machine learning library in Python
13 versions - Latest release: about 2 years ago - 3 dependent repositories - 7,561 stars on GitHub
tsfresh 0.19.0
Automatic extraction of relevant features from time series
11 versions - Latest release: over 2 years ago - 2 dependent packages - 5 dependent repositories - 7,166 stars on GitHub
r-catboost 1.1.1
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking,...
49 versions - Latest release: over 1 year ago - 7,013 stars on GitHub
Top 5.3% on conda-forge.org
catboost 1.1.1
General purpose gradient boosting on decision trees library with categorical features support out...
61 versions - Latest release: over 1 year ago - 9 dependent packages - 32 dependent repositories - 7,012 stars on GitHub
dagster-ge 1.0.17
An orchestration platform for the development, production, and observation of data assets.
103 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-datadog 1.0.17
An orchestration platform for the development, production, and observation of data assets.
114 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-bash 0.7.16
An orchestration platform for the development, production, and observation of data assets.
11 versions - Latest release: almost 4 years ago - 6,905 stars on GitHub
dagster_graphql 0.6.7
An orchestration platform for the development, production, and observation of data assets.
5 versions - Latest release: over 4 years ago - 2 dependent packages - 6,905 stars on GitHub
dagster-pyspark 1.0.17
An orchestration platform for the development, production, and observation of data assets.
112 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-ssh 1.0.17
An orchestration platform for the development, production, and observation of data assets.
114 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-duckdb 1.0.17
An orchestration platform for the development, production, and observation of data assets.
5 versions - Latest release: over 1 year ago - 2 dependent packages - 6,905 stars on GitHub
dagster_datadog 0.6.5
An orchestration platform for the development, production, and observation of data assets.
1 version - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster_ge 0.6.7
An orchestration platform for the development, production, and observation of data assets.
5 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster-pandas 1.0.17
An orchestration platform for the development, production, and observation of data assets.
114 versions - Latest release: over 1 year ago - 4 dependent packages - 6,905 stars on GitHub
dagster_snowflake 0.6.5
An orchestration platform for the development, production, and observation of data assets.
2 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster_pandas 0.6.7
An orchestration platform for the development, production, and observation of data assets.
5 versions - Latest release: over 4 years ago - 1 dependent package - 6,905 stars on GitHub
dagster-spark 1.0.17
An orchestration platform for the development, production, and observation of data assets.
114 versions - Latest release: over 1 year ago - 1 dependent package - 6,905 stars on GitHub
dagster-census 1.0.17
An orchestration platform for the development, production, and observation of data assets.
12 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-prometheus 1.0.17
An orchestration platform for the development, production, and observation of data assets.
114 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-duckdb-pyspark 1.0.17
An orchestration platform for the development, production, and observation of data assets.
5 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-managed-elements 1.0.17
An orchestration platform for the development, production, and observation of data assets.
3 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster_bash 0.6.5
An orchestration platform for the development, production, and observation of data assets.
2 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster-celery-k8s 1.0.17
Dagster lets you define pipelines in terms of the data flow between reusable, logical components,...
98 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
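
Each entry above ends with a statistics line whose fields are separated by " - ", and some fields (dependent packages, dependent repositories) are simply absent when the count is zero. As a rough sketch of how such a line could be turned into structured data, the Python below splits it into named fields. The names `PackageStats` and `parse_stats_line` are hypothetical helpers made up for this illustration; they are not part of the Ecosyste.ms API, and the field layout is assumed only from the lines shown on this page.

```python
import re
from dataclasses import dataclass
from typing import Optional


@dataclass
class PackageStats:
    """Hypothetical container for one statistics line from this listing."""
    versions: Optional[int] = None
    latest_release: str = ""
    dependent_packages: Optional[int] = None
    dependent_repositories: Optional[int] = None
    stars: Optional[int] = None


# A count field looks like "1,663 dependent packages": a number, a space, a label.
_COUNT = re.compile(r"^([\d,]+) (.+)$")

# Map every label seen in the listing (singular and plural) to a field name.
_LABELS = {
    "version": "versions",
    "versions": "versions",
    "dependent package": "dependent_packages",
    "dependent packages": "dependent_packages",
    "dependent repository": "dependent_repositories",
    "dependent repositories": "dependent_repositories",
    "stars on GitHub": "stars",
}


def parse_stats_line(line: str) -> PackageStats:
    """Parse a line such as '12 versions - Latest release: ... - 51,076 stars on GitHub'."""
    stats = PackageStats()
    for part in line.strip().split(" - "):
        if part.startswith("Latest release: "):
            # Keep the relative age as text; the listing gives no exact date.
            stats.latest_release = part[len("Latest release: "):]
            continue
        match = _COUNT.match(part)
        if match is None:
            continue  # Unknown field shape: skip it rather than guess.
        field = _LABELS.get(match.group(2))
        if field is not None:
            setattr(stats, field, int(match.group(1).replace(",", "")))
    return stats


# Example using the pandas entry from the listing above.
pandas_stats = parse_stats_line(
    "56 versions - Latest release: over 1 year ago - 1,663 dependent packages"
    " - 11,366 dependent repositories - 37,320 stars on GitHub"
)
print(pandas_stats.stars)  # 37320
```

Fields that do not appear in a line are left as `None` rather than set to zero, since the listing does not state whether a missing count means zero or unknown.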