Ecosyste.ms: Packages
An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.
conda-forge.org "data-science" keyword
vaex-arrow 0.5.1
Arrow support for vaex (out of core dataframes)12 versions - Latest release: almost 4 years ago - 1 dependent package - 7,837 stars on GitHub
causalnex 0.11.0
A Python library that helps data scientists to infer causation rather than observing correlation.1 version - Latest release: over 1 year ago - 1,811 stars on GitHub
psyplot 1.4.3
psyplot is an cross-platform open source python project that mainly combines the plotting utiliti...10 versions - Latest release: over 1 year ago - 7 dependent packages - 10 dependent repositories - 65 stars on GitHub
datacompy 0.8.3
Pandas and Spark DataFrame comparison for humans9 versions - Latest release: over 1 year ago - 1 dependent repositories - 269 stars on GitHub
medaprep 0.1.1
medaprep is a data preparation and feature engineering toolkit for geospatial applications.1 version - Latest release: over 1 year ago - 1 stars on GitHub
statsforecast 1.3.0
**StatsForecast** offers a collection of widely used univariate time series forecasting models, i...18 versions - Latest release: over 1 year ago - 2 dependent packages - 1 dependent repositories - 2,350 stars on GitHub
chemml 1.2
ChemML is a machine learning and informatics program suite for the analysis, mining, and modeling...4 versions - Latest release: about 2 years ago - 128 stars on GitHub
gspread-pandas 2.2.3
A package to easily open an instance of a Google spreadsheet and interact with worksheets through...3 versions - Latest release: about 4 years ago - 357 stars on GitHub
composeml 0.9.1
A machine learning tool for automated prediction engineering. It allows you to easily structure p...9 versions - Latest release: over 1 year ago - 409 stars on GitHub
traceml 1.0.0
Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for P...1 version - Latest release: almost 2 years ago - 463 stars on GitHub
jupyterlab_templates 0.3.2
Jupyter notebook templates5 versions - Latest release: over 1 year ago - 314 stars on GitHub
Top 6.4% on conda-forge.org
17 versions - Latest release: almost 3 years ago - 21 dependent packages - 15 dependent repositories - 6,089 stars on GitHub
boltons 21.0.0
🔩 Like builtins, but boltons. 250+ constructs, recipes, and snippets which extend (and rely on no...17 versions - Latest release: almost 3 years ago - 21 dependent packages - 15 dependent repositories - 6,089 stars on GitHub
ipychart 0.4.0
ipychart is an ipywidget which allows to create dynamic, refined and customizable charts within t...9 versions - Latest release: about 2 years ago - 71 stars on GitHub
dvc-gdrive 2.19.0
Data Version Control or DVC is an open-source tool for data science and machine learning projects.67 versions - Latest release: over 1 year ago - 1 dependent package - 2 dependent repositories - 11,242 stars on GitHub
r-datacomparer 0.1.4
dataCompareR is an R package that allows users to compare two datasets and view a report on the s...2 versions - Latest release: over 2 years ago - 68 stars on GitHub
r-drake 7.13.4
An R-focused pipeline toolkit for reproducibility and high-performance computing21 versions - Latest release: over 1 year ago - 1,321 stars on GitHub
ploomber 0.21.7
The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️60 versions - Latest release: over 1 year ago - 2 dependent repositories - 3,017 stars on GitHub
genestboost 0.3.1
genestboost is an ML boosting library that separates the modeling algorithm from the boosting alg...3 versions - Latest release: almost 3 years ago - 2 stars on GitHub
shogun-cpp 6.1.4 💰
The Shogun Machine learning toolbox offers a wide range of efficient and unified Machine Learning...6 versions - Latest release: about 4 years ago - 1 dependent package - 1 dependent repositories - 2,933 stars on GitHub
rubrix 0.18.0
Rubrix is a **production-ready Python framework for exploring, annotating, and managing data** in...21 versions - Latest release: over 1 year ago - 1 dependent package - 1,710 stars on GitHub
skweak 0.3.3
skweak: A software toolkit for weak supervision applied to NLP tasks4 versions - Latest release: over 1 year ago - 870 stars on GitHub
nipype 1.8.5
Workflows and interfaces for neuroimaging packages45 versions - Latest release: over 1 year ago - 1 dependent package - 5 dependent repositories - 661 stars on GitHub
r-tarchetypes 0.7.2
Archetypes for targets and pipelines14 versions - Latest release: over 1 year ago - 1 dependent repositories - 89 stars on GitHub
pdpipe 0.3.2
Ever written a preprocessing pipeline for pandas dataframes and had trouble serializing it for la...20 versions - Latest release: over 1 year ago - 703 stars on GitHub
psy-maps 1.4.2
This psyplot plugin uses the cartopy package to visualize geo-referenced data on a map9 versions - Latest release: about 2 years ago - 2 dependent packages - 5 dependent repositories - 8 stars on GitHub
mljar-mercury 0.5.1
Build Web Apps in Jupyter Notebook with Python only3 versions - Latest release: about 2 years ago - 2,513 stars on GitHub
great-expectations 0.15.32
Great Expectations helps teams save time and promote analytic integrity by offering a unique appr...144 versions - Latest release: over 1 year ago - 2 dependent packages - 1 dependent repositories - 8,121 stars on GitHub
pandas_schema 0.3.5
A validation library for Pandas data frames using user-friendly schemas1 version - Latest release: about 4 years ago - 1 dependent repositories - 180 stars on GitHub
dagster-ge 1.0.17
An orchestration platform for the development, production, and observation of data assets.103 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster_pagerduty 0.6.4
An orchestration platform for the development, production, and observation of data assets.1 version - Latest release: over 4 years ago - 6,905 stars on GitHub
dagit 1.0.17
An orchestration platform for the development, production, and observation of data assets.118 versions - Latest release: over 1 year ago - 2 dependent repositories - 6,905 stars on GitHub
r-dalex 2.4.2 💰
moDel Agnostic Language for Exploration and eXplanation18 versions - Latest release: almost 2 years ago - 1 dependent package - 1,173 stars on GitHub
r-gghoriplot 1.0.1
A user-friendly, highly customizable R package for building horizon plots in ggplot22 versions - Latest release: over 1 year ago - 121 stars on GitHub
fastds 0.6.0
fds is a tool for Data Scientists made by DAGsHub to version control data and code at once. At a...12 versions - Latest release: over 2 years ago - 365 stars on GitHub
castoredc_api 0.1.4
Python Wrapper for Castor EDC API7 versions - Latest release: almost 2 years ago - 2 stars on GitHub
deon 0.3.0
deon is a command line tool that allows you to easily add an ethics checklist to your data scienc...3 versions - Latest release: over 3 years ago - 250 stars on GitHub
bowtie-py 0.11.0
Bowtie is a library for writing dashboards in Python. No need to know web frameworks or JavaScrip...7 versions - Latest release: over 5 years ago - 756 stars on GitHub
psy-reg 1.4.0
This psyplot plugin can be used to make fits to your data and visualize them5 versions - Latest release: over 2 years ago - 1 dependent package - 5 dependent repositories - 1 stars on GitHub
Top 2.6% on conda-forge.org
17 versions - Latest release: over 1 year ago - 44 dependent packages - 259 dependent repositories - 6,139 stars on GitHub
folium 0.13.0
Python Data. Leaflet.js Maps.17 versions - Latest release: over 1 year ago - 44 dependent packages - 259 dependent repositories - 6,139 stars on GitHub
nannyml 0.7.0
Detecting silent model failure. NannyML estimates performance for regression and classification m...8 versions - Latest release: over 1 year ago - 1,451 stars on GitHub
sweetviz 2.1.4
Visualize and compare datasets, target values and associations, with one line of code.10 versions - Latest release: almost 2 years ago - 2 dependent repositories - 2,352 stars on GitHub
Top 4.9% on conda-forge.org
133 versions - Latest release: over 1 year ago - 8 dependent packages - 41 dependent repositories - 11,520 stars on GitHub
prefect 2.6.7
Prefect is a workflow management system, designed for modern infrastructure and powered by the op...133 versions - Latest release: over 1 year ago - 8 dependent packages - 41 dependent repositories - 11,520 stars on GitHub
verticapy 0.11.0
VerticaPy is a Python library that exposes sci-kit like functionality to conduct data science pro...17 versions - Latest release: over 1 year ago - 124 stars on GitHub
Top 1.0% on conda-forge.org
72 versions - Latest release: over 1 year ago - 306 dependent packages - 3,810 dependent repositories - 15,737 stars on GitHub
ipython 8.6.0 💰
IPython provides a rich architecture for interactive computing with a powerful interactive shell,...72 versions - Latest release: over 1 year ago - 306 dependent packages - 3,810 dependent repositories - 15,737 stars on GitHub
dagster-aws 1.0.17
An orchestration platform for the development, production, and observation of data assets.55 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-managed-elements 1.0.17
An orchestration platform for the development, production, and observation of data assets.3 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-duckdb-pyspark 1.0.17
An orchestration platform for the development, production, and observation of data assets.5 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-prometheus 1.0.17
An orchestration platform for the development, production, and observation of data assets.114 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-census 1.0.17
An orchestration platform for the development, production, and observation of data assets.12 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-spark 1.0.17
An orchestration platform for the development, production, and observation of data assets.114 versions - Latest release: over 1 year ago - 1 dependent package - 6,905 stars on GitHub
dagster_pandas 0.6.7
An orchestration platform for the development, production, and observation of data assets.5 versions - Latest release: over 4 years ago - 1 dependent package - 6,905 stars on GitHub
dagster-pandas 1.0.17
An orchestration platform for the development, production, and observation of data assets.114 versions - Latest release: over 1 year ago - 4 dependent packages - 6,905 stars on GitHub
dagster_ge 0.6.7
An orchestration platform for the development, production, and observation of data assets.5 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster-duckdb 1.0.17
An orchestration platform for the development, production, and observation of data assets.5 versions - Latest release: over 1 year ago - 2 dependent packages - 6,905 stars on GitHub
dagster-ssh 1.0.17
An orchestration platform for the development, production, and observation of data assets.114 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-pyspark 1.0.17
An orchestration platform for the development, production, and observation of data assets.112 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster_graphql 0.6.7
An orchestration platform for the development, production, and observation of data assets.5 versions - Latest release: over 4 years ago - 2 dependent packages - 6,905 stars on GitHub
dataprep 0.4.5
Open-source low code data preparation library in python. Collect, clean and visualization your da...8 versions - Latest release: almost 2 years ago - 4 dependent repositories - 1,571 stars on GitHub
pycwt 0.3.0a22
A Python module for continuous wavelet spectral analysis. It includes a collection of routines fo...1 version - Latest release: over 1 year ago - 3 dependent repositories - 212 stars on GitHub
hyppo 0.3.2
Python package for multivariate hypothesis testing1 version - Latest release: almost 2 years ago - 154 stars on GitHub
r-breakdown 0.2.1 💰
Model Agnostics breakDown plots5 versions - Latest release: over 3 years ago - 1 dependent package - 99 stars on GitHub
rubicon-ml 0.4.0
rubicon-ml is a machine learning solution designed to help standardize the model development life...33 versions - Latest release: over 1 year ago - 2 dependent repositories - 99 stars on GitHub
pyclustering 0.10.1
pyclustering is a Python, C++ data mining library.4 versions - Latest release: over 3 years ago - 1 dependent package - 3 dependent repositories - 1,052 stars on GitHub
ptitprince 0.2.6
python version of raincloud2 versions - Latest release: over 1 year ago - 1 dependent repositories - 169 stars on GitHub
pyscaffoldext-dsproject 0.7.2 💰
💫 PyScaffold extension for data-science projects5 versions - Latest release: almost 2 years ago - 1 dependent repositories - 139 stars on GitHub
xeofs 0.7.0
Collection of EOF analysis and related variants for climate science3 versions - Latest release: over 1 year ago - 34 stars on GitHub
pycompare 1.5.4
Python module for generating Bland-Altman plots8 versions - Latest release: almost 2 years ago - 26 stars on GitHub
flytekitplugins-spark 1.0.5
Spark 3 plugin for Flytekit: `flytekitplugins-spark` PyPI: [https://pypi.org/project/flytekitplu...1 version - Latest release: almost 2 years ago - 123 stars on GitHub
klib 1.0.6 💰
Easy to use Python library of customized functions for cleaning and analyzing data.30 versions - Latest release: over 1 year ago - 356 stars on GitHub
creme 0.6.1 💰
🌊 Online machine learning in Python7 versions - Latest release: over 2 years ago - 4,146 stars on GitHub
deepgraph 0.2.3
DeepGraph is a scalable, general-purpose data analysis package. It implements a network represent...3 versions - Latest release: over 2 years ago - 272 stars on GitHub
flytekitplugins-data-fsspec 1.2.4
`fsspec` powered data-plugins for Flytekit: `flytekitplugins-data-fsspec` PyPI: [https://pypi.or...8 versions - Latest release: over 1 year ago - 123 stars on GitHub
river 0.13.0 💰
🌊 Online machine learning in Python5 versions - Latest release: over 1 year ago - 4,146 stars on GitHub
mapie 0.5.0
A scikit-learn-compatible module for estimating prediction intervals.8 versions - Latest release: over 1 year ago - 1 dependent repositories - 689 stars on GitHub
dagster-pagerduty 1.0.17
An orchestration platform for the development, production, and observation of data assets.114 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
ml-research 0.4a2
A Python library with the implementation of algorithms for all papers I have been involved with.2 versions - Latest release: over 1 year ago - 3 stars on GitHub
r-mlr3data 0.6.1 💰
Data sets used in the book, gallery, or in examples of mlr3.6 versions - Latest release: over 1 year ago - 1 dependent package - 2 stars on GitHub
modin 0.17.0
Modin: Scale your Pandas workflows by changing a single line of code31 versions - Latest release: over 1 year ago - 3 dependent packages - 2 dependent repositories - 8,468 stars on GitHub
thepipe 1.3.8
A simplistic, general purpose pipeline framework.4 versions - Latest release: over 1 year ago - 1 dependent package - 13 stars on GitHub
bloxs 1.0.2
Build dashboards in Jupyter Notebook with numeric and chart boxes1 version - Latest release: almost 2 years ago - 203 stars on GitHub
vaex-distributed 0.3.0
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of ...3 versions - Latest release: about 5 years ago - 1 dependent package - 7,837 stars on GitHub
r-rio 0.5.29
A Swiss-Army Knife for Data I/O5 versions - Latest release: over 2 years ago - 6 dependent packages - 6 dependent repositories - 548 stars on GitHub
r-loon 1.4.0
A Toolkit for Interactive Statistical Data Visualization1 version - Latest release: over 1 year ago - 45 stars on GitHub
r-loose.rock 1.1.0
An R :package: that contains a wide set of useful functions for data science and survival analysis2 versions - Latest release: almost 3 years ago - 2 stars on GitHub
geomatics 0.10.1
A python tool for time series of multidimensional scientific data8 versions - Latest release: almost 4 years ago - 1 stars on GitHub
plotly-resampler 0.8.1
Visualize large time series data with plotly.py14 versions - Latest release: over 1 year ago - 1 dependent repositories - 675 stars on GitHub
evaml-core 0.12.2
An open source python library for automated feature engineering1 version - Latest release: almost 4 years ago - 6,558 stars on GitHub
u8darts-torch 0.22.0
A python library for user-friendly forecasting and anomaly detection on time series.15 versions - Latest release: over 1 year ago - 5,568 stars on GitHub
u8darts-all 0.22.0
A python library for user-friendly forecasting and anomaly detection on time series.18 versions - Latest release: over 1 year ago - 5,568 stars on GitHub
u8darts 0.22.0
A python library for user-friendly forecasting and anomaly detection on time series.18 versions - Latest release: over 1 year ago - 2 dependent packages - 1 dependent repositories - 5,568 stars on GitHub
r-modeltime 1.2.4
Modeltime unlocks time series forecast models and machine learning in one framework12 versions - Latest release: over 1 year ago - 440 stars on GitHub
r-uptasticsearch 0.4.0
An Elasticsearch client tailored to data science workflows.1 version - Latest release: about 4 years ago - 47 stars on GitHub
tscv 0.1.2
This repository is a scikit-learn extension for time series cross-validation. It introduces gaps ...1 version - Latest release: about 3 years ago - 229 stars on GitHub
pygam 0.8.0
pyGAM is a library for training generalized additive models in Python. GAMs are powerful and inte...3 versions - Latest release: over 5 years ago - 1 dependent package - 9 dependent repositories - 770 stars on GitHub
lcensemble 0.3.2
Random Forest or XGBoost? It is Time to Explore LCE10 versions - Latest release: over 1 year ago - 48 stars on GitHub
anndata 0.8.0
AnnData provides a scalable way of keeping track of data and learned annotations. It was initiall...10 versions - Latest release: about 2 years ago - 10 dependent packages - 17 dependent repositories - 358 stars on GitHub
rpy2 3.5.6
Interface to use R from Python16 versions - Latest release: over 1 year ago - 4 dependent packages - 73 dependent repositories - 376 stars on GitHub
pysparkling 0.6.1
A pure Python implementation of Apache Spark's RDD and DStream interfaces.1 version - Latest release: about 3 years ago - 256 stars on GitHub
girder-client 3.1.15
A data management platform for the web, developed by Kitware16 versions - Latest release: over 1 year ago - 2 dependent packages - 402 stars on GitHub
jupyter_pivottablejs 0.9.0
Drag and drop Pivot Tables and Charts for Jupyter/IPython Notebook2 versions - Latest release: over 1 year ago - 1 dependent repositories - 541 stars on GitHub
Related Keywords
python
265
machine-learning
148
mlops
88
data-engineering
75
analytics
70
workflow
70
etl
67
orchestration
65
data-integration
64
data-pipelines
64
data-orchestrator
63
scheduler
63
metadata
62
workflow-automation
62
dagster
61
hacktoberfest
44
data-analysis
40
pandas
36
visualization
35
deep-learning
35
data-visualization
29
r
27
ai
27
automl
27
scikit-learn
24
reproducibility
24
dataframe
24
data
22
hyperparameter-optimization
22
pytorch
21
model-selection
21
developer-tools
21
statistics
20
git
20
collaboration
20
jupyter
20
data-version-control
19
distributed
19
time-series
18
automation
18
spark
17
tensorflow
16
data-mining
15
jupyter-notebook
15
random-forest
15
machinelearning
15
feature-engineering
15
natural-language-processing
13
java
13
reinforcement-learning
12
matplotlib
12
optimization
12
tabular-data
11
hyperparameter-search
11
pipeline
11
r-package
11
python3
11
parallel
11
nlp
11
automated-machine-learning
11
ml
10
memory-mapped-file
10
pyarrow
10
hdf5
10
bigdata
10
deployment
10
forecasting
10
exploratory-data-analysis
10
serving
10
rllib
10
gradient-boosting
10
ray
10
llm-serving
9
datascience
9
sql
9
notebook
8
extensible
8
numpy
8
big-data
8
flyte
8
flyte-tasks
8
pypi
8
sdk
8
workflows
8
rstats
8
classification
8
time-series-analysis
8
alzheimer
7
regression
7
modin
7
alzheimers
7
parameter-tuning
7
plotly-dash
7
anomaly-detection
7
aiml
7
nia
7
artificial-intelligence
7
u01ag066833
7
plotting
7
plotly
7