Ecosyste.ms: Packages
An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.
conda-forge.org "data-science" keyword
psy-simple 1.4.1
This psyplot plugin provides plot methods for simple visualization tasks like 2D plots, line plot...
7 versions - Latest release: about 2 years ago - 4 dependent packages - 4 dependent repositories - 1 star on GitHub
medaprep 0.1.1
medaprep is a data preparation and feature engineering toolkit for geospatial applications.
1 version - Latest release: over 1 year ago - 1 star on GitHub
psy-reg 1.4.0
This psyplot plugin can be used to make fits to your data and visualize them
5 versions - Latest release: over 2 years ago - 1 dependent package - 5 dependent repositories - 1 star on GitHub
geomatics 0.10.1
A Python tool for time series of multidimensional scientific data
8 versions - Latest release: almost 4 years ago - 1 star on GitHub
r-mlr3data 0.6.1
Data sets used in the book, gallery, or in examples of mlr3.
6 versions - Latest release: over 1 year ago - 1 dependent package - 2 stars on GitHub
r-loose.rock 1.1.0
An R package that contains a wide set of useful functions for data science and survival analysis
2 versions - Latest release: almost 3 years ago - 2 stars on GitHub
genestboost 0.3.1
genestboost is an ML boosting library that separates the modeling algorithm from the boosting alg...
3 versions - Latest release: almost 3 years ago - 2 stars on GitHub
castoredc_api 0.1.4
Python Wrapper for Castor EDC API
7 versions - Latest release: almost 2 years ago - 2 stars on GitHub
ml-research 0.4a2
A Python library with the implementation of algorithms for all papers I have been involved with.
2 versions - Latest release: over 1 year ago - 3 stars on GitHub
corral-pipeline 0.3
Corral will solve your pipeline needs by merging a database full connection interface with a MVC ...
1 version - Latest release: about 5 years ago - 5 stars on GitHub
psyplot-gui 1.4.0
This package provides a graphical user interface to interact with the psyplot framework.
10 versions - Latest release: over 2 years ago - 2 dependent packages - 3 dependent repositories - 7 stars on GitHub
psy-maps 1.4.2
This psyplot plugin uses the cartopy package to visualize geo-referenced data on a map
9 versions - Latest release: about 2 years ago - 2 dependent packages - 5 dependent repositories - 8 stars on GitHub
psy-view 0.2.0
This package provides a graphical user interface to quickly visualize the contents of a netCDF file
3 versions - Latest release: over 2 years ago - 2 dependent repositories - 9 stars on GitHub
mprod-package 0.0.4a1
The mprod package provides python implementation for applying tensor-tensor (tubal) products. In ...
3 versions - Latest release: almost 2 years ago - 9 stars on GitHub
leafmaptools 0.0.2
A Python package for building a tool widgets infrastructure with ipyleaflet and ipywidgets
2 versions - Latest release: about 3 years ago - 1 dependent repository - 9 stars on GitHub
muler 0.3.3
muler is an easy to use Python package for post-processing echelle spectroscopy from near-infrare...
4 versions - Latest release: almost 2 years ago - 11 stars on GitHub
spectrafit-jupyter 0.12.5
SpectraFit is a command-line and Jupyter-notebook tool for quick data-fitting based on the regula...
3 versions - Latest release: over 1 year ago - 1 dependent package - 11 stars on GitHub
spectrafit 0.12.5
SpectraFit is a command-line and Jupyter-notebook tool for quick data-fitting based on the regula...
8 versions - Latest release: over 1 year ago - 2 dependent packages - 11 stars on GitHub
spectrafit-all 0.12.5
SpectraFit is a command-line and Jupyter-notebook tool for quick data-fitting based on the regula...
3 versions - Latest release: over 1 year ago - 11 stars on GitHub
thepipe 1.3.8
A simplistic, general purpose pipeline framework.
4 versions - Latest release: over 1 year ago - 1 dependent package - 13 stars on GitHub
piso 0.9.0
Pandas Interval Set Operations: providing methods for set operations, analytics, lookups and join...
9 versions - Latest release: about 2 years ago - 16 stars on GitHub
scikit-data 0.1.3
This library offers functions to manipulate, clean and visualize data in an easy way.
3 versions - Latest release: over 1 year ago - 19 stars on GitHub
darr 0.5.4
Darr is a Python science library for disk-based NumPy arrays that persist in a format that is sim...
11 versions - Latest release: almost 2 years ago - 20 stars on GitHub
ghostpii 1.0.11
This repository contains the Python library for interacting with Capnion's private computation AP...
5 versions - Latest release: over 1 year ago - 21 stars on GitHub
boltzmannclean 0.1.2
Fills missing values in a pandas DataFrame using a Restricted Boltzmann Machine.
1 version - Latest release: over 4 years ago - 23 stars on GitHub
pycompare 1.5.4
Python module for generating Bland-Altman plots
8 versions - Latest release: almost 2 years ago - 26 stars on GitHub
cfanalytics 0.1.4
Downloading, analyzing and visualizing CrossFit data
1 version - Latest release: over 1 year ago - 27 stars on GitHub
ukcensusapi 1.1.6
UK Census Data queries and downloads from python or R
3 versions - Latest release: about 2 years ago - 1 dependent package - 27 stars on GitHub
r-naivebayes 0.9.7
High performance implementation of the Naive Bayes algorithm in R
2 versions - Latest release: about 4 years ago - 2 dependent repositories - 28 stars on GitHub
xeofs 0.7.0
Collection of EOF analysis and related variants for climate science
3 versions - Latest release: over 1 year ago - 34 stars on GitHub
aimodelshare 0.0.144
23 versions - Latest release: over 1 year ago - 36 stars on GitHub
scikit-time 0.1
A unified framework for machine learning with time series
1 version - Latest release: over 4 years ago - 39 stars on GitHub
zen3geo 0.5.0
The data science library you've been waiting for~
3 versions - Latest release: over 1 year ago - 40 stars on GitHub
r-loon 1.4.0
A Toolkit for Interactive Statistical Data Visualization
1 version - Latest release: over 1 year ago - 45 stars on GitHub
r-uptasticsearch 0.4.0
An Elasticsearch client tailored to data science workflows.
1 version - Latest release: about 4 years ago - 47 stars on GitHub
dask-awkward 2022.10a0
dask-awkward provides a native Dask collection representing partitioned awkward arrays.
2 versions - Latest release: over 1 year ago - 2 dependent repositories - 47 stars on GitHub
lcensemble 0.3.2
Random Forest or XGBoost? It is Time to Explore LCE
10 versions - Latest release: over 1 year ago - 48 stars on GitHub
lexicalrichness 0.3.0
A module to compute textual lexical richness (aka lexical diversity).
4 versions - Latest release: over 1 year ago - 1 dependent repository - 50 stars on GitHub
foundry_ml 0.5.0
Simplifying the discovery and usage of machine-learning ready datasets in materials science and c...
4 versions - Latest release: over 1 year ago - 52 stars on GitHub
matbench 0.6
Matbench: Benchmarks for materials science property prediction
2 versions - Latest release: almost 2 years ago - 57 stars on GitHub
forestplot 0.2.0
A Python package to make publication-ready but customizable coefficient plots.
2 versions - Latest release: over 1 year ago - 59 stars on GitHub
typedframe 0.7.0
Typed wrappers over pandas DataFrames with schema validation
1 version - Latest release: almost 2 years ago - 65 stars on GitHub
psyplot 1.4.3
psyplot is a cross-platform open source python project that mainly combines the plotting utiliti...
10 versions - Latest release: over 1 year ago - 7 dependent packages - 10 dependent repositories - 65 stars on GitHub
coqui-trainer 0.0.5
A general purpose model trainer, as flexible as it gets
2 versions - Latest release: about 2 years ago - 67 stars on GitHub
pyprocessmacro 1.0.12
A Python library for moderation, mediation and conditional process analysis.
3 versions - Latest release: about 2 years ago - 67 stars on GitHub
r-datacomparer 0.1.4
dataCompareR is an R package that allows users to compare two datasets and view a report on the s...
2 versions - Latest release: over 2 years ago - 68 stars on GitHub
tidypandas 0.2.3
A grammar of data manipulation for pandas inspired by tidyverse
3 versions - Latest release: over 1 year ago - 70 stars on GitHub
ipychart 0.4.0
ipychart is an ipywidget which allows to create dynamic, refined and customizable charts within t...
9 versions - Latest release: about 2 years ago - 71 stars on GitHub
r-vtree 5.4.6
An R package for calculating and drawing variable trees
3 versions - Latest release: over 2 years ago - 71 stars on GitHub
stripnet 0.0.7
Leverage the power of NLP Topic Modeling, Semantic Similarity and Network analysis to study the t...
2 versions - Latest release: about 2 years ago - 82 stars on GitHub
susi 1.2.2
SuSi: Python package for unsupervised, supervised and semi-supervised self-organizing maps (SOM)
4 versions - Latest release: over 2 years ago - 83 stars on GitHub
r-tarchetypes 0.7.2
Archetypes for targets and pipelines
14 versions - Latest release: over 1 year ago - 1 dependent repository - 89 stars on GitHub
pymks 0.4.1
PyMKS is an open source, pythonic implementation of the methodologies developed under the aegis o...
4 versions - Latest release: about 2 years ago - 1 dependent repository - 92 stars on GitHub
r-astsa 1.16
R package to accompany Time Series Analysis and Its Applications: With R Examples -and- Time Seri...
4 versions - Latest release: over 1 year ago - 1 dependent package - 1 dependent repository - 97 stars on GitHub
panel-chemistry 0.2.2
The purpose of the panel-chemistry project is to make it really easy for you to do DATA AN...
5 versions - Latest release: over 1 year ago - 98 stars on GitHub
rubicon-ml 0.4.0
rubicon-ml is a machine learning solution designed to help standardize the model development life...
33 versions - Latest release: over 1 year ago - 2 dependent repositories - 99 stars on GitHub
r-breakdown 0.2.1
Model Agnostics breakDown plots
5 versions - Latest release: over 3 years ago - 1 dependent package - 99 stars on GitHub
pyrolite 0.3.2
A set of tools for getting the most from your geochemical data.
2 versions - Latest release: almost 2 years ago - 1 dependent repository - 100 stars on GitHub
woodwork 0.20.0
Woodwork is a Python library that provides robust methods for managing and communicating data typ...
44 versions - Latest release: over 1 year ago - 5 dependent packages - 112 stars on GitHub
scitime 0.1.1
Training time estimation for scikit-learn algorithms.
4 versions - Latest release: about 3 years ago - 120 stars on GitHub
r-gghoriplot 1.0.1
A user-friendly, highly customizable R package for building horizon plots in ggplot2
2 versions - Latest release: over 1 year ago - 121 stars on GitHub
flytekitplugins-modin 1.2.4
Modin plugin for Flytekit: `flytekitplugins-modin` PyPI: [https://pypi.org/project/flytekitplugi...
9 versions - Latest release: over 1 year ago - 123 stars on GitHub
flytekitplugins-data-fsspec 1.2.4
`fsspec` powered data-plugins for Flytekit: `flytekitplugins-data-fsspec` PyPI: [https://pypi.or...
8 versions - Latest release: over 1 year ago - 123 stars on GitHub
flytekitplugins-sqlalchemy 1.2.4
SQLAlchemy plugin for Flytekit: `flytekitplugins-sqlalchemy` PyPI: [https://pypi.org/project/fly...
8 versions - Latest release: over 1 year ago - 123 stars on GitHub
flytekitplugins-athena 1.2.4
Athena plugin for Flytekit: `flytekitplugins-athena` PyPI: [https://pypi.org/project/flytekitplu...
8 versions - Latest release: over 1 year ago - 123 stars on GitHub
flytekitplugins-spark 1.0.5
Spark 3 plugin for Flytekit: `flytekitplugins-spark` PyPI: [https://pypi.org/project/flytekitplu...
1 version - Latest release: almost 2 years ago - 123 stars on GitHub
flytekitplugins-awsbatch 1.2.4
AWS Batch plugin for Flytekit: `flytekitplugins-awsbatch` PyPI: [https://pypi.org/project/flytek...
9 versions - Latest release: over 1 year ago - 123 stars on GitHub
flytekitplugins-pandera 1.2.4
Pandera plugin for Flytekit: `flytekitplugins-pandera` PyPI: [https://pypi.org/project/flytekitp...
7 versions - Latest release: over 1 year ago - 123 stars on GitHub
flytekit 1.1.0
Flytekit Python is the Python Library for easily authoring, testing, deploying, and interacting w...
4 versions - Latest release: almost 2 years ago - 8 dependent packages - 123 stars on GitHub
verticapy 0.11.0
VerticaPy is a Python library that exposes sci-kit like functionality to conduct data science pro...
17 versions - Latest release: over 1 year ago - 124 stars on GitHub
chemml 1.2
ChemML is a machine learning and informatics program suite for the analysis, mining, and modeling...
4 versions - Latest release: about 2 years ago - 128 stars on GitHub
pyscaffoldext-dsproject 0.7.2
PyScaffold extension for data-science projects
5 versions - Latest release: almost 2 years ago - 1 dependent repository - 139 stars on GitHub
hcrystalball 0.1.12
A library that unifies the API for most commonly used libraries and modeling techniques for time-...
10 versions - Latest release: about 2 years ago - 1 dependent package - 147 stars on GitHub
activitysim 1.1.3
An Open Platform for Activity-Based Travel Modeling
5 versions - Latest release: over 1 year ago - 152 stars on GitHub
hyppo 0.3.2
Python package for multivariate hypothesis testing
1 version - Latest release: almost 2 years ago - 154 stars on GitHub
mvlearn 0.5.0
mvlearn aims to serve as a community-driven open-source software package that offers reference im...
5 versions - Latest release: about 2 years ago - 169 stars on GitHub
ptitprince 0.2.6
python version of raincloud
2 versions - Latest release: over 1 year ago - 1 dependent repository - 169 stars on GitHub
visions 0.7.5
Type System for Data Analysis in Python
12 versions - Latest release: over 2 years ago - 2 dependent packages - 13 dependent repositories - 174 stars on GitHub
r-fastverse 0.3.0
An Extensible Suite of High-Performance and Low-Dependency Packages for Statistical Computing and...
3 versions - Latest release: over 1 year ago - 175 stars on GitHub
redshift_connector 2.0.908
redshift_connector is the Amazon Redshift connector for Python. Easy integration with pandas and ...
26 versions - Latest release: almost 2 years ago - 2 dependent packages - 178 stars on GitHub
pandas_schema 0.3.5
A validation library for Pandas data frames using user-friendly schemas
1 version - Latest release: about 4 years ago - 1 dependent repository - 180 stars on GitHub
cdsdashboards 0.6.3
JupyterHub extension for ContainDS Dashboards
22 versions - Latest release: over 1 year ago - 1 dependent package - 1 dependent repository - 183 stars on GitHub
cdsdashboards-singleuser 0.6.3
JupyterHub extension for ContainDS Dashboards
22 versions - Latest release: over 1 year ago - 6 dependent repositories - 183 stars on GitHub
bookstore 2.5.1
bookstore provides tooling and workflow recommendations for storing, scheduling, and publishing n...
2 versions - Latest release: over 4 years ago - 187 stars on GitHub
bloxs 1.0.2
Build dashboards in Jupyter Notebook with numeric and chart boxes
1 version - Latest release: almost 2 years ago - 203 stars on GitHub
ripser 0.6.4
Ripser.py is a lean persistent homology package for Python. Building on the blazing fast C++ Rips...
8 versions - Latest release: over 1 year ago - 209 stars on GitHub
pycwt 0.3.0a22
A Python module for continuous wavelet spectral analysis. It includes a collection of routines fo...
1 version - Latest release: over 1 year ago - 3 dependent repositories - 212 stars on GitHub
histolab 0.5.1
Library for Digital Pathology Image Processing
1 version - Latest release: about 2 years ago - 229 stars on GitHub
miceforest 5.6.2
Multiple Imputation iteratively 'fills in' missing values in a dataset by modeling each variable ...
8 versions - Latest release: over 1 year ago - 229 stars on GitHub
tscv 0.1.2
This repository is a scikit-learn extension for time series cross-validation. It introduces gaps ...
1 version - Latest release: about 3 years ago - 229 stars on GitHub
tsflex 0.3.0
Flexible time series feature extraction & processing
15 versions - Latest release: over 1 year ago - 240 stars on GitHub
deon 0.3.0
deon is a command line tool that allows you to easily add an ethics checklist to your data scienc...
3 versions - Latest release: over 3 years ago - 250 stars on GitHub
pysparkling 0.6.1
A pure Python implementation of Apache Spark's RDD and DStream interfaces.
1 version - Latest release: about 3 years ago - 256 stars on GitHub
pyglmnet 1.1
Python implementation of elastic-net regularized generalized linear models
1 version - Latest release: about 4 years ago - 258 stars on GitHub
doubleml 0.5.2
The Python package DoubleML provides an implementation of the double / debiased machine learning ...
10 versions - Latest release: over 1 year ago - 261 stars on GitHub
datacompy 0.8.3
Pandas and Spark DataFrame comparison for humans
9 versions - Latest release: over 1 year ago - 1 dependent repository - 269 stars on GitHub
deepgraph 0.2.3
DeepGraph is a scalable, general-purpose data analysis package. It implements a network represent...
3 versions - Latest release: over 2 years ago - 272 stars on GitHub
traffic 2.8.0
A toolbox for processing and analysing air traffic data
7 versions - Latest release: almost 2 years ago - 1 dependent repository - 273 stars on GitHub
retriever 3.1.0
This module analyzes jpeg/jpeg2000/png/gif image header and return image size.
8 versions - Latest release: about 2 years ago - 1 dependent repository - 279 stars on GitHub
graspy 0.2
A graph, or network, provides a mathematically intuitive representation of data with some sort of...
1 version - Latest release: about 4 years ago - 1 dependent package - 291 stars on GitHub
Related Keywords
python (265)
machine-learning (148)
mlops (88)
data-engineering (75)
workflow (70)
analytics (70)
etl (67)
orchestration (65)
data-pipelines (64)
data-integration (64)
scheduler (63)
data-orchestrator (63)
metadata (62)
workflow-automation (62)
dagster (61)
hacktoberfest (44)
data-analysis (40)
pandas (36)
deep-learning (35)
visualization (35)
data-visualization (29)
automl (27)
ai (27)
r (27)
reproducibility (24)
dataframe (24)
scikit-learn (24)
hyperparameter-optimization (22)
data (22)
developer-tools (21)
pytorch (21)
model-selection (21)
jupyter (20)
collaboration (20)
statistics (20)
git (20)
data-version-control (19)
distributed (19)
time-series (18)
automation (18)
spark (17)
tensorflow (16)
jupyter-notebook (15)
machinelearning (15)
data-mining (15)
random-forest (15)
feature-engineering (15)
natural-language-processing (13)
java (13)
reinforcement-learning (12)
optimization (12)
matplotlib (12)
automated-machine-learning (11)
tabular-data (11)
hyperparameter-search (11)
nlp (11)
parallel (11)
python3 (11)
pipeline (11)
r-package (11)
exploratory-data-analysis (10)
gradient-boosting (10)
pyarrow (10)
memory-mapped-file (10)
hdf5 (10)
forecasting (10)
ml (10)
bigdata (10)
deployment (10)
ray (10)
rllib (10)
serving (10)
sql (9)
llm-serving (9)
datascience (9)
pypi (8)
sdk (8)
numpy (8)
rstats (8)
workflows (8)
big-data (8)
classification (8)
notebook (8)
flyte (8)
time-series-analysis (8)
extensible (8)
flyte-tasks (8)
parameter-tuning (7)
regression (7)
u01ag066833 (7)
anomaly-detection (7)
nia (7)
plotly-dash (7)
alzheimers (7)
alzheimer (7)
aiml (7)
ag066833 (7)
artificial-intelligence (7)
adsp (7)
modin (7)