Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

conda-forge.org "data-science" keyword

psy-simple 1.4.1
This psyplot plugin provides plot methods for simple visualization tasks like 2D plots, line plot...
7 versions - Latest release: about 2 years ago - 4 dependent packages - 4 dependent repositories - 1 stars on GitHub
medaprep 0.1.1
medaprep is a data preparation and feature engineering toolkit for geospatial applications.
1 version - Latest release: over 1 year ago - 1 stars on GitHub
psy-reg 1.4.0
This psyplot plugin can be used to make fits to your data and visualize them
5 versions - Latest release: over 2 years ago - 1 dependent package - 5 dependent repositories - 1 stars on GitHub
geomatics 0.10.1
A python tool for time series of multidimensional scientific data
8 versions - Latest release: almost 4 years ago - 1 stars on GitHub
r-mlr3data 0.6.1 ๐Ÿ’ฐ
Data sets used in the book, gallery, or in examples of mlr3.
6 versions - Latest release: over 1 year ago - 1 dependent package - 2 stars on GitHub
r-loose.rock 1.1.0
An R :package: that contains a wide set of useful functions for data science and survival analysis
2 versions - Latest release: almost 3 years ago - 2 stars on GitHub
genestboost 0.3.1
genestboost is an ML boosting library that separates the modeling algorithm from the boosting alg...
3 versions - Latest release: almost 3 years ago - 2 stars on GitHub
castoredc_api 0.1.4
Python Wrapper for Castor EDC API
7 versions - Latest release: almost 2 years ago - 2 stars on GitHub
ml-research 0.4a2
A Python library with the implementation of algorithms for all papers I have been involved with.
2 versions - Latest release: over 1 year ago - 3 stars on GitHub
corral-pipeline 0.3
Corral will solve your pipeline needs by merging a database full connection interface with a MVC ...
1 version - Latest release: about 5 years ago - 5 stars on GitHub
psyplot-gui 1.4.0
This package provides a graphical user interface to interact with the psyplot framework.
10 versions - Latest release: over 2 years ago - 2 dependent packages - 3 dependent repositories - 7 stars on GitHub
psy-maps 1.4.2
This psyplot plugin uses the cartopy package to visualize geo-referenced data on a map
9 versions - Latest release: about 2 years ago - 2 dependent packages - 5 dependent repositories - 8 stars on GitHub
psy-view 0.2.0
This package provides a graphical user interface to quickly visualize the contents of a netCDF file
3 versions - Latest release: over 2 years ago - 2 dependent repositories - 9 stars on GitHub
mprod-package 0.0.4a1
The mprod package provides python implementation for applying tensor-tensor (tubal) products. In ...
3 versions - Latest release: almost 2 years ago - 9 stars on GitHub
leafmaptools 0.0.2 ๐Ÿ’ฐ
A Python package for building a tool widgets infrastructure with ipyleaflet and ipywidgets
2 versions - Latest release: about 3 years ago - 1 dependent repositories - 9 stars on GitHub
muler 0.3.3
muler is an easy to use Python package for post-processing echelle spectroscopy from near-infrare...
4 versions - Latest release: almost 2 years ago - 11 stars on GitHub
spectrafit-jupyter 0.12.5
SpectraFit is a command-line and Jupyter-notebook tool for quick data-fitting based on the regula...
3 versions - Latest release: over 1 year ago - 1 dependent package - 11 stars on GitHub
spectrafit 0.12.5
SpectraFit is a command-line and Jupyter-notebook tool for quick data-fitting based on the regula...
8 versions - Latest release: over 1 year ago - 2 dependent packages - 11 stars on GitHub
spectrafit-all 0.12.5
SpectraFit is a command-line and Jupyter-notebook tool for quick data-fitting based on the regula...
3 versions - Latest release: over 1 year ago - 11 stars on GitHub
thepipe 1.3.8
A simplistic, general purpose pipeline framework.
4 versions - Latest release: over 1 year ago - 1 dependent package - 13 stars on GitHub
piso 0.9.0
Pandas Interval Set Operations: providing methods for set operations, analytics, lookups and join...
9 versions - Latest release: about 2 years ago - 16 stars on GitHub
scikit-data 0.1.3
This library offers functions to manipulate, clean and visualize data in a easy way.
3 versions - Latest release: over 1 year ago - 19 stars on GitHub
darr 0.5.4
Darr is a Python science library for disk-based NumPy arrays that persist in a format that is sim...
11 versions - Latest release: almost 2 years ago - 20 stars on GitHub
ghostpii 1.0.11
This repository contains the Python library for interacting with Capnion's private computation AP...
5 versions - Latest release: over 1 year ago - 21 stars on GitHub
boltzmannclean 0.1.2
Fills missing values in a pandas DataFrame using a Restricted Boltzmann Machine.
1 version - Latest release: over 4 years ago - 23 stars on GitHub
pycompare 1.5.4
Python module for generating Bland-Altman plots
8 versions - Latest release: almost 2 years ago - 26 stars on GitHub
cfanalytics 0.1.4
Downloading, analyzing and visualizing CrossFit data
1 version - Latest release: over 1 year ago - 27 stars on GitHub
ukcensusapi 1.1.6
UK Census Data queries and downloads from python or R
3 versions - Latest release: about 2 years ago - 1 dependent package - 27 stars on GitHub
r-naivebayes 0.9.7
High performance implementation of the Naive Bayes algorithm in R
2 versions - Latest release: about 4 years ago - 2 dependent repositories - 28 stars on GitHub
xeofs 0.7.0
Collection of EOF analysis and related variants for climate science
3 versions - Latest release: over 1 year ago - 34 stars on GitHub
aimodelshare 0.0.144
23 versions - Latest release: over 1 year ago - 36 stars on GitHub
scikit-time 0.1
A unified framework for machine learning with time series
1 version - Latest release: over 4 years ago - 39 stars on GitHub
zen3geo 0.5.0 ๐Ÿ’ฐ
The ๐ŸŒ data science library you've been waiting for~
3 versions - Latest release: over 1 year ago - 40 stars on GitHub
r-loon 1.4.0
A Toolkit for Interactive Statistical Data Visualization
1 version - Latest release: over 1 year ago - 45 stars on GitHub
r-uptasticsearch 0.4.0
An Elasticsearch client tailored to data science workflows.
1 version - Latest release: about 4 years ago - 47 stars on GitHub
dask-awkward 2022.10a0
dask-awkward provides a native Dask collection representing partitioned awkward arrays.
2 versions - Latest release: over 1 year ago - 2 dependent repositories - 47 stars on GitHub
lcensemble 0.3.2
Random Forest or XGBoost? It is Time to Explore LCE
10 versions - Latest release: over 1 year ago - 48 stars on GitHub
lexicalrichness 0.3.0
:smile_cat: :speech_balloon: A module to compute textual lexical richness (aka lexical diversity).
4 versions - Latest release: over 1 year ago - 1 dependent repositories - 50 stars on GitHub
foundry_ml 0.5.0
Simplifying the discovery and usage of machine-learning ready datasets in materials science and c...
4 versions - Latest release: over 1 year ago - 52 stars on GitHub
matbench 0.6
Matbench: Benchmarks for materials science property prediction
2 versions - Latest release: almost 2 years ago - 57 stars on GitHub
forestplot 0.2.0
A Python package to make publication-ready but customizable coefficient plots.
2 versions - Latest release: over 1 year ago - 59 stars on GitHub
typedframe 0.7.0
Typed wrappers over pandas DataFrames with schema validation
1 version - Latest release: almost 2 years ago - 65 stars on GitHub
psyplot 1.4.3
psyplot is an cross-platform open source python project that mainly combines the plotting utiliti...
10 versions - Latest release: over 1 year ago - 7 dependent packages - 10 dependent repositories - 65 stars on GitHub
coqui-trainer 0.0.5
๐Ÿธ - A general purpose model trainer, as flexible as it gets
2 versions - Latest release: about 2 years ago - 67 stars on GitHub
pyprocessmacro 1.0.12
A Python library for moderation, mediation and conditional process analysis.
3 versions - Latest release: about 2 years ago - 67 stars on GitHub
r-datacomparer 0.1.4
dataCompareR is an R package that allows users to compare two datasets and view a report on the s...
2 versions - Latest release: over 2 years ago - 68 stars on GitHub
tidypandas 0.2.3
A grammar of data manipulation for pandas inspired by tidyverse
3 versions - Latest release: over 1 year ago - 70 stars on GitHub
ipychart 0.4.0
ipychart is an ipywidget which allows to create dynamic, refined and customizable charts within t...
9 versions - Latest release: about 2 years ago - 71 stars on GitHub
r-vtree 5.4.6
An R package for calculating and drawing variable trees
3 versions - Latest release: over 2 years ago - 71 stars on GitHub
stripnet 0.0.7
Leverage the power of NLP Topic Modeling, Semantic Similarity and Network analysis to study the t...
2 versions - Latest release: about 2 years ago - 82 stars on GitHub
susi 1.2.2
SuSi: Python package for unsupervised, supervised and semi-supervised self-organizing maps (SOM)
4 versions - Latest release: over 2 years ago - 83 stars on GitHub
r-tarchetypes 0.7.2
Archetypes for targets and pipelines
14 versions - Latest release: over 1 year ago - 1 dependent repositories - 89 stars on GitHub
pymks 0.4.1
PyMKS is an open source, pythonic implementation of the methodologies developed under the aegis o...
4 versions - Latest release: about 2 years ago - 1 dependent repositories - 92 stars on GitHub
r-astsa 1.16
R package to accompany Time Series Analysis and Its Applications: With R Examples -and- Time Seri...
4 versions - Latest release: over 1 year ago - 1 dependent package - 1 dependent repositories - 97 stars on GitHub
panel-chemistry 0.2.2
๐Ÿงช๐Ÿ“ˆ ๐Ÿ. The purpose of the panel-chemistry project is to make it really easy for you to do DATA AN...
5 versions - Latest release: over 1 year ago - 98 stars on GitHub
rubicon-ml 0.4.0
rubicon-ml is a machine learning solution designed to help standardize the model development life...
33 versions - Latest release: over 1 year ago - 2 dependent repositories - 99 stars on GitHub
r-breakdown 0.2.1 ๐Ÿ’ฐ
Model Agnostics breakDown plots
5 versions - Latest release: over 3 years ago - 1 dependent package - 99 stars on GitHub
pyrolite 0.3.2
A set of tools for getting the most from your geochemical data.
2 versions - Latest release: almost 2 years ago - 1 dependent repositories - 100 stars on GitHub
woodwork 0.20.0
Woodwork is a Python library that provides robust methods for managing and communicating data typ...
44 versions - Latest release: over 1 year ago - 5 dependent packages - 112 stars on GitHub
scitime 0.1.1
Training time estimation for scikit-learn algorithms.
4 versions - Latest release: about 3 years ago - 120 stars on GitHub
r-gghoriplot 1.0.1
A user-friendly, highly customizable R package for building horizon plots in ggplot2
2 versions - Latest release: over 1 year ago - 121 stars on GitHub
flytekitplugins-modin 1.2.4
Modin plugin for Flytekit: `flytekitplugins-modin` PyPI: [https://pypi.org/project/flytekitplugi...
9 versions - Latest release: over 1 year ago - 123 stars on GitHub
flytekitplugins-data-fsspec 1.2.4
`fsspec` powered data-plugins for Flytekit: `flytekitplugins-data-fsspec` PyPI: [https://pypi.or...
8 versions - Latest release: over 1 year ago - 123 stars on GitHub
flytekitplugins-sqlalchemy 1.2.4
SQLAlchemy plugin for Flytekit: `flytekitplugins-sqlalchemy` PyPI: [https://pypi.org/project/fly...
8 versions - Latest release: over 1 year ago - 123 stars on GitHub
flytekitplugins-athena 1.2.4
Athena plugin for Flytekit: `flytekitplugins-athena` PyPI: [https://pypi.org/project/flytekitplu...
8 versions - Latest release: over 1 year ago - 123 stars on GitHub
flytekitplugins-spark 1.0.5
Spark 3 plugin for Flytekit: `flytekitplugins-spark` PyPI: [https://pypi.org/project/flytekitplu...
1 version - Latest release: almost 2 years ago - 123 stars on GitHub
flytekitplugins-awsbatch 1.2.4
AWS Batch plugin for Flytekit: `flytekitplugins-awsbatch` PyPI: [https://pypi.org/project/flytek...
9 versions - Latest release: over 1 year ago - 123 stars on GitHub
flytekitplugins-pandera 1.2.4
Pandera plugin for Flytekit: `flytekitplugins-pandera` PyPI: [https://pypi.org/project/flytekitp...
7 versions - Latest release: over 1 year ago - 123 stars on GitHub
flytekit 1.1.0
Flytekit Python is the Python Library for easily authoring, testing, deploying, and interacting w...
4 versions - Latest release: almost 2 years ago - 8 dependent packages - 123 stars on GitHub
verticapy 0.11.0
VerticaPy is a Python library that exposes sci-kit like functionality to conduct data science pro...
17 versions - Latest release: over 1 year ago - 124 stars on GitHub
chemml 1.2
ChemML is a machine learning and informatics program suite for the analysis, mining, and modeling...
4 versions - Latest release: about 2 years ago - 128 stars on GitHub
pyscaffoldext-dsproject 0.7.2 ๐Ÿ’ฐ
๐Ÿ’ซ PyScaffold extension for data-science projects
5 versions - Latest release: almost 2 years ago - 1 dependent repositories - 139 stars on GitHub
hcrystalball 0.1.12
A library that unifies the API for most commonly used libraries and modeling techniques for time-...
10 versions - Latest release: about 2 years ago - 1 dependent package - 147 stars on GitHub
activitysim 1.1.3
An Open Platform for Activity-Based Travel Modeling
5 versions - Latest release: over 1 year ago - 152 stars on GitHub
hyppo 0.3.2
Python package for multivariate hypothesis testing
1 version - Latest release: almost 2 years ago - 154 stars on GitHub
mvlearn 0.5.0
mvlearn aims to serve as a community-driven open-source software package that offers reference im...
5 versions - Latest release: about 2 years ago - 169 stars on GitHub
ptitprince 0.2.6
python version of raincloud
2 versions - Latest release: over 1 year ago - 1 dependent repositories - 169 stars on GitHub
visions 0.7.5
Type System for Data Analysis in Python
12 versions - Latest release: over 2 years ago - 2 dependent packages - 13 dependent repositories - 174 stars on GitHub
r-fastverse 0.3.0
An Extensible Suite of High-Performance and Low-Dependency Packages for Statistical Computing and...
3 versions - Latest release: over 1 year ago - 175 stars on GitHub
redshift_connector 2.0.908
redshift_connector is the Amazon Redshift connector for Python. Easy integration with pandas and ...
26 versions - Latest release: almost 2 years ago - 2 dependent packages - 178 stars on GitHub
pandas_schema 0.3.5
A validation library for Pandas data frames using user-friendly schemas
1 version - Latest release: about 4 years ago - 1 dependent repositories - 180 stars on GitHub
cdsdashboards 0.6.3
JupyterHub extension for ContainDS Dashboards
22 versions - Latest release: over 1 year ago - 1 dependent package - 1 dependent repositories - 183 stars on GitHub
cdsdashboards-singleuser 0.6.3
JupyterHub extension for ContainDS Dashboards
22 versions - Latest release: over 1 year ago - 6 dependent repositories - 183 stars on GitHub
bookstore 2.5.1
bookstore provides tooling and workflow recommendations for storing, scheduling, and publishing n...
2 versions - Latest release: over 4 years ago - 187 stars on GitHub
bloxs 1.0.2
Build dashboards in Jupyter Notebook with numeric and chart boxes
1 version - Latest release: almost 2 years ago - 203 stars on GitHub
ripser 0.6.4
Ripser.py is a lean persistent homology package for Python. Building on the blazing fast C++ Rips...
8 versions - Latest release: over 1 year ago - 209 stars on GitHub
pycwt 0.3.0a22
A Python module for continuous wavelet spectral analysis. It includes a collection of routines fo...
1 version - Latest release: over 1 year ago - 3 dependent repositories - 212 stars on GitHub
histolab 0.5.1 ๐Ÿ’ฐ
Library for Digital Pathology Image Processing
1 version - Latest release: about 2 years ago - 229 stars on GitHub
miceforest 5.6.2
Multiple Imputation iteratively 'fills in' missing values in a dataset by modeling each variable ...
8 versions - Latest release: over 1 year ago - 229 stars on GitHub
tscv 0.1.2
This repository is a scikit-learn extension for time series cross-validation. It introduces gaps ...
1 version - Latest release: about 3 years ago - 229 stars on GitHub
tsflex 0.3.0
Flexible time series feature extraction & processing
15 versions - Latest release: over 1 year ago - 240 stars on GitHub
deon 0.3.0
deon is a command line tool that allows you to easily add an ethics checklist to your data scienc...
3 versions - Latest release: over 3 years ago - 250 stars on GitHub
pysparkling 0.6.1
A pure Python implementation of Apache Spark's RDD and DStream interfaces.
1 version - Latest release: about 3 years ago - 256 stars on GitHub
pyglmnet 1.1
Python implementation of elastic-net regularized generalized linear models
1 version - Latest release: about 4 years ago - 258 stars on GitHub
doubleml 0.5.2
The Python package DoubleML provides an implementation of the double / debiased machine learning ...
10 versions - Latest release: over 1 year ago - 261 stars on GitHub
datacompy 0.8.3
Pandas and Spark DataFrame comparison for humans
9 versions - Latest release: over 1 year ago - 1 dependent repositories - 269 stars on GitHub
deepgraph 0.2.3
DeepGraph is a scalable, general-purpose data analysis package. It implements a network represent...
3 versions - Latest release: over 2 years ago - 272 stars on GitHub
traffic 2.8.0
A toolbox for processing and analysing air traffic data
7 versions - Latest release: almost 2 years ago - 1 dependent repositories - 273 stars on GitHub
retriever 3.1.0
This module analyzes jpeg/jpeg2000/png/gif image header and return image size.
8 versions - Latest release: about 2 years ago - 1 dependent repositories - 279 stars on GitHub
graspy 0.2
A graph, or network, provides a mathematically intuitive representation of data with some sort of...
1 version - Latest release: about 4 years ago - 1 dependent package - 291 stars on GitHub