Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "data-science" keyword

Top 7.9% on pypi.org
scikits.statsmodels 0.3.1
Statistical computations and models for use with SciPy
5 versions - Latest release: over 12 years ago - 48 dependent repositories - 330 downloads last month - 9,506 stars on GitHub - 4 maintainers
Top 7.2% on pypi.org
scikits.learn 0.8.1 💰
A set of python modules for machine learning and data mining
1 version - Latest release: over 12 years ago - 17 dependent repositories - 1.2 thousand downloads last month - 57,979 stars on GitHub - 4 maintainers
mdr 0.0.1
python library to detect and extract listing data from HTML page
1 version - Latest release: over 9 years ago - 2 dependent repositories - 29 downloads last month - 106 stars on GitHub - 2 maintainers
Top 6.7% on pypi.org
aeon 2.0.2
A toolkit for conducting machine learning tasks with time series data
20 versions - Latest release: almost 9 years ago - 3 dependent packages - 1 dependent repositories - 4.08 thousand downloads last month - 676 stars on GitHub - 2 maintainers
join 0.1.1
SQL-style joins for iterables.
5 versions - Latest release: almost 9 years ago - 16 dependent repositories - 987 downloads last month - 11 stars on GitHub - 2 maintainers
aduana 0.2.1
Bindings for Aduana library
2 versions - Latest release: almost 9 years ago - 3 dependent repositories - 31 downloads last month - 53 stars on GitHub - 2 maintainers
sample-lines 0.0.4 💰
Sample lines from a file.
4 versions - Latest release: over 8 years ago - 2 dependent repositories - 19 downloads last month - 3,607 stars on GitHub - 2 maintainers
ipython-dashboard 0.1.5
An stand alone, light-weight web server for building, sharing graphs in created in ipython. Let i...
6 versions - Latest release: over 8 years ago - 1 dependent repositories - 47 downloads last month - 686 stars on GitHub - 2 maintainers
flexds 1.1
Flex is a framework for building and executing computing pipelines
1 version - Latest release: over 8 years ago - 2 dependent repositories - 13 downloads last month - 54 stars on GitHub - 2 maintainers
Top 2.8% on pypi.org
pydataset 0.2.0
Provides instant access to many popular datasets right from Python (in dataframe structure).
3 versions - Latest release: over 8 years ago - 5 dependent packages - 62 dependent repositories - 1.96 thousand downloads last month - 931 stars on GitHub - 2 maintainers
xp 1.1
xp is a framework for building and executing computing pipelines
1 version - Latest release: about 8 years ago - 3 dependent repositories - 34 downloads last month - 54 stars on GitHub - 2 maintainers
panoramix 0.8.0
A interactive data visualization platform build on SqlAlchemy and druid.io
11 versions - Latest release: about 8 years ago - 2 dependent repositories - 51 downloads last month - 58,575 stars on GitHub - 2 maintainers
page_clustering 0.0.1
Online k-means clustering of web pages
1 version - Latest release: almost 8 years ago - 13 dependent repositories - 224 downloads last month - 35 stars on GitHub - 4 maintainers
wordlevelrnn 1.0
UNKNOWN
1 version - Latest release: almost 8 years ago - 1 dependent repositories - 21 downloads last month - 60,873 stars on GitHub - 2 maintainers
abel-airflow 1.7.1.3.post3
Programmatically author, schedule and monitor data pipelines
1 version - Latest release: over 7 years ago - 13 downloads last month - 34,219 stars on GitHub - 1 maintainer
Top 6.6% on pypi.org
spark-df-profiling 1.1.13
Create HTML profiling reports from Apache Spark DataFrames
13 versions - Latest release: over 7 years ago - 2 dependent repositories - 53.9 thousand downloads last month - 194 stars on GitHub - 2 maintainers
dabox 0.0.2
Supplementary tools and functions for data analysis and ML
2 versions - Latest release: over 7 years ago - 1 dependent repositories - 7 downloads last month - 1 stars on GitHub - 2 maintainers
creavel 0.11.0
A interactive data visualization platform build on SqlAlchemy and druid.io
1 version - Latest release: over 7 years ago - 1 dependent repositories - 9 downloads last month - 58,575 stars on GitHub - 1 maintainer
feagen 0.3.2
A fast and memory-efficient Python feature generating framework for machine learning.
15 versions - Latest release: over 7 years ago - 1 dependent repositories - 82 downloads last month - 33 stars on GitHub - 2 maintainers
Top 6.0% on pypi.org
pyglmnet 1.0.0
Elastic-net regularized generalized linear models.
2 versions - Latest release: over 7 years ago - 5 dependent repositories - 1.4 thousand downloads last month - 273 stars on GitHub - 4 maintainers
snorkel_ie 0.4a0
a lightweight framework for developing structured information extraction applications
1 version - Latest release: over 7 years ago - 17 downloads last month - 5,699 stars on GitHub - 2 maintainers
weka-porter 0.1.0
Transpile trained decision trees from Weka to a low-level programming language.
1 version - Latest release: over 7 years ago - 1 dependent repositories - 11 downloads last month - 6 stars on GitHub - 2 maintainers
Top 8.2% on pypi.org
webstruct 0.4.1
A library for creating statistical NER systems that work on HTML data
6 versions - Latest release: over 7 years ago - 5 dependent repositories - 238 downloads last month - 254 stars on GitHub - 12 maintainers
easymoney 1.5.0
Data Science Tools for Monetary Information and Conversions.
2 versions - Latest release: over 7 years ago - 1 dependent repositories - 20 downloads last month - 7 stars on GitHub - 1 maintainer
pyfora 0.5.10
A library for parallel execution of Python code in the Ufora runtime
29 versions - Latest release: over 7 years ago - 2 dependent repositories - 107 downloads last month - 494 stars on GitHub - 4 maintainers
ann 0.1.0
Supporting package for the book 'Introduction to Artificial Neural Networks and Deep Learning: A ...
1 version - Latest release: over 7 years ago - 1 dependent repositories - 69 downloads last month - 2,775 stars on GitHub - 1 maintainer
Top 9.3% on pypi.org
page_finder 0.1.9
Find which links on a web page are pagination links
10 versions - Latest release: over 7 years ago - 14 dependent repositories - 549 downloads last month - 30 stars on GitHub - 4 maintainers
Top 5.2% on pypi.org
datacleaner 0.1.5
A Python tool that automatically cleans data sets and readies them for analysis.
6 versions - Latest release: over 7 years ago - 1 dependent package - 19 dependent repositories - 313 downloads last month - 1,039 stars on GitHub - 2 maintainers
decorated_options 0.1.0
Function decorator to make argument passing saner.
1 version - Latest release: over 7 years ago - 1 dependent repositories - 6 downloads last month - 7 stars on GitHub - 2 maintainers
Top 8.3% on pypi.org
heamy 0.0.7
A set of useful tools for competitive data science.
2 versions - Latest release: over 7 years ago - 33 dependent repositories - 31 downloads last month - 552 stars on GitHub - 2 maintainers
autism_treatment_assistance 0.5
Employs data science and deep learning to assist therapists treat a common autism symptom that in...
5 versions - Latest release: about 7 years ago - 16 downloads last month - 17 stars on GitHub - 1 maintainer
biovida 0.1.1
Automated BioMedical Information Curation for Machine Learning Applications.
1 version - Latest release: about 7 years ago - 1 dependent repositories - 21 downloads last month - 2 maintainers
Top 3.1% on pypi.org
skrebate 0.3.4
Relief-based feature selection algorithms
13 versions - Latest release: about 7 years ago - 8 dependent packages - 51 dependent repositories - 4.77 thousand downloads last month - 394 stars on GitHub - 4 maintainers
pystae 0.1.0
UNKNOWN
2 versions - Latest release: about 7 years ago - 1 dependent repositories - 13 downloads last month - 0 stars on GitHub - 2 maintainers
corral-pipeline 0.2.7
MVC framework for create trustworthy pipelines
6 versions - Latest release: about 7 years ago - 3 dependent repositories - 27 downloads last month - 6 stars on GitHub - 1 maintainer
pygdf 0.1.0a1
GPU Dataframe
1 version - Latest release: about 7 years ago - 1 dependent repositories - 23 downloads last month - 7,236 stars on GitHub - 2 maintainers
Top 6.0% on pypi.org
speedml 0.9.3
Speedml Machine Learning Speed Start
6 versions - Latest release: almost 7 years ago - 17 dependent repositories - 1.12 thousand downloads last month - 202 stars on GitHub - 2 maintainers
insults 0.1.12
Identify insulting comments and users on social media
2 versions - Latest release: almost 7 years ago - 1 dependent repositories - 19 downloads last month - 23 stars on GitHub - 2 maintainers
langdist 0.4.1
Multilingual Language Modeling Toolkit
2 versions - Latest release: almost 7 years ago - 1 dependent repositories - 5 downloads last month - 11 stars on GitHub - 2 maintainers
mlfromscratch 0.0.4
Python implementations of some of the fundamental Machine Learning models and algorithms from scr...
4 versions - Latest release: almost 7 years ago - 1 dependent repositories - 48 downloads last month - 22,973 stars on GitHub - 1 maintainer
pylearning 3.2.2b1
Simple high-level library to use machine learning algorithms
9 versions - Latest release: almost 7 years ago - 1 dependent repositories - 51 downloads last month - 5 stars on GitHub - 2 maintainers
krisk 0.3.1
Echarts Statistical Visualization for Python Data Science
6 versions - Latest release: almost 7 years ago - 1 dependent repositories - 12 downloads last month - 118 stars on GitHub - 2 maintainers
easymlpy 0.1.2
A Python toolkit for easily building and evaluating machine learning models.
2 versions - Latest release: almost 7 years ago - 1 dependent repositories - 15 downloads last month - 40 stars on GitHub - 2 maintainers
geoql 0.0.8.0
Library for performing queries and transformations on GeoJSON data (with emphasis on support for ...
7 versions - Latest release: almost 7 years ago - 2 dependent repositories - 18 downloads last month - 0 stars on GitHub - 1 maintainer
footballdata 0.3.1
A collection of wrappers over football (soccer) data from various websites / APIs. You get: Panda...
2 versions - Latest release: almost 7 years ago - 1 dependent repositories - 11 downloads last month - 34 stars on GitHub - 2 maintainers
chicksexer 0.2.2
Python package for gender classification.
6 versions - Latest release: almost 7 years ago - 1 dependent repositories - 226 downloads last month - 82 stars on GitHub - 2 maintainers
sciblox 0.2.11
Making data science and machine learning in Python easier.
11 versions - Latest release: almost 7 years ago - 1 dependent repositories - 33 downloads last month - 48 stars on GitHub - 2 maintainers
qlink 0.1a1
Entity Resolution and Record Linkage library
1 version - Latest release: almost 7 years ago - 1 dependent repositories - 13 downloads last month - 7 stars on GitHub - 2 maintainers
toady 1.70
Easily visualize high-dimensional data in 2d space
1 version - Latest release: almost 7 years ago - 1 dependent repositories - 7 downloads last month - 1 stars on GitHub - 2 maintainers
datacost 1.07
Calculate cost based metrics about data based on the number of positive and negative data points.
7 versions - Latest release: almost 7 years ago - 1 dependent repositories - 20 downloads last month - 0 stars on GitHub - 2 maintainers
wattle 1.00
wattle is a customizable decision tree algorithm. Use it to implement decision tree algorithms...
1 version - Latest release: over 6 years ago - 1 dependent repositories - 5 downloads last month - 0 stars on GitHub - 2 maintainers
entro 1.01
Information entropy measurements library
1 version - Latest release: over 6 years ago - 1 dependent repositories - 6 downloads last month - 2 stars on GitHub - 2 maintainers
chainsaw 1.00
Functions for splitting decision trees.
1 version - Latest release: over 6 years ago - 1 dependent repositories - 11 downloads last month - 0 stars on GitHub - 2 maintainers
xcessiv 0.5.1
A web-based application for quick and scalable construction of massive machine learning ensembles.
33 versions - Latest release: over 6 years ago - 1 dependent repositories - 147 downloads last month - 1,266 stars on GitHub - 2 maintainers
spark-df-profiling-optimus 0.1.1
Create HTML profiling reports from Apache Spark DataFrames
6 versions - Latest release: over 6 years ago - 3 dependent repositories - 598 downloads last month - 2 stars on GitHub - 2 maintainers
datasnakes 0.1.1a1
This package helps in the analysis of orthologous genes.
2 versions - Latest release: over 6 years ago - 1 dependent repositories - 11 downloads last month - 27 stars on GitHub - 2 maintainers
predictives-models-building 1.0
code for running predictives modeling tasks
1 version - Latest release: over 6 years ago - 1 dependent repositories - 7 downloads last month - 5 stars on GitHub - 2 maintainers
acerim 0.1.1
ACERIM is deprecated. See github.com/cjtu/craterpy
9 versions - Latest release: over 6 years ago - 80 downloads last month - 16 stars on GitHub - 2 maintainers
reportify 1.3.0
Generate report-like documents from Jupyter notebooks
6 versions - Latest release: over 6 years ago - 1 dependent repositories - 48 downloads last month - 2 stars on GitHub - 2 maintainers
devml 0.5.1 💰
Machine Learning, Statistics and Utilities around Developer Productivity, Company Productivity an...
5 versions - Latest release: over 6 years ago - 2 dependent repositories - 24 downloads last month - 27 stars on GitHub - 2 maintainers
Top 7.8% on pypi.org
dist-keras 0.2.1
Distributed Deep learning with Apache Spark with Keras.
3 versions - Latest release: over 6 years ago - 1 dependent repositories - 6.95 thousand downloads last month - 624 stars on GitHub - 2 maintainers
guidedlda 2.0.0.dev22
Topic modeling with Guided latent Dirichlet allocation
5 versions - Latest release: over 6 years ago - 4 dependent repositories - 198 downloads last month - 492 stars on GitHub - 2 maintainers
Top 6.3% on pypi.org
observations 0.1.4
Tools for loading standard data sets in machine learning
6 versions - Latest release: over 6 years ago - 1 dependent package - 29 dependent repositories - 291 downloads last month - 200 stars on GitHub - 1 maintainer
Top 5.2% on pypi.org
kaggle-cli 0.12.13
An unofficial Kaggle command line tool.
39 versions - Latest release: over 6 years ago - 121 dependent repositories - 778 downloads last month - 676 stars on GitHub - 2 maintainers
seamless-framework 0.1.5
a cell-based reactive programming framework
6 versions - Latest release: over 6 years ago - 1 dependent repositories - 55 downloads last month - 20 stars on GitHub - 1 maintainer
orthoevol 0.9.0a2
This package aids in the analysis of orthologous genes.
1 version - Latest release: over 6 years ago - 1 dependent repositories - 10 downloads last month - 27 stars on GitHub - 4 maintainers
scikit-data 0.1.3
The propose of this library is to allow the data analysis process more easy and automatic.
3 versions - Latest release: over 6 years ago - 1 dependent repositories - 25 downloads last month - 19 stars on GitHub - 2 maintainers
choropie 0.0.3
Create a choropleth map with pie charts in each polygon centroid.
1 version - Latest release: over 6 years ago - 1 dependent repositories - 27 downloads last month - 18 stars on GitHub - 2 maintainers
mxlearn 0.1.0
Deep learning library featuring a higher-level API for mxnet.
1 version - Latest release: over 6 years ago - 1 dependent repositories - 9 downloads last month - 2 stars on GitHub - 2 maintainers
supermjo-py 0.2.0
Python interface to Super-Mjograph
3 versions - Latest release: over 6 years ago - 1 dependent repositories - 21 downloads last month - 5 stars on GitHub - 2 maintainers
dframe-utils 0.0.2rc2
simple utility tools for dataframes in Python
2 versions - Latest release: over 6 years ago - 1 dependent repositories - 21 downloads last month - 4 stars on GitHub - 1 maintainer
pypipelinestream 0.2.0
A pipelining framework designed for data analysis but can be useful to other applications
1 version - Latest release: over 6 years ago - 1 dependent repositories - 13 downloads last month - 14 stars on GitHub - 2 maintainers
jdss 0.2.4
A command line tool for generating Jenkins summary reports for data science activities
27 versions - Latest release: over 6 years ago - 1 dependent repositories - 36 downloads last month - 0 stars on GitHub - 2 maintainers
Top 4.0% on pypi.org
pivottablejs 0.9.0
PivotTable.js integration for Jupyter/IPython Notebook
10 versions - Latest release: over 6 years ago - 73 dependent repositories - 38.3 thousand downloads last month - 645 stars on GitHub - 1 maintainer
spacy-ci-improve 2.0.5 💰
Industrial-strength Natural Language Processing (NLP) with Python and Cython
1 version - Latest release: over 6 years ago - 1 dependent repositories - 8 downloads last month - 28,635 stars on GitHub - 1 maintainer
tsmetrics 0.1.0
Evaluation metrics for time series analysis
1 version - Latest release: over 6 years ago - 1 dependent repositories - 3 downloads last month - 4 stars on GitHub - 2 maintainers
Top 6.3% on pypi.org
boostaroota 1.3
A Fast XGBoost Feature Selection Algorithm
3 versions - Latest release: over 6 years ago - 1 dependent package - 2 dependent repositories - 4.94 thousand downloads last month - 209 stars on GitHub - 2 maintainers
Top 8.1% on pypi.org
edinet-xbrl 0.2.0
A Python Edinet xbrl file parser
4 versions - Latest release: over 6 years ago - 6 dependent repositories - 440 downloads last month - 117 stars on GitHub - 2 maintainers
Top 5.6% on pypi.org
automl 2.9.9
Automated machine learning for production and analytics
19 versions - Latest release: about 6 years ago - 9 dependent repositories - 785 downloads last month - 1,635 stars on GitHub - 1 maintainer
hyperengine 0.1.1
Python library for Bayesian hyper-parameters optimization
1 version - Latest release: about 6 years ago - 6 dependent repositories - 9 downloads last month - 85 stars on GitHub - 2 maintainers
sklearn2docker 0.1
Convert your trained scikit-learn classifier to a Docker container with a pre-configured API.
1 version - Latest release: about 6 years ago - 1 dependent repositories - 26 downloads last month - 5 stars on GitHub - 2 maintainers
diaml 1.1.0a1
Does It All Machine Learning
1 version - Latest release: about 6 years ago - 1 dependent repositories - 15 downloads last month - 7 stars on GitHub - 2 maintainers
Top 6.2% on pypi.org
auto_ml 2.9.10
Automated machine learning for production and analytics
78 versions - Latest release: about 6 years ago - 3 dependent repositories - 1.44 thousand downloads last month - 1,635 stars on GitHub - 2 maintainers
adlframework 1.4
Deep learning Streamlined Process
5 versions - Latest release: about 6 years ago - 48 downloads last month - 1 stars on GitHub - 1 maintainer
polyaxon-lib 0.0.4
Deep Learning library for TensorFlow for building end to end models and experiments.
2 versions - Latest release: about 6 years ago - 1 dependent repositories - 26 downloads last month - 8 stars on GitHub - 2 maintainers
pyfts 1.2.2
Fuzzy Time Series for Python
3 versions - Latest release: about 6 years ago - 1 dependent repositories - 236 downloads last month - 257 stars on GitHub - 2 maintainers
Top 7.7% on pypi.org
pydbgen 1.0.5
Random database/dataframe generator
1 version - Latest release: about 6 years ago - 12 dependent repositories - 268 downloads last month - 290 stars on GitHub - 2 maintainers
bis-miner 3.11.0 💰
Bis-Miner, a component-based data mining framework.
1 version - Latest release: about 6 years ago - 14 downloads last month - 4,611 stars on GitHub - 1 maintainer
cfanalytics 0.1.10
Downloading, analyzing and visualizing CrossFit data
5 versions - Latest release: about 6 years ago - 1 dependent repositories - 26 downloads last month - 27 stars on GitHub - 2 maintainers
Top 7.5% on pypi.org
prettypandas 0.0.4
Pandas Styler for Report Quality Tables.
4 versions - Latest release: about 6 years ago - 14 dependent repositories - 127 downloads last month - 409 stars on GitHub - 2 maintainers
eru 0.0.1
Deep Learning for all
1 version - Latest release: about 6 years ago - 1 dependent repositories - 26 downloads last month - 8 stars on GitHub - 2 maintainers
etherscan-magic-for-machine-learning-and-bash 0.1
Machine Learning Library for Etherscan
1 version - Latest release: about 6 years ago - 1 dependent repositories - 10 downloads last month - 139 stars on GitHub - 2 maintainers
f27-cohorts 1.0.0
Make cohort analysis a magical experience
1 version - Latest release: about 6 years ago - 1 dependent repositories - 7 downloads last month - 5 stars on GitHub - 2 maintainers
logdissect 3.1.1
Robust CLI syslog forensics tool
18 versions - Latest release: about 6 years ago - 1 dependent repositories - 270 downloads last month - 137 stars on GitHub - 2 maintainers
datapipeml 0.8
Framework to manipulate dataframes fluidly in a pipeline.
1 version - Latest release: about 6 years ago - 1 dependent repositories - 12 downloads last month - 7 stars on GitHub - 2 maintainers
etherscan-ml 0.1.4
Machine Learning Library for Etherscan
4 versions - Latest release: about 6 years ago - 1 dependent repositories - 24 downloads last month - 140 stars on GitHub - 2 maintainers
data-refinery 0.2.13
Data Refinery: transformating data
29 versions - Latest release: about 6 years ago - 1 dependent repositories - 240 downloads last month - 22 stars on GitHub - 2 maintainers
dwcontents 1.0.0b5
Jupyter contents manager for data.world
5 versions - Latest release: about 6 years ago - 1 dependent repositories - 31 downloads last month - 1 stars on GitHub - 2 maintainers
logpose 0.0.1b2
A python log library for data science
11 versions - Latest release: about 6 years ago - 1 dependent repositories - 50 downloads last month - 2 stars on GitHub - 2 maintainers
pypi-description-test 0.12
py_description_test
11 versions - Latest release: about 6 years ago - 1 dependent repositories - 19 downloads last month - 1,757 stars on GitHub - 2 maintainers
Related Keywords
python 1,525 machine-learning 1,197 data-analysis 373 deep-learning 370 mlops 325 data-engineering 322 data 305 pandas 267 data-visualization 252 etl 242 analytics 215 data-pipelines 209 workflow 199 pytorch 190 orchestration 171 statistics 167 ai 162 data-integration 155 scikit-learn 152 visualization 150 scheduler 149 data-orchestrator 147 python3 143 elt 142 artificial-intelligence 139 ml 136 automation 126 jupyter 123 pipeline 119 hacktoberfest 116 machine learning 115 automl 114 apache 113 natural-language-processing 106 tensorflow 106 data-mining 105 airflow 104 dag 103 time-series 100 database 98 workflow-engine 97 data science 93 sql 92 hyperparameter-optimization 91 apache-airflow 90 nlp 90 snowflake 86 workflow-orchestration 85 feature-engineering 83 metadata 78 computer-vision 77 jupyter-notebook 76 data-lineage 72 workflow-automation 71 numpy 70 kubernetes 68 data-structures 67 dagster 66 integration 65 data-analytics 64 forecasting 64 dataops 64 automated-machine-learning 64 neural-network 63 airflow-provider 63 matplotlib 62 dataframe 58 reinforcement-learning 57 data-quality 57 big-data 57 neural-networks 56 tabular-data 54 trino 54 keras 50 classification 49 data-warehouse 49 machinelearning 49 warehouse 49 spark 48 data-engineering-pipeline 47 data-engineer 47 pipelines 47 science 46 learning 46 eda 45 machine 44 hyperparameter-tuning 44 optimization 41 exploratory-data-analysis 40 dataset 40 notebook 40 flask 39 distributed 38 developer-tools 38 datascience 38 pandas-dataframe 38 aws 38 ensemble-learning 37 regression 37 plotting 36