An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "exploratory-data-analysis" keyword

View the packages on the pypi.org package registry that are tagged with the "exploratory-data-analysis" keyword.

tseuler 0.0.4.dev0
A library for Time-Series exploration, analysis & modelling.
4 versions - Latest release: over 4 years ago - 1 dependent repositories - 163 downloads last month - 17 stars on GitHub - 1 maintainer
Top 1.0% on pypi.org
ydata-profiling 4.16.1
Generate profile report for pandas DataFrame
31 versions - Latest release: 26 days ago - 43 dependent packages - 79 dependent repositories - 1.56 million downloads last month - 12,845 stars on GitHub - 1 maintainer
Top 0.5% on pypi.org
pandas-profiling 3.6.6
Deprecated 'pandas-profiling' package, use 'ydata-profiling' instead
40 versions - Latest release: about 2 years ago - 46 dependent packages - 1,970 dependent repositories - 371 thousand downloads last month - 12,108 stars on GitHub - 4 maintainers
hnet 1.2.3 💰
Graphical Hypergeometric Networks
29 versions - Latest release: over 1 year ago - 1 dependent repositories - 1.21 thousand downloads last month - 29 stars on GitHub - 1 maintainer
edaexcelreport 0.1.9
A Python package for generating detailed EDA reports in Excel format with structured insights and...
10 versions - Latest release: about 1 month ago - 331 downloads last month - 2 stars on GitHub - 1 maintainer
Top 9.4% on pypi.org
data-describe 0.1.0b3
A Pythonic EDA Accelerator for Data Science
3 versions - Latest release: over 4 years ago - 1 dependent repositories - 929 downloads last month - 299 stars on GitHub - 1 maintainer
Top 0.7% on pypi.org
great-expectations 1.4.0
Always know what to expect from your data.
310 versions - Latest release: 5 days ago - 58 dependent packages - 284 dependent repositories - 20.2 million downloads last month - 9,420 stars on GitHub - 8 maintainers
Top 3.4% on pypi.org
kdepy 1.1.12
Kernel Density Estimation in Python.
38 versions - Latest release: 3 days ago - 4 dependent packages - 10 dependent repositories - 26.6 thousand downloads last month - 610 stars on GitHub - 1 maintainer
data-science-kit 0.0.1
Data Science Basic Functions
1 version - Latest release: almost 4 years ago - 1 dependent repositories - 49 downloads last month - 1 stars on GitHub - 1 maintainer
prettierplot 0.1.2
Quickly create prettier plots
9 versions - Latest release: about 5 years ago - 1 dependent repositories - 344 downloads last month - 1 stars on GitHub - 1 maintainer
piperider-cli 0.1.3.12
PiperRider CLI
9 versions - Latest release: almost 3 years ago - 1 dependent repositories - 292 downloads last month - 487 stars on GitHub - 1 maintainer
Top 6.3% on pypi.org
great-expectations-experimental 0.1.20240917055
Always know what to expect from your data.
530 versions - Latest release: 7 months ago - 1 dependent package - 1 dependent repositories - 491 thousand downloads last month - 9,420 stars on GitHub - 4 maintainers
great-expectations-cta 0.15.43
Always know what to expect from your data.
2 versions - Latest release: over 2 years ago - 1 dependent package - 107 downloads last month - 9,420 stars on GitHub - 1 maintainer
tslumen 0.0.1
A library for Time Series exploratory data analysis
1 version - Latest release: over 2 years ago - 1 dependent package - 69 downloads last month - 69 stars on GitHub - 1 maintainer
adenine 0.1.4
A Data ExploratioN pIpeliNE
6 versions - Latest release: about 8 years ago - 2 dependent repositories - 197 downloads last month - 15 stars on GitHub - 1 maintainer
retrain-pipelines 0.1.1
retrain-pipelines lowers the barrier to entry for the creation and management of professional mac...
2 versions - Latest release: 6 months ago - 2.73 thousand downloads last month - 5 stars on GitHub - 1 maintainer
easy-eda 0.1.10
Exploratory Data Analysis
8 versions - Latest release: about 6 years ago - 1 dependent repositories - 297 downloads last month - 4 stars on GitHub - 1 maintainer
Top 9.0% on pypi.org
renumics-spotlight 1.6.13
Visualize and maintain datasets to develop and understand data-driven algorithms.
58 versions - Latest release: 5 months ago - 4 dependent packages - 1 dependent repositories - 2.72 thousand downloads last month - 1,162 stars on GitHub - 1 maintainer
medaprep 0.1.1
medaprep is a data preparation and feature engineering toolkit for geospatial applications.
2 versions - Latest release: over 2 years ago - 99 downloads last month - 1 stars on GitHub - 1 maintainer
skimpy-ext 0.1.0
skimpy is a light weight tool that provides summary statistics about variables in data frames wit...
2 versions - Latest release: over 1 year ago - 105 downloads last month - 453 stars on GitHub - 1 maintainer
Top 4.4% on pypi.org
skimpy 0.0.18
skimpy
17 versions - Latest release: 3 months ago - 2 dependent packages - 42 dependent repositories - 16.3 thousand downloads last month - 453 stars on GitHub - 1 maintainer
agriculture-data-analytics 0.0.1
Data analytic functions for the project.
1 version - Latest release: about 3 years ago - 1 dependent repositories - 75 downloads last month - 1 stars on GitHub - 1 maintainer
agas 0.0.1
Agas is a small Python library for pairing data series based on aggregate measures
1 version - Latest release: over 2 years ago - 96 downloads last month - 0 stars on GitHub - 1 maintainer
psy-transect 0.1.0
Psyplot plugin for visualizing data along a transect
1 version - Latest release: about 1 year ago - 44 downloads last month - 0 stars on GitHub - 1 maintainer
inspectpd 0.1.0
inspectpd: Inspection, Comparison and Visualisation of Data Frames
1 version - Latest release: 3 months ago - 36 downloads last month - 7 stars on GitHub - 1 maintainer
report-creator 1.0.38
Create self-contained HTML reports from Python.
35 versions - Latest release: about 2 months ago - 1 dependent package - 3.55 thousand downloads last month - 8 stars on GitHub - 1 maintainer
vizdxp 0.2.2
Simple data visualization web app
11 versions - Latest release: 9 months ago - 1 dependent repositories - 432 downloads last month - 6 stars on GitHub - 1 maintainer
llnl-thicket 2024.2.1
Toolkit for exploratory data analysis of ensemble performance data
6 versions - Latest release: 6 months ago - 334 downloads last month - 16 stars on GitHub - 1 maintainer
data-purifier 0.3.6
A Python library for Automated Exploratory Data Analysis, Automated Data Cleaning and Automated D...
35 versions - Latest release: over 1 year ago - 1 dependent repositories - 466 downloads last month - 44 stars on GitHub - 1 maintainer
pandas-data-exploration-utility-package 0.0.3
Utility functions to help with exploratory data analysis on top the Pandas APIs
3 versions - Latest release: over 6 years ago - 1 dependent repositories - 189 downloads last month - 5 stars on GitHub - 1 maintainer
haiqv-profiling 0.0.1
Generate profile report for pandas DataFrame
1 version - Latest release: over 4 years ago - 1 dependent repositories - 56 downloads last month - 12,108 stars on GitHub - 1 maintainer
arabica 1.8.2
Python package for text mining of time-series data
63 versions - Latest release: 5 months ago - 1 dependent package - 1 dependent repositories - 2.46 thousand downloads last month - 71 stars on GitHub - 1 maintainer
exploripy 1.1.2
Pre-Modelling Analysis of the data by doing various exploratory data analysis and Statistical Test
14 versions - Latest release: over 5 years ago - 1 dependent repositories - 401 downloads last month - 51 stars on GitHub - 3 maintainers
many 0.7.2
Statistical methods for computing many correlations
24 versions - Latest release: about 1 year ago - 10 dependent repositories - 692 downloads last month - 5 stars on GitHub - 1 maintainer
a2rl 1.2.0
Make recommendations for sequential decision problems using offline data
6 versions - Latest release: almost 2 years ago - 186 downloads last month - 36 stars on GitHub - 1 maintainer
qgridnext 2.0.4
An Interactive Grid for Sorting and Filtering DataFrames in Jupyter
7 versions - Latest release: 6 months ago - 12 thousand downloads last month - 19 stars on GitHub - 1 maintainer
Top 6.0% on pypi.org
piperider 1.0.2
PiperRider CLI
170 versions - Latest release: almost 4 years ago - 5 dependent repositories - 5.87 thousand downloads last month - 487 stars on GitHub - 1 maintainer
piperider-nightly 0.42.0.20250102
PiperRider CLI
706 versions - Latest release: 4 months ago - 12.4 thousand downloads last month - 479 stars on GitHub - 1 maintainer
Top 6.0% on pypi.org
cleanvision 0.3.6
Find issues in image datasets
12 versions - Latest release: about 1 year ago - 3 dependent packages - 2 dependent repositories - 5.15 thousand downloads last month - 1,068 stars on GitHub - 6 maintainers
Top 2.1% on pypi.org
cleanlab 2.7.1
The standard package for data-centric AI, machine learning with label errors, and automatically f...
33 versions - Latest release: about 2 months ago - 11 dependent packages - 19 dependent repositories - 36.4 thousand downloads last month - 10,446 stars on GitHub - 4 maintainers
example-package-elisno 2.6.24
The standard package for data-centric AI, machine learning with label errors, and automatically f...
7 versions - Latest release: about 1 year ago - 281 downloads last month - 10,446 stars on GitHub - 1 maintainer
zaps 1.1
Low-code Python wrapper for Exploratory Data Analysis
2 versions - Latest release: 5 months ago - 94 downloads last month - 0 stars on GitHub - 1 maintainer
orgroamtools 0.1.5
Library to aid in analysis of org-roam collections
6 versions - Latest release: 9 months ago - 247 downloads last month - 28 stars on GitHub - 1 maintainer
kydlib 0.3.1
Routines for exploratory data analysis.
7 versions - Latest release: about 2 years ago - 237 downloads last month - 25 stars on GitHub - 1 maintainer
edapy 0.4.1 💰
A tookit for exploratoriy data analysis.
7 versions - Latest release: about 3 years ago - 1 dependent repositories - 237 downloads last month - 20 stars on GitHub - 1 maintainer
desbordante 2.3.2
Science-intensive high-performance data profiler
9 versions - Latest release: about 2 months ago - 1 dependent package - 3.88 thousand downloads last month - 397 stars on GitHub - 1 maintainer
eda-report 2.8.2
Automate exploratory data analysis and reporting.
58 versions - Latest release: 4 months ago - 1 dependent repositories - 1.35 thousand downloads last month - 10 stars on GitHub - 1 maintainer
Top 3.2% on pypi.org
lux-api 0.5.1
A Python API for Intelligent Data Discovery
18 versions - Latest release: about 3 years ago - 1 dependent package - 16 dependent repositories - 1.84 thousand downloads last month - 5,264 stars on GitHub - 2 maintainers
Top 3.2% on pypi.org
lux 0.5.1
A Python API for Intelligent Data Discovery
1 version - Latest release: about 3 years ago - 2 dependent packages - 10 dependent repositories - 1.13 thousand downloads last month - 5,264 stars on GitHub - 2 maintainers
Top 2.8% on pypi.org
dataprep 0.4.5
Dataprep: Data Preparation in Python
33 versions - Latest release: over 2 years ago - 1 dependent package - 55 dependent repositories - 20.8 thousand downloads last month - 2,142 stars on GitHub - 4 maintainers
haisweetviz 1.0.2
A pandas-based library to visualize and compare datasets.
2 versions - Latest release: over 4 years ago - 1 dependent repositories - 41 downloads last month - 2,965 stars on GitHub - 1 maintainer
Top 2.1% on pypi.org
sweetviz 2.3.1
A pandas-based library to visualize and compare datasets.
35 versions - Latest release: over 1 year ago - 16 dependent packages - 167 dependent repositories - 62.2 thousand downloads last month - 2,965 stars on GitHub - 1 maintainer
easiersdk 0.1.16
This library contains code for interacting with EASIER.AI platform.
102 versions - Latest release: almost 4 years ago - 1 dependent repositories - 2.72 thousand downloads last month - 2,965 stars on GitHub - 1 maintainer
Top 2.5% on pypi.org
scattertext 0.2.2
An NLP package to visualize interesting terms in text.
150 versions - Latest release: 7 months ago - 2 dependent packages - 90 dependent repositories - 11.5 thousand downloads last month - 2,290 stars on GitHub - 1 maintainer
visualizer 0.0.10
Automate the process of visualization
7 versions - Latest release: about 5 years ago - 2 dependent repositories - 913 downloads last month - 8 stars on GitHub - 1 maintainer
dexter 0.0.7 💰
Data Exploration Terser
6 versions - Latest release: over 3 years ago - 2 dependent repositories - 225 downloads last month - 9 stars on GitHub - 1 maintainer
easy-insight 1.0.4
A simple library for easy exploratory data analysis
10 versions - Latest release: 6 months ago - 439 downloads last month - 0 stars on GitHub - 1 maintainer
jatoolbox 1.0.0
Classe que agrega alguns métodos comumente utilizados em análises de jornadas em Python na DP6.
3 versions - Latest release: over 3 years ago - 1 dependent repositories - 149 downloads last month - 5 stars on GitHub - 1 maintainer
cardtale 0.1.2
Data, Model, and Algorithm Cards for Time Series
2 versions - Latest release: 2 months ago - 99 downloads last month - 3 stars on GitHub - 1 maintainer
Top 6.7% on pypi.org
handyspark 0.2.2a1
HandySpark - bringing pandas-like capabilities to Spark dataframes
7 versions - Latest release: almost 6 years ago - 3 dependent repositories - 27.8 thousand downloads last month - 193 stars on GitHub - 1 maintainer
colvert 0.3.0
colvert is a Frontend for DuckDB a fast and lightweight in-memory database designed for analytica...
6 versions - Latest release: 4 months ago - 202 downloads last month - 14 stars on GitHub - 1 maintainer
text-explore 0.0.2
A Python Package to perform Exploratory Data Analysis on Text Data.
1 version - Latest release: about 3 years ago - 1 dependent repositories - 58 downloads last month - 3 stars on GitHub - 1 maintainer
pymeda 0.1.13
Matrix Exploratory Data Analysis
17 versions - Latest release: over 6 years ago - 1 dependent repositories - 266 downloads last month - 1 stars on GitHub - 2 maintainers
edamame 0.0.9
Exploratory data analysis tools
59 versions - Latest release: over 2 years ago - 1.33 thousand downloads last month - 3 stars on GitHub - 1 maintainer
edatora 0.2
A python package that runs exploratory data analysis for users
1 version - Latest release: almost 5 years ago - 1 dependent repositories - 42 downloads last month - 75 stars on GitHub - 1 maintainer
veda-lib 0.0.5
veda_lib is a Python library designed to streamline the data preprocessing and cleaning workflow ...
4 versions - Latest release: 8 months ago - 79 downloads last month - 0 stars on GitHub - 1 maintainer
altair-ally 0.1.1
Altair Ally is a companion package to Altair, which provides shortcuts to create common plots for...
2 versions - Latest release: 5 months ago - 1 dependent repositories - 109 downloads last month - 25 stars on GitHub - 1 maintainer
data-inspector 1.5.5
This module brings different functions to make EDA, data cleaning easier.
17 versions - Latest release: over 3 years ago - 1 dependent repositories - 497 downloads last month - 39 stars on GitHub - 1 maintainer
dxter 0.0.1 💰
Data Exploration Terser
1 version - Latest release: over 3 years ago - 1 dependent repositories - 51 downloads last month - 9 stars on GitHub - 1 maintainer
sparkora 0.0.1
Exploratory data analysis toolkit for Pyspark
1 version - Latest release: over 3 years ago - 1 dependent repositories - 39 downloads last month - 53 stars on GitHub - 1 maintainer
waltzboard 1.0.0a0
Waltzboard: Multi-Criteria Automated Dashboard Design
1 version - Latest release: about 1 year ago - 66 downloads last month - 7 stars on GitHub - 1 maintainer
olliepy 0.2.9
OlliePy is a python package which can help data scientists in exploring their data and evaluating...
33 versions - Latest release: almost 4 years ago - 1 dependent repositories - 1.04 thousand downloads last month - 51 stars on GitHub - 1 maintainer
dfsummarizer 0.1.6
Python command line application to summarize a CSV or TSV dataset.
7 versions - Latest release: over 3 years ago - 1 dependent repositories - 120 downloads last month - 5 stars on GitHub - 1 maintainer
edvart 4.0.0
Effective data visualization and reporting tool
16 versions - Latest release: about 1 year ago - 371 downloads last month - 42 stars on GitHub - 1 maintainer
fbref2pandas 0.0.1 💰
A scraper that directly gives football(not soccer) data from FBRef website directly to Pandas dat...
1 version - Latest release: almost 2 years ago - 73 downloads last month - 2 stars on GitHub - 1 maintainer
edexplore 1.0.1
A simple widget for interactive EDA / QA for those who use Pandas in Jupyter Notebook.
1 version - Latest release: 10 months ago - 55 downloads last month - 0 stars on GitHub - 1 maintainer
leila 0.2
Librería para medir la calidad de los datos en conjuntos de datos estructurados
2 versions - Latest release: over 3 years ago - 2 dependent repositories - 115 downloads last month - 61 stars on GitHub - 1 maintainer
sliceguard 0.0.35
A library for detecting critical data slices in structured and unstructured data based on feature...
33 versions - Latest release: over 1 year ago - 1 dependent package - 1 dependent repositories - 1.32 thousand downloads last month - 64 stars on GitHub - 1 maintainer
gleaner 0.0.5 removed
Gleaner: Multi-Criteria Optimization for Automatic Dashboard Design
5 versions - Latest release: almost 2 years ago - 1 dependent repositories - 41 downloads last month - 6 stars on GitHub - 1 maintainer
Related Keywords
data-science 43 eda 41 python 32 data-analysis 26 pandas 22 data-visualization 19 machine-learning 17 data-profiling 17 data-exploration 16 visualization 15 data-quality 14 statistics 12 data 11 jupyter 9 analysis 8 python3 7 exploration 7 exploratory-data-visualizations 7 data-engineering 6 data-analytics 6 pandas-dataframe 6 exploratory data analysis 5 data science 5 dataquality 5 datacleaning 5 exploratory-analysis 5 data-cleaning 5 jupyter-notebook 5 deep-learning 5 ipython 5 time-series 5 Data Science 4 mlops 4 continuous-integration 4 data-centric-ai 4 pipeline 4 science 4 data-curation 4 machine learning 4 html-report 4 hacktoberfest 4 datacleaner 3 dataunittest 3 exploratorydataanalysis 3 pipeline-debt 3 pipeline-testing 3 pipeline-tests 3 pandas-profiling 3 nlp 3 feature-engineering 3 markdown 3 Machine Learning 3 big-data-analytics 3 seaborn 3 code-review 3 data-observability 3 data-pipeline 3 data-profiler 3 data-reliability 3 data-testing 3 dbt 3 dbt-metrics 3 pull-requests 3 reporting 3 forecasting 3 data-mining 3 data-preprocessing 3 machine_learning 3 data_cleaning 3 Visualization 3 data-validation 3 data analysis 3 classification 3 outlier-detection 3 testing 3 quality 3 report 3 validation 3 datavalidation 3 cleandata 3 EDA 3 data-profilers 3 data-unit-tests 3 open-source 2 out-of-distribution-detection 2 confident_learning 2 llms 2 weak-supervision 2 pyspark 2 exploratory 2 labeling 2 noisy-labels 2 ai 2 active-learning 2 annotation 2 datacentric 2 data-labeling 2 datacentric_ai 2 text-mining 2 unsupervised_learning 2