An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "pydata" keyword

View the packages on the pypi.org package registry that are tagged with the "pydata" keyword.

Top 1.5% on pypi.org
stumpy 1.13.0
A powerful and scalable library that can be used for a variety of time series data mining tasks
29 versions - Latest release: 9 months ago - 14 dependent packages - 79 dependent repositories - 299 thousand downloads last month - 3,611 stars on GitHub - 1 maintainer
bodo 2025.3.3
High-Performance Python Compute Engine for Data and AI
41 versions - Latest release: 22 days ago - 1 dependent repositories - 18.2 thousand downloads last month - 250 stars on GitHub - 1 maintainer
Top 0.3% on pypi.org
dask 2025.3.0 💰
Parallel PyData with Task Scheduling
217 versions - Latest release: 29 days ago - 880 dependent packages - 13,853 dependent repositories - 11.8 million downloads last month - 11,667 stars on GitHub - 8 maintainers
Top 1.4% on pypi.org
impyla 0.21.0
Python client for the Impala distributed query engine
59 versions - Latest release: about 1 month ago - 29 dependent packages - 251 dependent repositories - 6.14 million downloads last month - 735 stars on GitHub - 13 maintainers
Top 9.5% on pypi.org
cudf-cu12 25.4.0
cuDF - GPU Dataframe
25 versions - Latest release: 9 days ago - 12 dependent packages - 1 dependent repositories - 12.2 thousand downloads last month - 8,866 stars on GitHub - 1 maintainer
dask-cudf-cu12 25.4.0
Utilities for Dask and cuDF interactions
23 versions - Latest release: 9 days ago - 3 dependent packages - 1 dependent repositories - 8.18 thousand downloads last month - 8,866 stars on GitHub - 1 maintainer
pylibcudf-cu11 25.4.0
pylibcudf - Python bindings for libcudf
6 versions - Latest release: 9 days ago - 1.91 thousand downloads last month - 8,866 stars on GitHub - 2 maintainers
Top 5.3% on pypi.org
cudf-cu11 25.4.0
cuDF - GPU Dataframe
30 versions - Latest release: 9 days ago - 11 dependent packages - 2 dependent repositories - 4.7 thousand downloads last month - 8,866 stars on GitHub - 3 maintainers
cudf-polars-cu12 25.4.0
Executor for polars using cudf
8 versions - Latest release: 9 days ago - 4.17 thousand downloads last month - 8,866 stars on GitHub - 2 maintainers
pylibcudf-cu12 25.4.0
pylibcudf - Python bindings for libcudf
6 versions - Latest release: 9 days ago - 7.94 thousand downloads last month - 8,866 stars on GitHub - 2 maintainers
Top 7.6% on pypi.org
dask-cudf-cu11 25.4.0
Utilities for Dask and cuDF interactions
28 versions - Latest release: 9 days ago - 4 dependent packages - 2 dependent repositories - 2.11 thousand downloads last month - 8,860 stars on GitHub - 1 maintainer
cudf-polars-cu11 25.4.0
Executor for polars using cudf
8 versions - Latest release: 9 days ago - 272 downloads last month - 8,860 stars on GitHub - 2 maintainers
libcudf-cu11 25.4.0
cuDF - GPU Dataframe (C++)
6 versions - Latest release: 9 days ago - 1.83 thousand downloads last month - 8,860 stars on GitHub - 2 maintainers
Top 0.3% on pypi.org
pydata-sphinx-theme 0.16.1
Bootstrap-based Sphinx theme from the PyData community
60 versions - Latest release: 4 months ago - 709 dependent packages - 2,958 dependent repositories - 1.77 million downloads last month - 523 stars on GitHub - 4 maintainers
Top 0.7% on pypi.org
distributed 2025.3.0 💰
Distributed scheduler for Dask
247 versions - Latest release: 29 days ago - 307 dependent packages - 7,307 dependent repositories - 3.97 million downloads last month - 1,550 stars on GitHub - 8 maintainers
Top 2.0% on pypi.org
pyjanitor 0.31.0
Tools for cleaning pandas DataFrames
66 versions - Latest release: about 1 month ago - 9 dependent packages - 66 dependent repositories - 91.8 thousand downloads last month - 1,411 stars on GitHub - 4 maintainers
dask-histogram 2025.2.0
Histogramming with Dask.
37 versions - Latest release: 2 months ago - 2 dependent packages - 1 dependent repositories - 159 thousand downloads last month - 22 stars on GitHub - 5 maintainers
Top 1.2% on pypi.org
koalas 1.8.2
Koalas: pandas API on Apache Spark
47 versions - Latest release: over 3 years ago - 11 dependent packages - 444 dependent repositories - 1.64 million downloads last month - 3,351 stars on GitHub - 7 maintainers
pydatasentry 0.1.4
Memory tool for Python-Based Data Science
2 versions - Latest release: over 9 years ago - 2 dependent repositories - 53 downloads last month - 5 stars on GitHub - 1 maintainer
Top 0.9% on pypi.org
pandas-datareader 0.10.0
Data readers extracted from the pandas codebase,should be compatible with recent pandas versions
22 versions - Latest release: almost 4 years ago - 73 dependent packages - 3,913 dependent repositories - 359 thousand downloads last month - 3,024 stars on GitHub - 2 maintainers
Top 6.3% on pypi.org
python-graphblas 2025.2.0
Python library for GraphBLAS: high-performance sparse linear algebra for scalable graph analytics
24 versions - Latest release: 2 months ago - 4 dependent packages - 2 dependent repositories - 3.09 thousand downloads last month - 132 stars on GitHub - 2 maintainers
Top 8.5% on pypi.org
sgkit 0.10.0
Statistical genetics toolkit
10 versions - Latest release: 12 days ago - 2 dependent packages - 5 dependent repositories - 2.31 thousand downloads last month - 253 stars on GitHub - 1 maintainer
pandas-selectable 1.2.0
Add a select accessor to pandas
6 versions - Latest release: almost 2 years ago - 1 dependent repositories - 264 downloads last month - 34 stars on GitHub - 1 maintainer
tba-pydata 0.1.1
Wrapper for working with the Blue Alliance API in pandas
2 versions - Latest release: almost 7 years ago - 1 dependent repositories - 46 downloads last month - 0 stars on GitHub - 1 maintainer
molar 0.4.5
"A database to store chemical data"
23 versions - Latest release: about 2 years ago - 1 dependent repositories - 351 downloads last month - 10 stars on GitHub - 1 maintainer
Top 4.1% on pypi.org
pymapd 0.26.0
A wrapper for pyomnisci for backwards compatibility.
42 versions - Latest release: over 3 years ago - 1 dependent package - 92 dependent repositories - 1.92 thousand downloads last month - 111 stars on GitHub - 2 maintainers
Top 10.0% on pypi.org
vtreat 1.3.1
vtreat is a pandas.DataFrame processor/conditioner that prepares real-world data for predictive m...
24 versions - Latest release: 10 months ago - 2 dependent packages - 1 dependent repositories - 1.71 thousand downloads last month - 121 stars on GitHub - 1 maintainer
matrepr 1.0.1
Format matrices and tensors to HTML, string, and LaTeX, with Jupyter integration.
16 versions - Latest release: 10 months ago - 2 dependent packages - 929 downloads last month - 13 stars on GitHub - 1 maintainer
pygdf 0.1.0a1
GPU Dataframe
1 version - Latest release: almost 8 years ago - 1 dependent repositories - 36 downloads last month - 8,845 stars on GitHub - 1 maintainer
libcudf-cu12 25.4.0
cuDF - GPU Dataframe (C++)
6 versions - Latest release: 9 days ago - 7.56 thousand downloads last month - 8,845 stars on GitHub
grblas 2022.4.0
Python interface to GraphBLAS
19 versions - Latest release: about 3 years ago - 1 dependent package - 1 dependent repositories - 877 downloads last month - 115 stars on GitHub - 2 maintainers
graphblas-algorithms 2023.10.0
Graph algorithms written in GraphBLAS and backend for NetworkX
9 versions - Latest release: over 1 year ago - 1 dependent repositories - 1.51 thousand downloads last month - 79 stars on GitHub - 2 maintainers
pydataproject 1.0.0
A pydata Python project
1 version - Latest release: about 3 years ago - 1 dependent repositories - 35 downloads last month - 1 stars on GitHub - 1 maintainer
Top 5.6% on pypi.org
donfig 0.8.1
Python package for configuring a python package
13 versions - Latest release: almost 2 years ago - 16 dependent packages - 17 dependent repositories - 307 thousand downloads last month - 38 stars on GitHub - 4 maintainers
graphtype 0.1.0 💰
Enforce graph, node and edge attribute types on NetworkX Graphs.
2 versions - Latest release: almost 3 years ago - 114 downloads last month - 6 stars on GitHub - 1 maintainer
array-api-stubs 0.0.2
Stubs for the array API standard
1 version - Latest release: over 2 years ago - 70 downloads last month - 232 stars on GitHub - 1 maintainer
pydata 1.0.0
PyData Location List
1 version - Latest release: almost 7 years ago - 2 dependent repositories - 283 downloads last month - 0 stars on GitHub - 1 maintainer
amzon.line 1.0.0
LINE Location List
1 version - Latest release: almost 5 years ago - 36 downloads last month - 0 stars on GitHub - 1 maintainer
xarray-ms 0.2.5
xarray MSv4 views over MSv2 Measurement Sets
16 versions - Latest release: 26 days ago - 1 dependent repositories - 634 downloads last month - 1 stars on GitHub - 2 maintainers
cz-pydata 0.3.0
Commitizen plugin for PyData-style commits
10 versions - Latest release: about 1 year ago - 226 downloads last month - 0 stars on GitHub - 1 maintainer
mapshader 0.1.3
Simple Python GIS Web Services
13 versions - Latest release: over 2 years ago - 1 dependent repositories - 275 downloads last month - 42 stars on GitHub - 1 maintainer
kartothek 5.3.0
A consistent table management library in python
41 versions - Latest release: over 3 years ago - 1 dependent repositories - 540 downloads last month - 160 stars on GitHub - 3 maintainers
regallager 0.0.1
A consistent table management library in python
1 version - Latest release: almost 4 years ago - 1 dependent repositories - 18 downloads last month - 160 stars on GitHub - 1 maintainer
impyla-jz 0.16.3
Python client for the Impala distributed query engine
1 version - Latest release: almost 4 years ago - 1 dependent repositories - 46 downloads last month - 0 stars on GitHub - 1 maintainer
ym-impyla 0.14.0
Python client for the Impala distributed query engine
1 version - Latest release: over 8 years ago - 1 dependent repositories - 38 downloads last month - 1 stars on GitHub - 1 maintainer
Top 8.5% on pypi.org
binpickle 0.3.4
Optimized format for pickling binary data.
13 versions - Latest release: almost 4 years ago - 2 dependent packages - 4 dependent repositories - 1.04 thousand downloads last month - 1 stars on GitHub - 1 maintainer