Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "dataset" keyword

pygdg 0.1.6
A simple comand line tool to create game events data for analytics and machine learning use cases
6 versions - Latest release: almost 2 years ago - 1 dependent repositories - 60 downloads last month - 0 stars on GitHub - 1 maintainer
lsat 0.0.1
A small example package
1 version - Latest release: almost 2 years ago - 2 dependent repositories - 16 downloads last month - 9 stars on GitHub - 1 maintainer
Top 7.1% on pypi.org
unihan-etl 0.34.0
Export UNIHAN data of Chinese, Japanese, Korean to CSV, JSON or YAML
55 versions - Latest release: about 2 months ago - 3 dependent packages - 7 dependent repositories - 743 downloads last month - 51 stars on GitHub - 1 maintainer
scieloscopus 1.0.0
Library to delivery Scopus and Scimago indicators of SciELO Journals
1 version - Latest release: over 6 years ago - 1 dependent repositories - 9 downloads last month - 2 stars on GitHub - 1 maintainer
Top 1.8% on pypi.org
cvat-sdk 2.13.0
CVAT REST API
35 versions - Latest release: 10 days ago - 4 dependent packages - 54 dependent repositories - 13.9 thousand downloads last month - 11,417 stars on GitHub - 3 maintainers
Top 0.7% on pypi.org
torchtext 0.18.0
Text utilities, models, transforms, and datasets for PyTorch.
33 versions - Latest release: 25 days ago - 92 dependent packages - 2,976 dependent repositories - 705 thousand downloads last month - 3,450 stars on GitHub - 4 maintainers
hscitorchutil 0.1.64
HSCI research group utilities for pytorch (lightning)
15 versions - Latest release: about 1 month ago - 201 downloads last month - 0 stars on GitHub - 1 maintainer
pit30m 0.0.2
Development kit for the Pit30M large scale localization dataset
2 versions - Latest release: 11 months ago - 24 downloads last month - 13 stars on GitHub - 1 maintainer
data-understand 0.0.6
Utility package for generating insights for datasets
7 versions - Latest release: 8 months ago - 49 downloads last month - 0 stars on GitHub - 1 maintainer
Top 9.6% on pypi.org
crowsetta 5.0.2
A Python tool to work with any format for annotating animal vocalizations and bioacoustics data
32 versions - Latest release: 4 months ago - 4 dependent packages - 5 dependent repositories - 615 downloads last month - 48 stars on GitHub - 1 maintainer
holcrawl 1.0.1
A crawler for building Hollywood movies datsets.
2 versions - Latest release: about 7 years ago - 1 dependent repositories - 13 downloads last month - 9 stars on GitHub - 1 maintainer
coco-merger 0.0.2
Python package which aims to merge 2 COCO .json files
2 versions - Latest release: over 1 year ago - 85 downloads last month - 37 stars on GitHub - 1 maintainer
waymo-open-dataset-2-0-0 1.0.1
Waymo Open Dataset libraries.
1 version - Latest release: over 4 years ago - 1 dependent package - 1 dependent repositories - 22 downloads last month - 1 maintainer
iden 0.0.3
simple library to manage a dataset of shards to train machine learning models
9 versions - Latest release: 2 months ago - 2 dependent packages - 53.4 thousand downloads last month - 0 stars on GitHub - 1 maintainer
Top 5.8% on pypi.org
pylabel 0.1.55 💰
Transform, analyze, and visualize computer vision annotations.
57 versions - Latest release: 6 months ago - 1 dependent package - 3 dependent repositories - 2.86 thousand downloads last month - 297 stars on GitHub - 1 maintainer
orangelab 0.0.1
The Python module developed for the Orange Python Tool Plugin serves a dual purpose, providing fu...
1 version - Latest release: 6 months ago - 13 downloads last month - 0 stars on GitHub - 1 maintainer
tsp 1.7.3
Making permafrost data effortless
29 versions - Latest release: 8 months ago - 12 dependent repositories - 395 downloads last month - 5 stars on GitLab.com - 1 maintainer
open-mastr 0.14.3
A package that provides an interface for downloading and processing the data of the Marktstammdat...
14 versions - Latest release: 25 days ago - 1.63 thousand downloads last month - 65 stars on GitHub - 1 maintainer
climateserv 1.0.5
This is a package to access the ClimateSERV API](https://climateserv.servirglobal.net/)
17 versions - Latest release: 2 months ago - 1 dependent repositories - 145 downloads last month - 6 stars on GitHub - 1 maintainer
Top 6.6% on pypi.org
flwr-datasets 0.1.0
Flower Datasets
3 versions - Latest release: 2 months ago - 7 dependent repositories - 4.28 thousand downloads last month - 3,924 stars on GitHub - 2 maintainers
openfisca-uk-data 0.9.0 💰
A Python package to manage OpenFisca-UK-compatible microdata
20 versions - Latest release: about 2 years ago - 1 dependent repositories - 202 downloads last month - 0 stars on GitHub - 1 maintainer
Top 0.3% on pypi.org
faker 25.2.0 💰
Faker is a Python package that generates fake data for you.
361 versions - Latest release: 6 days ago - 382 dependent packages - 15,807 dependent repositories - 15.4 million downloads last month - 16,716 stars on GitHub - 2 maintainers
geode-ml 2.7.2
Classes and methods to help with the creation of geospatial training datasets and deep-learning m...
52 versions - Latest release: 10 months ago - 216 downloads last month - 0 stars on GitHub - 1 maintainer
mhm 5.13.1
Python distribution of mHM with bindings.
3 versions - Latest release: 9 months ago - 1 dependent package - 159 downloads last month - 214 stars on GitHub - 2 maintainers
wildlife-datasets 1.0.3
Library for easier access and research of wildlife re-identification datasets
49 versions - Latest release: 10 days ago - 1 dependent package - 523 downloads last month - 42 stars on GitHub - 1 maintainer
lshkrepresentatives 1.2.3
LSH-k-Representatives: Mixed categorial and numerical (ordinal and nonordinal) data clustering al...
12 versions - Latest release: about 23 hours ago - 1 dependent repositories - 58 downloads last month - 4 stars on GitHub - 1 maintainer
Top 4.0% on pypi.org
doccano 1.8.4 💰
doccano, text annotation tool for machine learning practitioners
31 versions - Latest release: 10 months ago - 7 dependent repositories - 10.3 thousand downloads last month - 8,436 stars on GitHub - 1 maintainer
pycovjson 0.3.9
Create CovJSON files from common scientific data formats
8 versions - Latest release: over 7 years ago - 1 dependent repositories - 43 downloads last month - 11 stars on GitHub - 3 maintainers
waymo-od-tf1-15 1.0.1
Waymo Open Dataset libraries.
3 versions - Latest release: over 4 years ago - 1 dependent package - 1 dependent repositories - 34 downloads last month - 1 maintainer
kaggle-dataset-creator 0.0.1
A Python package to generate csv/json from command line. It allows you to create CSV/JSON files b...
1 version - Latest release: almost 5 years ago - 1 dependent repositories - 20 downloads last month - 0 stars on GitHub - 1 maintainer
flexclash 0.0.1
Federated Learning (FL) experiment simulation in Python.
1 version - Latest release: 2 months ago - 32 downloads last month - 11 stars on GitHub - 1 maintainer
flexanomalies 0.0.2
Federated Learning (FL) experiment simulation in Python.
2 versions - Latest release: 2 months ago - 19 downloads last month - 11 stars on GitHub - 2 maintainers
nlprep 0.2.1
Download and pre-processing data for nlp tasks
70 versions - Latest release: almost 3 years ago - 1 dependent repositories - 635 downloads last month - 28 stars on GitHub - 1 maintainer
Top 7.9% on pypi.org
nas-bench-201 2.1
API for NAS-Bench-201 (a benchmark for neural architecture search).
6 versions - Latest release: over 3 years ago - 9 dependent repositories - 267 downloads last month - 619 stars on GitHub - 1 maintainer
Top 9.1% on pypi.org
soundata 0.1.3
Python library for loading and working with sound datasets.
16 versions - Latest release: 3 months ago - 2 dependent repositories - 1.07 thousand downloads last month - 269 stars on GitHub - 2 maintainers
Top 2.5% on pypi.org
rdata 0.11.2
Read R datasets from Python.
17 versions - Latest release: 3 months ago - 8 dependent packages - 17 dependent repositories - 52.4 thousand downloads last month - 34 stars on GitHub - 1 maintainer
Top 10.0% on pypi.org
chazutsu 0.8.2
The tool to make NLP datasets ready to use
20 versions - Latest release: about 5 years ago - 2 dependent repositories - 151 downloads last month - 243 stars on GitHub - 1 maintainer
Top 4.9% on pypi.org
retriever 3.1.0
Data Retriever
12 versions - Latest release: about 2 years ago - 3 dependent repositories - 5.51 thousand downloads last month - 302 stars on GitHub - 4 maintainers
ekpy 0.1.14
A collection of control and analysis code for experiments
11 versions - Latest release: almost 2 years ago - 1 dependent repositories - 67 downloads last month - 7 stars on GitHub - 1 maintainer
Top 2.2% on pypi.org
colour-science 0.4.4 💰
Colour Science for Python
23 versions - Latest release: 5 months ago - 23 dependent packages - 94 dependent repositories - 99.2 thousand downloads last month - 1,969 stars on GitHub - 1 maintainer
Top 9.2% on pypi.org
colour-datasets 0.2.5 💰
Colour science datasets for use with Colour
8 versions - Latest release: 5 months ago - 1 dependent package - 4 dependent repositories - 134 downloads last month - 53 stars on GitHub - 1 maintainer
Top 8.9% on pypi.org
audb 1.7.2
Load and publish databases in audformat
38 versions - Latest release: 3 days ago - 1 dependent package - 4 dependent repositories - 2.39 thousand downloads last month - 20 stars on GitHub - 1 maintainer
bambird 0.3.0
BAM, unsupervised labelling function to extract and cluster similar animal vocalizations together
3 versions - Latest release: over 1 year ago - 46 downloads last month - 17 stars on GitHub - 1 maintainer
solidago 0.1.1 💰
Algorithms for Secure Algorithmic Governance
9 versions - Latest release: 26 days ago - 150 downloads last month - 314 stars on GitHub - 2 maintainers
Top 1.6% on pypi.org
label-studio 1.12.0
Label Studio annotation tool
184 versions - Latest release: about 1 month ago - 1 dependent package - 39 dependent repositories - 42.6 thousand downloads last month - 15,269 stars on GitHub - 1 maintainer
Top 6.6% on pypi.org
scipp 24.2.0
Multi-dimensional data arrays with labeled dimensions
42 versions - Latest release: 3 months ago - 12 dependent packages - 3 dependent repositories - 9.94 thousand downloads last month - 106 stars on GitHub - 3 maintainers
ekpmeasure 0.1.7
A collection of control and analysis code for experiments
21 versions - Latest release: over 2 years ago - 1 dependent repositories - 94 downloads last month - 7 stars on GitHub - 1 maintainer
Top 6.3% on pypi.org
sidechainnet 1.0.1
Tools and data for all-atom protein structure prediction via machine learning.
11 versions - Latest release: 7 months ago - 1 dependent package - 5 dependent repositories - 534 downloads last month - 296 stars on GitHub - 1 maintainer
Top 2.8% on pypi.org
split-folders 0.5.1
Split folders with files (e.g. images) into training, validation and test (dataset) folders.
12 versions - Latest release: over 2 years ago - 4 dependent packages - 91 dependent repositories - 26.4 thousand downloads last month - 406 stars on GitHub - 1 maintainer
pietoolbelt 0.3.26
Toolbelt for PiePline training pipeline
29 versions - Latest release: over 2 years ago - 2 dependent repositories - 205 downloads last month - 1 stars on GitHub - 1 maintainer
Top 0.9% on pypi.org
pandas-datareader 0.10.0
Data readers extracted from the pandas codebase,should be compatible with recent pandas versions
22 versions - Latest release: almost 3 years ago - 73 dependent packages - 3,913 dependent repositories - 476 thousand downloads last month - 2,811 stars on GitHub - 2 maintainers
ds-format 4.1.0
ds-format is an open source program, a Python package and a storage format which provides an inte...
24 versions - Latest release: 3 months ago - 4 dependent repositories - 220 downloads last month - 1 stars on GitHub - 1 maintainer
Top 8.4% on pypi.org
dali-dataset 1.0.0
Code for working with the DALI dataset
2 versions - Latest release: almost 5 years ago - 2 dependent packages - 3 dependent repositories - 135 downloads last month - 310 stars on GitHub - 1 maintainer
llm-dataset-converter 0.2.3
Python3 library for converting between various LLM dataset formats.
11 versions - Latest release: 13 days ago - 1 dependent repositories - 288 downloads last month - 5 stars on GitHub - 1 maintainer
Top 7.7% on pypi.org
starwhale 0.6.13
An MLOps Platform for Model Evaluation
181 versions - Latest release: 4 months ago - 6 dependent repositories - 428 downloads last month - 187 stars on GitHub - 1 maintainer
starwhale-bootstrap 0.2.2b6
MLOps Platform
65 versions - Latest release: almost 2 years ago - 1 dependent repositories - 672 downloads last month - 187 stars on GitHub - 1 maintainer
Top 4.9% on pypi.org
tensorflow-io-gcs-filesystem-nightly 0.31.0.dev20230309180344
TensorFlow IO
160 versions - Latest release: about 1 year ago - 1 dependent package - 3 dependent repositories - 9.33 thousand downloads last month - 690 stars on GitHub - 3 maintainers
Top 1.4% on pypi.org
tensorflow-io 0.37.0
TensorFlow IO
44 versions - Latest release: 18 days ago - 19 dependent packages - 293 dependent repositories - 3.89 million downloads last month - 690 stars on GitHub - 6 maintainers
Top 5.0% on pypi.org
tensorflow-io-nightly 0.31.0.dev20230309180344
TensorFlow IO
902 versions - Latest release: about 1 year ago - 3 dependent repositories - 29.9 thousand downloads last month - 690 stars on GitHub - 5 maintainers
Top 10.0% on pypi.org
tensorflow-io-plugin-gs-nightly 0.18.0.dev20210513213318
TensorFlow IO
29 versions - Latest release: about 3 years ago - 1 dependent package - 1 dependent repositories - 1.23 thousand downloads last month - 690 stars on GitHub - 4 maintainers
Top 1.3% on pypi.org
tensorflow-io-gcs-filesystem 0.37.0
TensorFlow IO
22 versions - Latest release: 18 days ago - 105 dependent packages - 5,470 dependent repositories - 14.3 million downloads last month - 690 stars on GitHub - 3 maintainers
Top 1.2% on pypi.org
whylogs 1.4.0
Profile and monitor your ML data pipeline end-to-end
312 versions - Latest release: 5 days ago - 6 dependent packages - 413 dependent repositories - 441 thousand downloads last month - 2,482 stars on GitHub - 4 maintainers
geda 0.1.17
Get Data for you projects with just three lines of code. Currently suppored datasets: Pascal VOC,...
18 versions - Latest release: 5 months ago - 1 dependent repositories - 146 downloads last month - 0 stars on GitHub - 1 maintainer
gigawork 1.3.0
A tool for extracting GitHub Actions workflows
5 versions - Latest release: about 2 months ago - 43 downloads last month - 2 stars on GitHub - 1 maintainer
ut-course-catalog 0.2.16 💰
Python package for fetching UTokyo Online Course Catalogue
19 versions - Latest release: about 2 months ago - 180 downloads last month - 0 stars on GitHub - 1 maintainer
Top 6.2% on pypi.org
randfacts 0.21.0 💰
Package to generate random facts
56 versions - Latest release: 6 months ago - 5 dependent packages - 24 dependent repositories - 10.3 thousand downloads last month - 18 stars on GitHub - 1 maintainer
flights-time-series-dataset 1.1.10
Flights time series dataset for time-series-predictor.
19 versions - Latest release: about 3 years ago - 1 dependent repositories - 168 downloads last month - 0 stars on GitHub - 1 maintainer
amid 0.13.0
A curated list of medical imaging datasets with unified interfaces
18 versions - Latest release: 2 months ago - 1 dependent repositories - 85 downloads last month - 33 stars on GitHub - 1 maintainer
Top 8.5% on pypi.org
segments-ai 1.8.1
Segments.ai Python SDK
131 versions - Latest release: 5 days ago - 1 dependent package - 3 dependent repositories - 4.21 thousand downloads last month - 20 stars on GitHub - 1 maintainer
ws-benchmark 1.1.1
a weak supervision learning benchmark
2 versions - Latest release: about 2 years ago - 1 dependent repositories - 36 downloads last month - 211 stars on GitHub - 2 maintainers
ab-data-processing 0.0.1
Data Processing is used for data processing through MinIO, databases, Web APIs, etc.
1 version - Latest release: 4 months ago - 26 downloads last month - 43 stars on GitHub - 1 maintainer
connectome 0.10.0
A library for datasets containing heterogeneous data
34 versions - Latest release: about 1 month ago - 1 dependent repositories - 246 downloads last month - 12 stars on GitHub - 2 maintainers
one-data-processing 0.0.14
Data Processing is used for data processing through MinIO, databases, Web APIs, etc.
14 versions - Latest release: 4 months ago - 104 downloads last month - 43 stars on GitHub - 1 maintainer
mddatasetbuilder 1.3.8
A script to generate molecular dynamics (MD) datasets for machine learning from given LAMMPS traj...
27 versions - Latest release: 8 months ago - 1 dependent repositories - 282 downloads last month - 36 stars on GitHub - 1 maintainer
a-data-processing 0.0.1
A library that prepares raw documents for downstream ML tasks.
1 version - Latest release: 4 months ago - 29 downloads last month - 43 stars on GitHub - 1 maintainer
ragas-once 0.0.1
A one-step Ragas cli tool to evaluate RAG apps
1 version - Latest release: 4 months ago - 10 downloads last month - 41 stars on GitHub - 1 maintainer
tf-datasets 0.0.1
tensorflow/datasets
1 version - Latest release: over 5 years ago - 1 dependent repositories - 9 downloads last month - 4,084 stars on GitHub - 1 maintainer
dao-scripts 1.2.2
"A tool to download data to monitor DAO activity"
25 versions - Latest release: 5 months ago - 1 dependent package - 450 downloads last month - 0 stars on GitHub - 2 maintainers
aimfast 1.3.4
An Astronomical Image Fidelity Assessment Tool.
17 versions - Latest release: 11 months ago - 1 dependent repositories - 133 downloads last month - 3 stars on GitHub - 1 maintainer
easy-vqa 1.0
The official package for the easy-VQA dataset.
2 versions - Latest release: over 4 years ago - 2 dependent repositories - 275 downloads last month - 32 stars on GitHub - 1 maintainer
graphite-datasets 1.0.59
tensorflow/datasets is a library of datasets ready to use with TensorFlow.
60 versions - Latest release: 3 months ago - 391 downloads last month - 4,157 stars on GitHub - 1 maintainer
Top 5.4% on pypi.org
cpi 1.1.5 💰
Quickly adjust U.S. dollars for inflation using the Consumer Price Index (CPI)
52 versions - Latest release: 4 days ago - 1 dependent package - 21 dependent repositories - 21.5 thousand downloads last month - 127 stars on GitHub - 1 maintainer
Top 5.0% on pypi.org
clip-retrieval 2.44.0
Easily computing clip embeddings and building a clip retrieval system with them
86 versions - Latest release: 4 months ago - 3 dependent repositories - 5.68 thousand downloads last month - 2,163 stars on GitHub - 1 maintainer
Top 3.5% on pypi.org
torchxrayvision 1.2.3 💰
TorchXRayVision: A library of chest X-ray datasets and models
43 versions - Latest release: 5 days ago - 1 dependent package - 31 dependent repositories - 13.6 thousand downloads last month - 775 stars on GitHub - 2 maintainers
moviechat 0.6.3
Long video understanding
10 versions - Latest release: 29 days ago - 1.13 thousand downloads last month - 408 stars on GitHub - 1 maintainer
rstojnic-tfds-nightly 4.6.0.dev202206140947
tensorflow/datasets is a library of datasets ready to use with TensorFlow.
3 versions - Latest release: almost 2 years ago - 41 downloads last month - 4,084 stars on GitHub - 1 maintainer
waymo-open-dataset-tf-2-12-0 1.6.4
Waymo Open Dataset
3 versions - Latest release: about 1 month ago - 1.01 thousand downloads last month - 2,551 stars on GitHub - 1 maintainer
tlidb 1.0.3
The Transfer Learning in Dialogue Baselines Toolkit
8 versions - Latest release: about 2 years ago - 1 dependent repositories - 46 downloads last month - 13 stars on GitHub - 1 maintainer
cmem-plugin-kaggle 2.0.0
Import dataset resources from Kaggle.
4 versions - Latest release: 10 months ago - 45 downloads last month - 0 stars on GitHub - 3 maintainers
Top 6.3% on pypi.org
kitti2bag 1.1.1
Convert KITTI dataset to ROS bag file the easy way!
7 versions - Latest release: over 7 years ago - 5 dependent repositories - 359 downloads last month - 683 stars on GitHub - 1 maintainer
labelme2yolov7segmentation 2.0.5
Conver labelme annotation format to yolov7 annotation format for segmentation.
10 versions - Latest release: 8 months ago - 111 downloads last month - 4 stars on GitHub - 1 maintainer
dldummygen 0.0.2 💰
Deep-Learning Dummy File Generator by csv File
1 version - Latest release: over 3 years ago - 1 dependent repositories - 24 downloads last month - 17,156 stars on GitHub - 1 maintainer
Top 0.8% on pypi.org
fake-factory 9999.9.9 💰
The `fake-factory` package was deprecated on December 15th, 2016. Use the `Faker` package instead.
22 versions - Latest release: over 7 years ago - 7 dependent packages - 806 dependent repositories - 57.4 thousand downloads last month - 16,716 stars on GitHub - 2 maintainers
lfake 18.9.0 removed 💰
Fake data
1 version - Latest release: almost 1 year ago - 13 downloads last month - 16,716 stars on GitHub - 1 maintainer
Top 7.1% on pypi.org
waymo-open-dataset-tf-2-11-0 1.6.1
Waymo Open Dataset
5 versions - Latest release: 5 months ago - 1 dependent package - 1 dependent repositories - 31 thousand downloads last month - 2,546 stars on GitHub - 1 maintainer
ua-gec 2.1.3
UA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrainian language
9 versions - Latest release: 3 months ago - 1 dependent package - 1 dependent repositories - 166 downloads last month - 254 stars on GitHub - 1 maintainer
Top 1.0% on pypi.org
tfds-nightly 4.9.4.dev202401070044
tensorflow/datasets is a library of datasets ready to use with TensorFlow.
1,859 versions - Latest release: 4 months ago - 13 dependent packages - 296 dependent repositories - 1.35 million downloads last month - 4,085 stars on GitHub - 8 maintainers
Top 0.6% on pypi.org
tensorflow-datasets 4.9.4
tensorflow/datasets is a library of datasets ready to use with TensorFlow.
33 versions - Latest release: 5 months ago - 116 dependent packages - 3,946 dependent repositories - 4.14 million downloads last month - 4,085 stars on GitHub - 8 maintainers
tehran-stocks 2.0.0
Data Downloader for Tehran stock market
26 versions - Latest release: 10 months ago - 1 dependent repositories - 405 downloads last month - 450 stars on GitHub - 1 maintainer
stringzilla 3.8.3
SIMD-accelerated string search, sort, hashes, fingerprints, & edit distances
37 versions - Latest release: 22 days ago - 1 dependent package - 23.8 thousand downloads last month - 1,749 stars on GitHub - 1 maintainer