An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "dataset" keyword

View the packages on the pypi.org package registry that are tagged with the "dataset" keyword.

data-understand 0.0.6
Utility package for generating insights for datasets
7 versions - Latest release: over 1 year ago - 286 downloads last month - 0 stars on GitHub - 1 maintainer
Top 1.8% on pypi.org
cvat-sdk 2.34.0
CVAT REST API
69 versions - Latest release: 1 day ago - 4 dependent packages - 54 dependent repositories - 42.9 thousand downloads last month - 13,542 stars on GitHub - 1 maintainer
daget 0.5
Download dataset via DOI or landing page url
5 versions - Latest release: over 1 year ago - 208 downloads last month - 1 stars on GitHub - 1 maintainer
ckanext-datarequests 1.1.0
CKAN Extension - Data Requests
11 versions - Latest release: over 6 years ago - 2 dependent repositories - 220 downloads last month - 17 stars on GitHub - 2 maintainers
stringzilla 3.12.5
SIMD-accelerated string search, sort, hashes, fingerprints, & edit distances
69 versions - Latest release: about 4 hours ago - 1 dependent package - 1.36 million downloads last month - 1,749 stars on GitHub - 1 maintainer
Top 0.8% on pypi.org
fake-factory 9999.9.9 💰
The `fake-factory` package was deprecated on December 15th, 2016. Use the `Faker` package instead.
22 versions - Latest release: over 8 years ago - 7 dependent packages - 806 dependent repositories - 51.5 thousand downloads last month - 16,716 stars on GitHub - 2 maintainers
Top 2.9% on pypi.org
dataprofiler 0.13.3
What is in your data? Detect schema, statistics and entities in almost any file.
57 versions - Latest release: about 1 month ago - 2 dependent packages - 28 dependent repositories - 23 thousand downloads last month - 1,464 stars on GitHub - 3 maintainers
Top 1.6% on pypi.org
label-studio 1.17.0
Label Studio annotation tool
192 versions - Latest release: 11 days ago - 1 dependent package - 39 dependent repositories - 80.1 thousand downloads last month - 21,528 stars on GitHub - 1 maintainer
Top 0.7% on pypi.org
torchtext 0.18.0
Text utilities, models, transforms, and datasets for PyTorch.
33 versions - Latest release: 12 months ago - 92 dependent packages - 2,976 dependent repositories - 898 thousand downloads last month - 3,513 stars on GitHub - 4 maintainers
gigawork 1.4.2
A tool for extracting GitHub Actions workflows
7 versions - Latest release: 6 months ago - 299 downloads last month - 3 stars on GitHub - 1 maintainer
kern-rowduction 0.0.4
Kern Rowduction - A package to reduce the number of rows / undersample the (imbalanced) datas...
4 versions - Latest release: about 3 years ago - 1 dependent repositories - 82 downloads last month - 1 stars on GitHub - 1 maintainer
ccagt-utils 0.1.1
A framework of utilities to help at the use of the CCAgT dataset
11 versions - Latest release: almost 2 years ago - 385 downloads last month - 2 stars on GitHub - 1 maintainer
bambird 0.3.0
BAM, unsupervised labelling function to extract and cluster similar animal vocalizations together
3 versions - Latest release: over 2 years ago - 137 downloads last month - 26 stars on GitHub - 1 maintainer
Top 1.4% on pypi.org
tensorflow-io 0.37.1
TensorFlow IO
45 versions - Latest release: 10 months ago - 19 dependent packages - 293 dependent repositories - 761 thousand downloads last month - 690 stars on GitHub - 6 maintainers
ml-pyxis 0.4.dev0
Tool for reading and writing datasets of tensors with MessagePack and Lightning Memory-Mapped Dat...
1 version - Latest release: almost 4 years ago - 2 dependent repositories - 34 downloads last month - 117 stars on GitHub - 1 maintainer
gonk-ai 0.1.4
Generic backend for dataset annotation.
4 versions - Latest release: over 1 year ago - 184 downloads last month - 5 stars on GitHub - 1 maintainer
dataset-builder-for-segmentation 0.0.3
A dataset pipeline builder for semantic image segmentation
3 versions - Latest release: over 3 years ago - 1 dependent repositories - 90 downloads last month - 3 stars on GitHub - 1 maintainer
Top 9.8% on pypi.org
synergy-dataset 1.0.3 💰
Python package for the SYNERGY dataset
12 versions - Latest release: almost 2 years ago - 1 dependent package - 1 dependent repositories - 6.44 thousand downloads last month - 76 stars on GitHub - 1 maintainer
cwru 0.2
Case Western Reserve University Bearing Data
2 versions - Latest release: almost 9 years ago - 2 dependent repositories - 84 downloads last month - 161 stars on GitHub - 1 maintainer
referit 1.0.0
Python wrapper to load the ReferIt Game dataset
1 version - Latest release: over 7 years ago - 1 dependent repositories - 79 downloads last month - 498 stars on GitHub - 1 maintainer
mapstery 0.0.7
Digital terrain map manipulation high level functions
7 versions - Latest release: about 6 years ago - 2 dependent repositories - 147 downloads last month - 1 maintainer
pygdg 0.1.6
A simple comand line tool to create game events data for analytics and machine learning use cases
6 versions - Latest release: almost 3 years ago - 1 dependent repositories - 269 downloads last month - 0 stars on GitHub - 1 maintainer
goes-dl 0.2rc4
GOES-DL — GOES Satellite Imagery Dataset Accessing Toolbox
9 versions - Latest release: about 15 hours ago - 257 downloads last month - 1 stars on GitHub - 1 maintainer
eurocropsml 0.4.0
EuroCropsML is a ready-to-use benchmark dataset for few-shot crop type classification using Senti...
5 versions - Latest release: 3 days ago - 179 downloads last month - 12 stars on GitHub - 1 maintainer
wildlife-datasets 1.0.6
Library for easier access and research of wildlife re-identification datasets
52 versions - Latest release: 4 days ago - 1 dependent package - 2.04 thousand downloads last month - 96 stars on GitHub - 2 maintainers
open-mastr 0.14.5
A package that provides an interface for downloading and processing the data of the Marktstammdat...
16 versions - Latest release: 6 months ago - 1.58 thousand downloads last month - 77 stars on GitHub - 1 maintainer
agml 0.7.3
A comprehensive library for agricultural deep learning
28 versions - Latest release: about 1 month ago - 1 dependent repositories - 1.58 thousand downloads last month - 208 stars on GitHub - 2 maintainers
tsp 1.7.7
Making permafrost data effortless
30 versions - Latest release: 28 days ago - 12 dependent repositories - 761 downloads last month - 5 stars on gitlab.com - 1 maintainer
eccv-caption 0.1.0
A PyThon wrapper for Extended COCO Validation (ECCV) Caption dataset
1 version - Latest release: about 3 years ago - 85 downloads last month - 56 stars on GitHub - 1 maintainer
bdd100-to-yolo 0.0.3
Invert BDD100 dataset to YOLO dataset
3 versions - Latest release: 4 months ago - 59 downloads last month - 0 stars on GitHub - 1 maintainer
d3d 0.1.0rc0 💰
Customized tools for 3D object detection
1 version - Latest release: over 4 years ago - 1 dependent repositories - 66 downloads last month - 35 stars on GitHub - 1 maintainer
mhm 5.13.1
Python distribution of mHM with bindings.
3 versions - Latest release: over 1 year ago - 1 dependent package - 577 downloads last month - 238 stars on GitHub - 2 maintainers
radiopadre-client 1.2.0
Radiopadre client-side script
17 versions - Latest release: about 2 years ago - 1 dependent package - 1 dependent repositories - 586 downloads last month - 5 stars on GitHub - 3 maintainers
climateserv 1.0.8
This is a package to access the ClimateSERV API](https://climateserv.servirglobal.net/)
19 versions - Latest release: 7 months ago - 1 dependent repositories - 867 downloads last month - 8 stars on GitHub - 1 maintainer
chem-mat-database 1.0.0
Command Line Interface for projects
1 version - Latest release: 3 months ago - 54 downloads last month - 0 stars on GitHub - 1 maintainer
Top 8.3% on pypi.org
waymo-open-dataset-tf-2-5-0 1.4.1
Waymo Open Dataset libraries.
2 versions - Latest release: over 3 years ago - 33 dependent repositories - 450 downloads last month - 2 maintainers
nexadataset 0.1.1
数据集管理平台SDK
1 version - Latest release: about 18 hours ago - 1 maintainer
Top 1.6% on pypi.org
quandl 3.7.0
Package for quandl API access
62 versions - Latest release: over 3 years ago - 11 dependent packages - 303 dependent repositories - 123 thousand downloads last month - 1,375 stars on GitHub - 6 maintainers
Top 3.5% on pypi.org
torchxrayvision 1.3.3 💰
TorchXRayVision: A library of chest X-ray datasets and models
48 versions - Latest release: about 1 month ago - 1 dependent package - 31 dependent repositories - 6.89 thousand downloads last month - 868 stars on GitHub - 2 maintainers
classtree 0.0.3
A toolkit for hierarchical classification
3 versions - Latest release: 7 months ago - 86 downloads last month - 682 stars on GitHub - 2 maintainers
pydoda 1.2.1
A wrapper Python library for working with the DODa dataset
4 versions - Latest release: 8 months ago - 111 downloads last month - 2 stars on GitHub - 1 maintainer
continnum 0.0.1
A DataLoader library for Continual Learning in PyTorch.
1 version - Latest release: about 5 years ago - 54 downloads last month - 428 stars on GitHub - 1 maintainer
Top 0.3% on pypi.org
faker 37.1.0 💰
Faker is a Python package that generates fake data for you.
429 versions - Latest release: 25 days ago - 382 dependent packages - 15,807 dependent repositories - 23.9 million downloads last month - 17,558 stars on GitHub - 2 maintainers
edudata 0.0.18
This project aims to provide convenient interfaces for downloading and preprocessing dataset in e...
18 versions - Latest release: over 3 years ago - 1 dependent repositories - 704 downloads last month - 223 stars on GitHub - 2 maintainers
Top 1.3% on pypi.org
tensorflow-io-gcs-filesystem 0.37.1
TensorFlow IO
23 versions - Latest release: 10 months ago - 105 dependent packages - 5,470 dependent repositories - 9.27 million downloads last month - 690 stars on GitHub - 3 maintainers
Top 0.6% on pypi.org
tensorflow-datasets 4.9.8
tensorflow/datasets is a library of datasets ready to use with TensorFlow.
37 versions - Latest release: about 1 month ago - 116 dependent packages - 3,946 dependent repositories - 1.56 million downloads last month - 4,157 stars on GitHub - 8 maintainers
huggingface-text-data-analyzer 1.1.0
A comprehensive tool for analyzing text datasets from HuggingFace's datasets library
3 versions - Latest release: 4 months ago - 122 downloads last month - 6 stars on GitHub - 1 maintainer
patch-cli 0.2.16
Spin up analytics APIs over your data in minutes, without writing any code.
15 versions - Latest release: over 1 year ago - 555 downloads last month - 1 maintainer
hankshaw 2.0.0
Model for The Evolution of Cooperation by the Hankshaw Effect
3 versions - Latest release: over 9 years ago - 2 dependent repositories - 86 downloads last month - 1 stars on GitHub - 1 maintainer
flydenity 0.1.6
Flydenity is an aircraft callsign identification library. Parsers aircraft registration prefix to...
7 versions - Latest release: over 4 years ago - 2 dependent repositories - 887 downloads last month - 4 stars on GitHub - 1 maintainer
solidago 0.4.1 💰
A toolbox for Solid Algorithmic Governance
14 versions - Latest release: 20 days ago - 573 downloads last month - 348 stars on GitHub - 1 maintainer
dspp-keras 0.0.5
Integration of Database of structural propensities of proteins (dSPP) with Keras Machine Learning...
5 versions - Latest release: almost 8 years ago - 1 dependent repositories - 167 downloads last month - 166 stars on GitHub - 1 maintainer
sldatasets 0.0.1
A single library to (down)load all existing sign language video datasets.
1 version - Latest release: about 6 years ago - 1 dependent repositories - 51 downloads last month - 7 stars on GitHub - 1 maintainer
pexels-cli 0.1.0
A Simple CLI To Get Image With Specific Tag From Pexels
1 version - Latest release: over 3 years ago - 1 dependent repositories - 57 downloads last month - 8 stars on GitHub - 1 maintainer
Top 5.8% on pypi.org
pylabel 0.1.55 💰
Transform, analyze, and visualize computer vision annotations.
57 versions - Latest release: over 1 year ago - 1 dependent package - 3 dependent repositories - 18.9 thousand downloads last month - 317 stars on GitHub - 1 maintainer
pystoxx 0.1.2
Search and retrieve current data and historical information for publicly traded companies
11 versions - Latest release: over 2 years ago - 1 dependent repositories - 352 downloads last month - 0 stars on GitHub - 1 maintainer
Top 0.9% on pypi.org
pandas-datareader 0.10.0
Data readers extracted from the pandas codebase,should be compatible with recent pandas versions
22 versions - Latest release: almost 4 years ago - 73 dependent packages - 3,913 dependent repositories - 359 thousand downloads last month - 3,024 stars on GitHub - 2 maintainers
openfisca-uk-data 0.9.0 💰
A Python package to manage OpenFisca-UK-compatible microdata
20 versions - Latest release: about 3 years ago - 1 dependent repositories - 700 downloads last month - 0 stars on GitHub - 1 maintainer
Top 6.8% on pypi.org
cvat-cli 2.34.0
Command-line client for CVAT
68 versions - Latest release: 1 day ago - 1 dependent repositories - 5.14 thousand downloads last month - 11,417 stars on GitHub - 3 maintainers
datasetops 0.0.6
Fluent dataset operations, compatible with your favorite libraries
4 versions - Latest release: about 5 years ago - 4 dependent repositories - 178 downloads last month - 11 stars on GitHub - 1 maintainer
insperareader 0.1.2
JSON parsing of Inspera Assessment files
2 versions - Latest release: over 3 years ago - 1 dependent repositories - 73 downloads last month - 0 stars on GitHub - 1 maintainer
Top 3.1% on pypi.org
datumaro 1.10.0
Dataset Management Framework (Datumaro)
65 versions - Latest release: about 1 month ago - 7 dependent packages - 30 dependent repositories - 18.7 thousand downloads last month - 587 stars on GitHub - 2 maintainers
download-oscar 2.1
Downloading all files of a language from the OSCAR (Open Super-large Crawled Aggregated coRpus)
4 versions - Latest release: almost 3 years ago - 1 dependent repositories - 192 downloads last month - 10 stars on GitHub - 1 maintainer
admincer 1.2.0
Tool for managing datasets for visual ad detection
3 versions - Latest release: almost 5 years ago - 1 dependent repositories - 109 downloads last month - 1 maintainer
Top 6.9% on pypi.org
waymo-open-dataset-tf-2-1-0 1.3.1
Waymo Open Dataset libraries.
4 versions - Latest release: almost 4 years ago - 1 dependent package - 97 dependent repositories - 290 downloads last month - 2 maintainers
imagenetscraper 1.0.2
Bulk-download all thumbnails from an ImageNet synset, with optional rescaling
3 versions - Latest release: over 5 years ago - 6 dependent repositories - 143 downloads last month - 26 stars on GitHub - 1 maintainer
teex 1.1.3
A Toolbox for the Evaluation of Explanations
10 versions - Latest release: about 2 years ago - 1 dependent repositories - 369 downloads last month - 15 stars on GitHub - 1 maintainer
agi-env 0.2.7
AGI Env
37 versions - Latest release: 2 days ago - 2.65 thousand downloads last month - 2 stars on GitHub - 1 maintainer
agi-gui 0.2.7
AGI GUI
36 versions - Latest release: 2 days ago - 2.6 thousand downloads last month - 2 stars on GitHub - 1 maintainer
agilab 0.2.7
AGILAB a datascience IDE for engineering to explore AI
34 versions - Latest release: 2 days ago - 2.56 thousand downloads last month - 2 stars on GitHub - 1 maintainer
agi-core 0.2.7
agi-core a framework for AGI
37 versions - Latest release: 2 days ago - 2.62 thousand downloads last month - 2 stars on GitHub - 1 maintainer
dwcontents 1.0.0b5
Jupyter contents manager for data.world
5 versions - Latest release: about 7 years ago - 1 dependent repositories - 192 downloads last month - 1 stars on GitHub - 1 maintainer
gse 0.1.9
extract metadata and dataset from GEO Series Matrix format data
3 versions - Latest release: about 11 years ago - 2 dependent repositories - 124 downloads last month - 1 maintainer
bingoset 0.1.7
🎲 CLI Toolkit to quickly create image dataset using Bing Image Search API
8 versions - Latest release: almost 5 years ago - 1 dependent repositories - 363 downloads last month - 4 stars on GitHub - 1 maintainer
kaggle-dataset-creator 0.0.1
A Python package to generate csv/json from command line. It allows you to create CSV/JSON files b...
1 version - Latest release: almost 6 years ago - 1 dependent repositories - 49 downloads last month - 0 stars on GitHub - 1 maintainer
Top 5.2% on pypi.org
fuel 0.2.0
Data pipeline framework for machine learning
2 versions - Latest release: over 8 years ago - 32 dependent repositories - 1.2 thousand downloads last month - 869 stars on GitHub - 1 maintainer
wiker 0.0.1
Library for wikipedia dataset collection
1 version - Latest release: over 2 years ago - 46 downloads last month - 0 stars on GitHub - 1 maintainer
pynavernews 0.1.0
Naver News Scraper
1 version - Latest release: about 1 year ago - 66 downloads last month - 0 stars on GitHub - 1 maintainer
pycovjson 0.3.9
Create CovJSON files from common scientific data formats
8 versions - Latest release: over 8 years ago - 1 dependent repositories - 243 downloads last month - 13 stars on GitHub - 3 maintainers
google-trends-scraper 0.0.7
Google Trends Scraper makes scraping data from Google Trends incredibly easy, even formatting res...
5 versions - Latest release: about 7 years ago - 1 dependent repositories - 150 downloads last month - 5 stars on GitHub - 1 maintainer
pydrugsdatabase 0.1.0
Downloads the FDA drugs database along with the NDC codes active ingredients and drug classes
1 version - Latest release: 10 months ago - 25 downloads last month - 0 stars on gitlab.com - 1 maintainer
clloader 0.0.2
A DataLoader library for Continual Learning in PyTorch.
2 versions - Latest release: about 5 years ago - 82 downloads last month - 428 stars on GitHub - 1 maintainer
waymo-open-dataset-tf-2-2-0 1.3.1
Waymo Open Dataset libraries.
3 versions - Latest release: almost 4 years ago - 2 dependent packages - 5 dependent repositories - 333 downloads last month - 2 maintainers
pyautoplot 1.0.1
PyAutoPlot is an open-source Python library designed to make dataset analysis much easier by gene...
2 versions - Latest release: 3 months ago - 123 downloads last month - 0 stars on GitHub - 1 maintainer
Top 9.6% on pypi.org
path-dict 4.0.0
Extends Python's dict with useful extras
20 versions - Latest release: about 2 years ago - 1 dependent package - 3 dependent repositories - 6.26 thousand downloads last month - 25 stars on GitHub - 1 maintainer
ecko-cli 1.6.0
CLI tool that easily converts a directory of images into a dataset for training generative ai models
8 versions - Latest release: 5 months ago - 345 downloads last month - 1 stars on GitHub - 1 maintainer
spltr 0.3.2
A simple PyTorch-based data loader and splitter
3 versions - Latest release: over 5 years ago - 1 dependent repositories - 138 downloads last month - 1 stars on GitHub - 1 maintainer
rads 0.1.0
Python front end for the Radar Altimeter Database System.
2 versions - Latest release: over 5 years ago - 1 dependent repositories - 95 downloads last month - 2 stars on GitHub - 1 maintainer
graviti 0.13.1
Graviti Python SDK
56 versions - Latest release: about 2 years ago - 1 dependent repositories - 1.61 thousand downloads last month - 12 stars on GitHub - 1 maintainer
ecodatatk 0.0.1
Developed for limnological and hydrological studies
1 version - Latest release: almost 5 years ago - 1 dependent repositories - 56 downloads last month - 0 stars on GitHub - 1 maintainer
factuality 1.0.14
Benchmarking long-form factuality in large language models. Original code for our paper "Long-for...
4 versions - Latest release: about 1 year ago - 177 downloads last month - 575 stars on GitHub - 1 maintainer
dgit_extensions 0.1.3
dgit addons
2 versions - Latest release: about 9 years ago - 2 dependent repositories - 57 downloads last month - 0 stars on GitHub - 1 maintainer
chrome-fingerprints 1.1
A Collection of 10.000 self-collected Chrome Fingerprints. Wrapped in a easy-to-use API, availabl...
2 versions - Latest release: over 1 year ago - 1 dependent package - 1.53 thousand downloads last month - 220 stars on GitHub - 1 maintainer
target-datadotworld 1.0.1
Singer target for data.world
5 versions - Latest release: almost 7 years ago - 1 dependent repositories - 154 downloads last month - 5 stars on GitHub - 1 maintainer
soramimi-phonetic-search-dataset 0.0.9
音韻的類似性を考慮した検索システムの評価用データセット。替え歌の歌詞から構築された特定ジャンルの単語ペアを収録。
2 versions - Latest release: about 1 month ago - 123 downloads last month - 0 stars on GitHub - 1 maintainer
mldatasetbuilder 1.0.0
MLDatasetBuilder is a python package which is helping to prepare the image for your ML dataset.
10 versions - Latest release: almost 5 years ago - 1 dependent repositories - 260 downloads last month - 4 stars on GitHub - 1 maintainer
biobeee 0.0.5
Bioinformatics tool for performing web scrapping on biological database and pre-processing
3 versions - Latest release: over 1 year ago - 71 downloads last month - 0 stars on gitlab.com - 1 maintainer
idsprites 1.0.1
Easily generate simple continual learning benchmarks.
1 version - Latest release: 9 months ago - 65 downloads last month - 495 stars on GitHub - 1 maintainer
arrowtextclassifier 1.0.3
ArrowTextClassifier is a simple text classification tool written in pytorch that allows you to tr...
4 versions - Latest release: 12 months ago - 197 downloads last month - 1 maintainer
gps2var 0.1.0a1
Fast reading of geospatial variables by GPS coordinates
2 versions - Latest release: almost 3 years ago - 1 dependent repositories - 91 downloads last month - 7 stars on GitHub - 1 maintainer