Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "dataset" keyword

Top 1.0% on pypi.org
tfds-nightly 4.9.4.dev202401070044
tensorflow/datasets is a library of datasets ready to use with TensorFlow.
1,859 versions - Latest release: 5 months ago - 13 dependent packages - 296 dependent repositories - 1.35 million downloads last month - 4,085 stars on GitHub - 8 maintainers
Top 0.6% on pypi.org
tensorflow-datasets 4.9.4
tensorflow/datasets is a library of datasets ready to use with TensorFlow.
33 versions - Latest release: 5 months ago - 116 dependent packages - 3,946 dependent repositories - 4.36 million downloads last month - 4,085 stars on GitHub - 8 maintainers
Top 1.6% on pypi.org
quandl 3.7.0
Package for quandl API access
62 versions - Latest release: over 2 years ago - 11 dependent packages - 303 dependent repositories - 125 thousand downloads last month - 1,357 stars on GitHub - 6 maintainers
Top 1.4% on pypi.org
tensorflow-io 0.37.0
TensorFlow IO
44 versions - Latest release: 26 days ago - 19 dependent packages - 293 dependent repositories - 3.92 million downloads last month - 690 stars on GitHub - 6 maintainers
Top 5.0% on pypi.org
tensorflow-io-nightly 0.31.0.dev20230309180344
TensorFlow IO
902 versions - Latest release: about 1 year ago - 3 dependent repositories - 29.9 thousand downloads last month - 690 stars on GitHub - 5 maintainers
Top 2.1% on pypi.org
datalad 1.0.2
data distribution geared toward scientific datasets
114 versions - Latest release: about 1 month ago - 43 dependent packages - 78 dependent repositories - 18 thousand downloads last month - 492 stars on GitHub - 5 maintainers
Top 2.6% on pypi.org
mosaicml-streaming 0.7.6
Streaming lets users create PyTorch compatible datasets that can be streamed from cloud-based obj...
27 versions - Latest release: 16 days ago - 4 dependent packages - 61 dependent repositories - 61.5 thousand downloads last month - 703 stars on GitHub - 5 maintainers
Top 0.7% on pypi.org
torchtext 0.18.0
Text utilities, models, transforms, and datasets for PyTorch.
33 versions - Latest release: about 1 month ago - 92 dependent packages - 2,976 dependent repositories - 722 thousand downloads last month - 3,452 stars on GitHub - 4 maintainers
h3ds 0.4.0
Python interface for H3DS dataset
11 versions - Latest release: 5 months ago - 2 dependent repositories - 120 downloads last month - 119 stars on GitHub - 4 maintainers
Top 4.9% on pypi.org
retriever 3.1.0
Data Retriever
12 versions - Latest release: about 2 years ago - 3 dependent repositories - 5.51 thousand downloads last month - 302 stars on GitHub - 4 maintainers
Top 9.6% on pypi.org
frictionless-ckan-mapper 1.0.9
A library for mapping CKAN metadata <=> Frictionless metadata.
9 versions - Latest release: over 1 year ago - 3 dependent packages - 7 dependent repositories - 1.3 thousand downloads last month - 9 stars on GitHub - 4 maintainers
Top 1.2% on pypi.org
whylogs 1.4.0
Profile and monitor your ML data pipeline end-to-end
313 versions - Latest release: 13 days ago - 6 dependent packages - 413 dependent repositories - 474 thousand downloads last month - 2,482 stars on GitHub - 4 maintainers
stopes 1.0.1
Large-Scale Translation Data Mining.
2 versions - Latest release: almost 2 years ago - 1 dependent repositories - 33 downloads last month - 230 stars on GitHub - 4 maintainers
ckanext-privatedatasets 0.4.1
CKAN Extension - Private Datasets
5 versions - Latest release: about 4 years ago - 2 dependent repositories - 47 downloads last month - 18 stars on GitHub - 4 maintainers
Top 8.6% on pypi.org
fastdup 1.123
Fast tool for gaining insights from large image repositories.
332 versions - Latest release: 21 days ago - 1 dependent repositories - 3.97 thousand downloads last month - 1,410 stars on GitHub - 4 maintainers
Top 10.0% on pypi.org
tensorflow-io-plugin-gs-nightly 0.18.0.dev20210513213318
TensorFlow IO
29 versions - Latest release: about 3 years ago - 1 dependent package - 1 dependent repositories - 1.23 thousand downloads last month - 690 stars on GitHub - 4 maintainers
radiopadre 1.2.0
A data visualization framework for jupyter notebooks
19 versions - Latest release: about 1 year ago - 2 dependent repositories - 62 downloads last month - 10 stars on GitHub - 3 maintainers
radiopadre-client 1.2.0
Radiopadre client-side script
17 versions - Latest release: about 1 year ago - 1 dependent package - 1 dependent repositories - 340 downloads last month - 5 stars on GitHub - 3 maintainers
Top 1.3% on pypi.org
tensorflow-io-gcs-filesystem 0.37.0
TensorFlow IO
22 versions - Latest release: 26 days ago - 105 dependent packages - 5,470 dependent repositories - 14.4 million downloads last month - 690 stars on GitHub - 3 maintainers
Top 6.8% on pypi.org
cvat-cli 2.13.0
Command-line client for CVAT
35 versions - Latest release: 18 days ago - 1 dependent repositories - 1.46 thousand downloads last month - 11,417 stars on GitHub - 3 maintainers
waymo-open-dataset-tf-2-4-0 1.4.1
Waymo Open Dataset libraries.
3 versions - Latest release: over 2 years ago - 1 dependent package - 7 dependent repositories - 77 downloads last month - 3 maintainers
physionet 0.1.3
A collection of tools for working with the PhysioNet repository.
5 versions - Latest release: 12 months ago - 1 dependent repositories - 54 downloads last month - 67 stars on GitHub - 3 maintainers
pycovjson 0.3.9
Create CovJSON files from common scientific data formats
8 versions - Latest release: over 7 years ago - 1 dependent repositories - 43 downloads last month - 11 stars on GitHub - 3 maintainers
Top 4.7% on pypi.org
mirdata 0.3.8
Common loaders for MIR datasets.
46 versions - Latest release: 7 months ago - 3 dependent packages - 5 dependent repositories - 1.67 thousand downloads last month - 344 stars on GitHub - 3 maintainers
tqcli 0.3.0.21
TQCLI is the client application for using TranQuant services TranQuant is a data marketplace that...
10 versions - Latest release: over 7 years ago - 1 dependent repositories - 22 downloads last month - 0 stars on GitHub - 3 maintainers
Top 6.6% on pypi.org
scipp 24.2.0
Multi-dimensional data arrays with labeled dimensions
43 versions - Latest release: 3 months ago - 12 dependent packages - 3 dependent repositories - 8.51 thousand downloads last month - 107 stars on GitHub - 3 maintainers
cmem-plugin-kaggle 2.0.0
Import dataset resources from Kaggle.
4 versions - Latest release: 11 months ago - 45 downloads last month - 0 stars on GitHub - 3 maintainers
Top 4.9% on pypi.org
tensorflow-io-gcs-filesystem-nightly 0.31.0.dev20230309180344
TensorFlow IO
160 versions - Latest release: about 1 year ago - 1 dependent package - 3 dependent repositories - 9.33 thousand downloads last month - 690 stars on GitHub - 3 maintainers
Top 3.1% on pypi.org
datumaro 1.6.1
Dataset Management Framework (Datumaro)
57 versions - Latest release: 24 days ago - 7 dependent packages - 30 dependent repositories - 11.9 thousand downloads last month - 481 stars on GitHub - 3 maintainers
Top 1.8% on pypi.org
cvat-sdk 2.13.0
CVAT REST API
36 versions - Latest release: 18 days ago - 4 dependent packages - 54 dependent repositories - 14.6 thousand downloads last month - 11,417 stars on GitHub - 3 maintainers
genomic-benchmarks 0.0.9
Genomic Benchmarks
8 versions - Latest release: almost 2 years ago - 1 dependent repositories - 949 downloads last month - 87 stars on GitHub - 3 maintainers
Top 10.0% on pypi.org
mathematics-dataset 1.0.1
A synthetic dataset of school-level mathematics questions
2 versions - Latest release: about 5 years ago - 1 dependent repositories - 13 thousand downloads last month - 1,720 stars on GitHub - 2 maintainers
Top 0.9% on pypi.org
pandas-datareader 0.10.0
Data readers extracted from the pandas codebase,should be compatible with recent pandas versions
22 versions - Latest release: almost 3 years ago - 73 dependent packages - 3,913 dependent repositories - 467 thousand downloads last month - 2,811 stars on GitHub - 2 maintainers
Top 2.9% on pypi.org
dataprofiler 0.10.9
What is in your data? Detect schema, statistics and entities in almost any file.
54 versions - Latest release: 3 months ago - 2 dependent packages - 28 dependent repositories - 17 thousand downloads last month - 1,369 stars on GitHub - 2 maintainers
solidago 0.1.1 💰
Algorithms for Secure Algorithmic Governance
9 versions - Latest release: about 1 month ago - 128 downloads last month - 314 stars on GitHub - 2 maintainers
Top 5.0% on pypi.org
convokit 3.0.0
ConvoKit
64 versions - Latest release: 10 months ago - 15 dependent repositories - 2.93 thousand downloads last month - 512 stars on GitHub - 2 maintainers
flexible-fl 0.6.1
Federated Learning (FL) experiment simulation in Python.
4 versions - Latest release: 3 months ago - 4 dependent packages - 68 downloads last month - 11 stars on GitHub - 2 maintainers
edudata 0.0.18
This project aims to provide convenient interfaces for downloading and preprocessing dataset in e...
18 versions - Latest release: almost 3 years ago - 1 dependent repositories - 208 downloads last month - 200 stars on GitHub - 2 maintainers
Top 8.3% on pypi.org
waymo-open-dataset-tf-2-5-0 1.4.1
Waymo Open Dataset libraries.
2 versions - Latest release: over 2 years ago - 33 dependent repositories - 362 downloads last month - 2 maintainers
classtree 0.0.2
A toolkit for hierarchical classification
2 versions - Latest release: 5 months ago - 24 downloads last month - 682 stars on GitHub - 2 maintainers
ego4d 1.7.2
Ego4D Dataset CLI
17 versions - Latest release: about 1 month ago - 1 dependent repositories - 617 downloads last month - 288 stars on GitHub - 2 maintainers
Top 9.2% on pypi.org
datalabs 0.4.15
Datalabs
54 versions - Latest release: over 1 year ago - 3 dependent packages - 2 dependent repositories - 392 downloads last month - 125 stars on GitHub - 2 maintainers
cdp-patches 1.0
Patching CDP (Chrome DevTools Protocol) leaks on OS level. Easy to use with Playwright, Selenium,...
2 versions - Latest release: 2 months ago - 201 downloads last month - 25 stars on GitHub - 2 maintainers
Top 6.9% on pypi.org
waymo-open-dataset-tf-2-1-0 1.3.1
Waymo Open Dataset libraries.
4 versions - Latest release: about 3 years ago - 1 dependent package - 97 dependent repositories - 287 downloads last month - 2 maintainers
rubaialter 1.1.0b1
A module for altering numerical dataset's formats, made on top of Pandas.
1 version - Latest release: over 3 years ago - 1 dependent repositories - 17 downloads last month - 2 stars on GitHub - 2 maintainers
connectome 0.10.0
A library for datasets containing heterogeneous data
34 versions - Latest release: about 2 months ago - 1 dependent repositories - 246 downloads last month - 12 stars on GitHub - 2 maintainers
sdnist 2.3.0
SDNist: Deidentified Data Report Generator
10 versions - Latest release: 11 months ago - 1 dependent repositories - 233 downloads last month - 30 stars on GitHub - 2 maintainers
pandas-datacube 0.0.4
A package allowing to download datacubes into pandas data frames
4 versions - Latest release: about 1 year ago - 1 dependent repositories - 40 downloads last month - 3 stars on GitHub - 2 maintainers
Top 6.6% on pypi.org
flwr-datasets 0.1.0
Flower Datasets
3 versions - Latest release: 2 months ago - 7 dependent repositories - 4.28 thousand downloads last month - 3,924 stars on GitHub - 2 maintainers
mflux-ai 0.7.0
The Python client for MFlux.ai
10 versions - Latest release: over 4 years ago - 1 dependent repositories - 87 downloads last month - 1 stars on GitHub - 2 maintainers
cppe5 0.1.1 💰
A library to easily download, load and work with the CPPE-5 dataset.
2 versions - Latest release: about 1 year ago - 1 dependent repositories - 52 downloads last month - 64 stars on GitHub - 2 maintainers
pydax 0.2.0
Access DAX datasets.
4 versions - Latest release: almost 3 years ago - 1 dependent repositories - 29 downloads last month - 17 stars on GitHub - 2 maintainers
ckanext-datarequests 1.1.0
CKAN Extension - Data Requests
11 versions - Latest release: over 5 years ago - 2 dependent repositories - 104 downloads last month - 17 stars on GitHub - 2 maintainers
ua-datasets 0.1.1
A collection of ukrainian language datasets
11 versions - Latest release: 7 months ago - 1 dependent repositories - 57 downloads last month - 50 stars on GitHub - 2 maintainers
stadata 1.0.0
API for get all statistics data from BPS
6 versions - Latest release: 7 months ago - 62 downloads last month - 132 stars on GitHub - 2 maintainers
dlkp 0.0.1
A deep learning library for keyphrase extraction and generation
1 version - Latest release: over 2 years ago - 1 dependent repositories - 15 downloads last month - 25 stars on GitHub - 2 maintainers
Top 3.5% on pypi.org
torchxrayvision 1.2.3 💰
TorchXRayVision: A library of chest X-ray datasets and models
43 versions - Latest release: 13 days ago - 1 dependent package - 31 dependent repositories - 13.6 thousand downloads last month - 775 stars on GitHub - 2 maintainers
Top 9.1% on pypi.org
soundata 0.1.3
Python library for loading and working with sound datasets.
16 versions - Latest release: 4 months ago - 2 dependent repositories - 1.02 thousand downloads last month - 270 stars on GitHub - 2 maintainers
neptune-mlflow 1.1.1
neptune.ai MLflow integration library
17 versions - Latest release: 6 months ago - 2 dependent packages - 1 dependent repositories - 355 downloads last month - 30 stars on GitHub - 2 maintainers
ws-benchmark 1.1.1
a weak supervision learning benchmark
2 versions - Latest release: about 2 years ago - 1 dependent repositories - 36 downloads last month - 211 stars on GitHub - 2 maintainers
flexanomalies 0.0.2
Federated Learning (FL) experiment simulation in Python.
2 versions - Latest release: 2 months ago - 19 downloads last month - 11 stars on GitHub - 2 maintainers
medigan 1.0.0
medigan is a modular open-source Python library that provides an interface to multiple generative...
3 versions - Latest release: over 1 year ago - 1 dependent repositories - 76 downloads last month - 106 stars on GitHub - 2 maintainers
knowage-python 8.0.2
Web service for Knowage python widget and python dataset
6 versions - Latest release: over 2 years ago - 1 dependent repositories - 68 downloads last month - 388 stars on GitHub - 2 maintainers
Top 0.3% on pypi.org
faker 25.2.0 💰
Faker is a Python package that generates fake data for you.
361 versions - Latest release: 14 days ago - 382 dependent packages - 15,807 dependent repositories - 15.2 million downloads last month - 16,716 stars on GitHub - 2 maintainers
asreview-covid19 0.9.4
Covid-19 related datasets for ASReview
12 versions - Latest release: about 2 years ago - 1 dependent repositories - 149 downloads last month - 27 stars on GitHub - 2 maintainers
dao-scripts 1.2.2
"A tool to download data to monitor DAO activity"
26 versions - Latest release: 6 months ago - 1 dependent package - 360 downloads last month - 0 stars on GitHub - 2 maintainers
scitacean 24.4.0
High-level interface for SciCat
13 versions - Latest release: about 1 month ago - 1 dependent repositories - 195 downloads last month - 0 stars on GitHub - 2 maintainers
Top 9.4% on pypi.org
waymo-open-dataset-tf-2-3-0 1.3.1
Waymo Open Dataset libraries.
4 versions - Latest release: about 3 years ago - 1 dependent package - 4 dependent repositories - 70 downloads last month - 2 maintainers
adept-augmentations 0.1.1
A Python library aimed at adeptly, augmenting NLP training data.
3 versions - Latest release: about 1 year ago - 52 downloads last month - 54 stars on GitHub - 2 maintainers
smashed 0.21.5
SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields ext...
64 versions - Latest release: 8 months ago - 2 dependent packages - 1 dependent repositories - 11.4 thousand downloads last month - 30 stars on GitHub - 2 maintainers
vbench 0.1.1
Video generation benchmark
2 versions - Latest release: 4 months ago - 274 downloads last month - 259 stars on GitHub - 2 maintainers
mhm 5.13.1
Python distribution of mHM with bindings.
3 versions - Latest release: 9 months ago - 1 dependent package - 65 downloads last month - 214 stars on GitHub - 2 maintainers
Top 9.0% on pypi.org
opendatalab 0.0.10
OpenDataLab Python SDK
83 versions - Latest release: 10 months ago - 1 dependent package - 1 dependent repositories - 188 thousand downloads last month - 52 stars on GitHub - 2 maintainers
Top 9.9% on pypi.org
cfpq-data 4.0.3
Python package containing Graphs and Grammars for experimental analysis of Context-Free Path Quer...
10 versions - Latest release: 3 months ago - 15 dependent repositories - 1.89 thousand downloads last month - 9 stars on GitHub - 2 maintainers
waymo-open-dataset-tf-2-0-0 1.3.1
Waymo Open Dataset libraries.
4 versions - Latest release: about 3 years ago - 1 dependent package - 2 dependent repositories - 57 downloads last month - 2 maintainers
ocf-datapipes 3.3.24 💰
Pytorch Datapipes built for use in Open Climate Fix's forecasting work
245 versions - Latest release: 17 days ago - 3 dependent packages - 2.39 thousand downloads last month - 10 stars on GitHub - 2 maintainers
fedscale 1.0
FedScale: Benchmarking Model and System Performance of Federated Learning
2 versions - Latest release: about 2 months ago - 123 downloads last month - 361 stars on GitHub - 2 maintainers
Top 6.3% on pypi.org
continuum 1.2.7
A clean and simple library for Continual Learning in PyTorch.
51 versions - Latest release: over 1 year ago - 11 dependent repositories - 693 downloads last month - 400 stars on GitHub - 2 maintainers
waymo-open-dataset-tf-2-2-0 1.3.1
Waymo Open Dataset libraries.
3 versions - Latest release: about 3 years ago - 2 dependent packages - 5 dependent repositories - 175 downloads last month - 2 maintainers
Top 0.8% on pypi.org
fake-factory 9999.9.9 💰
The `fake-factory` package was deprecated on December 15th, 2016. Use the `Faker` package instead.
22 versions - Latest release: over 7 years ago - 7 dependent packages - 806 dependent repositories - 58.4 thousand downloads last month - 16,716 stars on GitHub - 2 maintainers
Top 5.2% on pypi.org
waymo-open-dataset-tf-2-6-0 1.4.9
Waymo Open Dataset libraries.
8 versions - Latest release: almost 2 years ago - 3 dependent packages - 12 dependent repositories - 628 downloads last month - 2 maintainers
friendly-data-registry 20220103
Schema registry for friendly_data
10 versions - Latest release: over 2 years ago - 1 dependent package - 1 dependent repositories - 80 downloads last month - 0 stars on GitHub - 2 maintainers
zengin-code 1.1.0.20240415 💰
bank codes and branch codes for Japanese.
161 versions - Latest release: about 1 month ago - 1 dependent repositories - 5.34 thousand downloads last month - 13 stars on GitHub - 2 maintainers
Top 8.7% on pypi.org
bio-embeddings-tape-proteins 0.5
Repostory of Protein Benchmarking and Modeling
2 versions - Latest release: almost 3 years ago - 2 dependent repositories - 237 downloads last month - 626 stars on GitHub - 2 maintainers
pystoxx 0.1.2
Search and retrieve current data and historical information for publicly traded companies
11 versions - Latest release: over 1 year ago - 1 dependent repositories - 66 downloads last month - 0 stars on GitHub - 1 maintainer
aroma 0.0.0a7
A library to prepare asynchronous time series datasets
3 versions - Latest release: about 1 year ago - 1 dependent repositories - 753 downloads last month - 1 stars on GitHub - 1 maintainer
pygta-data-collector 0.0.1
Collecting images for GTA sel-driving car project
1 version - Latest release: over 1 year ago - 9 downloads last month - 1 maintainer
hipose 0.7.1
Human whole-body pose estimation using MARG multi-sensor data.
2 versions - Latest release: over 2 years ago - 1 dependent repositories - 36 downloads last month - 53 stars on GitHub - 1 maintainer
nidsdata 0.0.3
NIDS Dataset
1 version - Latest release: almost 4 years ago - 1 dependent repositories - 22 downloads last month - 36 stars on GitHub - 1 maintainer
batchelor 0.5.8
A simple, yet effective batching system using threadpoolexecutor.
12 versions - Latest release: almost 4 years ago - 1 dependent repositories - 53 downloads last month - 0 stars on GitHub - 1 maintainer
dbrecord 1.1.4
sqlite based kv database using for big data IO.
10 versions - Latest release: almost 2 years ago - 1 dependent package - 3 dependent repositories - 166 downloads last month - 6 stars on GitHub - 1 maintainer
biasondemand 0.1.0
A Python package that generates synthetic datasets with different types of bias
1 version - Latest release: about 1 year ago - 17 downloads last month - 2 stars on GitHub - 1 maintainer
metabatch 0.9.0
MetaBatch: A micro-framework for efficient batching of tasks in PyTorch.
2 versions - Latest release: 11 months ago - 11 downloads last month - 0 stars on GitHub - 1 maintainer
rgmining-synthetic-dataset 0.9.3
A synthetic dataset for Review graph mining project
8 versions - Latest release: almost 7 years ago - 1 dependent repositories - 30 downloads last month - 3 stars on GitHub - 1 maintainer
parquet-dataset 0.0.1.dev4
Dataset for parquet group
3 versions - Latest release: about 2 years ago - 1 dependent repositories - 18 downloads last month - 3 stars on GitHub - 1 maintainer
classixclustering 1.2.5
Fast and explainable clustering based on sorting
117 versions - Latest release: about 2 months ago - 1 dependent repositories - 464 downloads last month - 81 stars on GitHub - 1 maintainer
nlp-dataset-readers 0.1.7
Dataset Readers for NLP
8 versions - Latest release: over 2 years ago - 1 dependent repositories - 35 downloads last month - 3 stars on GitHub - 1 maintainer
coco-merger 0.0.2
Python package which aims to merge 2 COCO .json files
2 versions - Latest release: over 1 year ago - 85 downloads last month - 37 stars on GitHub - 1 maintainer
ddd-dataset 0.1.2
Toolkit for Description Detection Dataset ($D^3$)
3 versions - Latest release: 2 months ago - 369 downloads last month - 91 stars on GitHub - 1 maintainer
skyimages 0.0.2
Downloading sky image datasets for pytorch applications
2 versions - Latest release: over 1 year ago - 17 downloads last month - 11 stars on GitHub - 1 maintainer