Ecosyste.ms: Packages
An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.
pypi.org "dataset" keyword
Top 0.3% on pypi.org
361 versions - Latest release: 14 days ago - 382 dependent packages - 15,807 dependent repositories - 15.2 million downloads last month - 16,716 stars on GitHub - 2 maintainers
faker 25.2.0 π°
Faker is a Python package that generates fake data for you.361 versions - Latest release: 14 days ago - 382 dependent packages - 15,807 dependent repositories - 15.2 million downloads last month - 16,716 stars on GitHub - 2 maintainers
Top 1.3% on pypi.org
22 versions - Latest release: 26 days ago - 105 dependent packages - 5,470 dependent repositories - 14.4 million downloads last month - 690 stars on GitHub - 3 maintainers
tensorflow-io-gcs-filesystem 0.37.0
TensorFlow IO22 versions - Latest release: 26 days ago - 105 dependent packages - 5,470 dependent repositories - 14.4 million downloads last month - 690 stars on GitHub - 3 maintainers
Top 0.6% on pypi.org
33 versions - Latest release: 5 months ago - 116 dependent packages - 3,946 dependent repositories - 4.36 million downloads last month - 4,085 stars on GitHub - 8 maintainers
tensorflow-datasets 4.9.4
tensorflow/datasets is a library of datasets ready to use with TensorFlow.33 versions - Latest release: 5 months ago - 116 dependent packages - 3,946 dependent repositories - 4.36 million downloads last month - 4,085 stars on GitHub - 8 maintainers
Top 1.4% on pypi.org
44 versions - Latest release: 26 days ago - 19 dependent packages - 293 dependent repositories - 3.92 million downloads last month - 690 stars on GitHub - 6 maintainers
tensorflow-io 0.37.0
TensorFlow IO44 versions - Latest release: 26 days ago - 19 dependent packages - 293 dependent repositories - 3.92 million downloads last month - 690 stars on GitHub - 6 maintainers
Top 1.0% on pypi.org
1,859 versions - Latest release: 5 months ago - 13 dependent packages - 296 dependent repositories - 1.35 million downloads last month - 4,085 stars on GitHub - 8 maintainers
tfds-nightly 4.9.4.dev202401070044
tensorflow/datasets is a library of datasets ready to use with TensorFlow.1,859 versions - Latest release: 5 months ago - 13 dependent packages - 296 dependent repositories - 1.35 million downloads last month - 4,085 stars on GitHub - 8 maintainers
Top 0.7% on pypi.org
33 versions - Latest release: about 1 month ago - 92 dependent packages - 2,976 dependent repositories - 722 thousand downloads last month - 3,452 stars on GitHub - 4 maintainers
torchtext 0.18.0
Text utilities, models, transforms, and datasets for PyTorch.33 versions - Latest release: about 1 month ago - 92 dependent packages - 2,976 dependent repositories - 722 thousand downloads last month - 3,452 stars on GitHub - 4 maintainers
Top 1.2% on pypi.org
313 versions - Latest release: 13 days ago - 6 dependent packages - 413 dependent repositories - 474 thousand downloads last month - 2,482 stars on GitHub - 4 maintainers
whylogs 1.4.0
Profile and monitor your ML data pipeline end-to-end313 versions - Latest release: 13 days ago - 6 dependent packages - 413 dependent repositories - 474 thousand downloads last month - 2,482 stars on GitHub - 4 maintainers
Top 0.9% on pypi.org
22 versions - Latest release: almost 3 years ago - 73 dependent packages - 3,913 dependent repositories - 467 thousand downloads last month - 2,811 stars on GitHub - 2 maintainers
pandas-datareader 0.10.0
Data readers extracted from the pandas codebase,should be compatible with recent pandas versions22 versions - Latest release: almost 3 years ago - 73 dependent packages - 3,913 dependent repositories - 467 thousand downloads last month - 2,811 stars on GitHub - 2 maintainers
Top 2.2% on pypi.org
119 versions - Latest release: about 2 months ago - 61 dependent packages - 227 dependent repositories - 318 thousand downloads last month - 326 stars on GitHub - 1 maintainer
pytest-cases 3.8.5
Separate test code from test cases in pytest.119 versions - Latest release: about 2 months ago - 61 dependent packages - 227 dependent repositories - 318 thousand downloads last month - 326 stars on GitHub - 1 maintainer
Top 3.9% on pypi.org
7 versions - Latest release: almost 2 years ago - 10 dependent packages - 316 dependent repositories - 218 thousand downloads last month - 44 stars on GitHub - 1 maintainer
fastdownload 0.0.7
A general purpose data downloading library.7 versions - Latest release: almost 2 years ago - 10 dependent packages - 316 dependent repositories - 218 thousand downloads last month - 44 stars on GitHub - 1 maintainer
Top 9.0% on pypi.org
83 versions - Latest release: 10 months ago - 1 dependent package - 1 dependent repositories - 188 thousand downloads last month - 52 stars on GitHub - 2 maintainers
opendatalab 0.0.10
OpenDataLab Python SDK83 versions - Latest release: 10 months ago - 1 dependent package - 1 dependent repositories - 188 thousand downloads last month - 52 stars on GitHub - 2 maintainers
Top 3.6% on pypi.org
10 versions - Latest release: 6 months ago - 5 dependent packages - 130 dependent repositories - 143 thousand downloads last month - 146 stars on GitHub - 1 maintainer
musdb 0.4.2
Python parser for the SIGSEP MUSDB18 dataset10 versions - Latest release: 6 months ago - 5 dependent packages - 130 dependent repositories - 143 thousand downloads last month - 146 stars on GitHub - 1 maintainer
Top 1.6% on pypi.org
62 versions - Latest release: over 2 years ago - 11 dependent packages - 303 dependent repositories - 125 thousand downloads last month - 1,357 stars on GitHub - 6 maintainers
quandl 3.7.0
Package for quandl API access62 versions - Latest release: over 2 years ago - 11 dependent packages - 303 dependent repositories - 125 thousand downloads last month - 1,357 stars on GitHub - 6 maintainers
Top 2.2% on pypi.org
23 versions - Latest release: 5 months ago - 23 dependent packages - 94 dependent repositories - 101 thousand downloads last month - 1,969 stars on GitHub - 1 maintainer
colour-science 0.4.4 π°
Colour Science for Python23 versions - Latest release: 5 months ago - 23 dependent packages - 94 dependent repositories - 101 thousand downloads last month - 1,969 stars on GitHub - 1 maintainer
Top 2.5% on pypi.org
29 versions - Latest release: 10 months ago - 9 dependent packages - 30 dependent repositories - 70.3 thousand downloads last month - 1,370 stars on GitHub - 1 maintainer
beir 2.0.0
A Heterogeneous Benchmark for Information Retrieval29 versions - Latest release: 10 months ago - 9 dependent packages - 30 dependent repositories - 70.3 thousand downloads last month - 1,370 stars on GitHub - 1 maintainer
iden 0.0.3
simple library to manage a dataset of shards to train machine learning models9 versions - Latest release: 2 months ago - 2 dependent packages - 63.7 thousand downloads last month - 0 stars on GitHub - 1 maintainer
Top 2.6% on pypi.org
27 versions - Latest release: 16 days ago - 4 dependent packages - 61 dependent repositories - 61.5 thousand downloads last month - 703 stars on GitHub - 5 maintainers
mosaicml-streaming 0.7.6
Streaming lets users create PyTorch compatible datasets that can be streamed from cloud-based obj...27 versions - Latest release: 16 days ago - 4 dependent packages - 61 dependent repositories - 61.5 thousand downloads last month - 703 stars on GitHub - 5 maintainers
Top 0.8% on pypi.org
22 versions - Latest release: over 7 years ago - 7 dependent packages - 806 dependent repositories - 58.4 thousand downloads last month - 16,716 stars on GitHub - 2 maintainers
fake-factory 9999.9.9 π°
The `fake-factory` package was deprecated on December 15th, 2016. Use the `Faker` package instead.22 versions - Latest release: over 7 years ago - 7 dependent packages - 806 dependent repositories - 58.4 thousand downloads last month - 16,716 stars on GitHub - 2 maintainers
Top 2.5% on pypi.org
17 versions - Latest release: 3 months ago - 8 dependent packages - 17 dependent repositories - 52.4 thousand downloads last month - 34 stars on GitHub - 1 maintainer
rdata 0.11.2
Read R datasets from Python.17 versions - Latest release: 3 months ago - 8 dependent packages - 17 dependent repositories - 52.4 thousand downloads last month - 34 stars on GitHub - 1 maintainer
Top 5.8% on pypi.org
25 versions - Latest release: about 1 month ago - 26 dependent repositories - 42.6 thousand downloads last month - 100 stars on GitHub - 1 maintainer
datadotworld 2.0.0
Python library for data.world25 versions - Latest release: about 1 month ago - 26 dependent repositories - 42.6 thousand downloads last month - 100 stars on GitHub - 1 maintainer
Top 1.6% on pypi.org
185 versions - Latest release: about 1 month ago - 1 dependent package - 39 dependent repositories - 41.8 thousand downloads last month - 15,269 stars on GitHub - 1 maintainer
label-studio 1.12.0
Label Studio annotation tool185 versions - Latest release: about 1 month ago - 1 dependent package - 39 dependent repositories - 41.8 thousand downloads last month - 15,269 stars on GitHub - 1 maintainer
Top 3.3% on pypi.org
18 versions - Latest release: about 2 years ago - 2 dependent packages - 8 dependent repositories - 41.3 thousand downloads last month - 778 stars on GitHub - 1 maintainer
names-dataset 3.1.0 π°
The python library to handle names18 versions - Latest release: about 2 years ago - 2 dependent packages - 8 dependent repositories - 41.3 thousand downloads last month - 778 stars on GitHub - 1 maintainer
Top 3.0% on pypi.org
87 versions - Latest release: 4 months ago - 2 dependent packages - 10 dependent repositories - 39.1 thousand downloads last month - 3,256 stars on GitHub - 1 maintainer
img2dataset 1.45.0
Easily turn a set of image urls to an image dataset87 versions - Latest release: 4 months ago - 2 dependent packages - 10 dependent repositories - 39.1 thousand downloads last month - 3,256 stars on GitHub - 1 maintainer
Top 3.3% on pypi.org
18 versions - Latest release: 28 days ago - 10 dependent packages - 21 dependent repositories - 31.5 thousand downloads last month - 296 stars on GitHub - 1 maintainer
ir-datasets 0.5.7
provides a common interface to many IR ad-hoc ranking benchmarks, training datasets, etc.18 versions - Latest release: 28 days ago - 10 dependent packages - 21 dependent repositories - 31.5 thousand downloads last month - 296 stars on GitHub - 1 maintainer
Top 7.1% on pypi.org
5 versions - Latest release: 5 months ago - 1 dependent package - 1 dependent repositories - 31 thousand downloads last month - 2,546 stars on GitHub - 1 maintainer
waymo-open-dataset-tf-2-11-0 1.6.1
Waymo Open Dataset5 versions - Latest release: 5 months ago - 1 dependent package - 1 dependent repositories - 31 thousand downloads last month - 2,546 stars on GitHub - 1 maintainer
Top 5.0% on pypi.org
902 versions - Latest release: about 1 year ago - 3 dependent repositories - 29.9 thousand downloads last month - 690 stars on GitHub - 5 maintainers
tensorflow-io-nightly 0.31.0.dev20230309180344
TensorFlow IO902 versions - Latest release: about 1 year ago - 3 dependent repositories - 29.9 thousand downloads last month - 690 stars on GitHub - 5 maintainers
Top 2.8% on pypi.org
12 versions - Latest release: over 2 years ago - 4 dependent packages - 91 dependent repositories - 26.4 thousand downloads last month - 406 stars on GitHub - 1 maintainer
split-folders 0.5.1
Split folders with files (e.g. images) into training, validation and test (dataset) folders.12 versions - Latest release: over 2 years ago - 4 dependent packages - 91 dependent repositories - 26.4 thousand downloads last month - 406 stars on GitHub - 1 maintainer
stringzilla 3.8.3
SIMD-accelerated string search, sort, hashes, fingerprints, & edit distances37 versions - Latest release: about 1 month ago - 1 dependent package - 23.8 thousand downloads last month - 1,749 stars on GitHub - 1 maintainer
Top 5.4% on pypi.org
52 versions - Latest release: 12 days ago - 1 dependent package - 21 dependent repositories - 21.5 thousand downloads last month - 127 stars on GitHub - 1 maintainer
cpi 1.1.5 π°
Quickly adjust U.S. dollars for inflation using the Consumer Price Index (CPI)52 versions - Latest release: 12 days ago - 1 dependent package - 21 dependent repositories - 21.5 thousand downloads last month - 127 stars on GitHub - 1 maintainer
Top 3.0% on pypi.org
19 versions - Latest release: 9 months ago - 1 dependent package - 33 dependent repositories - 19.6 thousand downloads last month - 837 stars on GitHub - 1 maintainer
tfrecord 1.14.4
TFRecord reader19 versions - Latest release: 9 months ago - 1 dependent package - 33 dependent repositories - 19.6 thousand downloads last month - 837 stars on GitHub - 1 maintainer
wiz-craft 1.1.1
A CLI-based dataset preprocessing tool for machine learning tasks. Features include data explorat...6 versions - Latest release: 7 months ago - 18.3 thousand downloads last month - 18 stars on GitHub - 1 maintainer
Top 2.1% on pypi.org
114 versions - Latest release: about 1 month ago - 43 dependent packages - 78 dependent repositories - 18 thousand downloads last month - 492 stars on GitHub - 5 maintainers
datalad 1.0.2
data distribution geared toward scientific datasets114 versions - Latest release: about 1 month ago - 43 dependent packages - 78 dependent repositories - 18 thousand downloads last month - 492 stars on GitHub - 5 maintainers
Top 2.9% on pypi.org
54 versions - Latest release: 3 months ago - 2 dependent packages - 28 dependent repositories - 17 thousand downloads last month - 1,369 stars on GitHub - 2 maintainers
dataprofiler 0.10.9
What is in your data? Detect schema, statistics and entities in almost any file.54 versions - Latest release: 3 months ago - 2 dependent packages - 28 dependent repositories - 17 thousand downloads last month - 1,369 stars on GitHub - 2 maintainers
Top 1.8% on pypi.org
36 versions - Latest release: 18 days ago - 4 dependent packages - 54 dependent repositories - 14.6 thousand downloads last month - 11,417 stars on GitHub - 3 maintainers
cvat-sdk 2.13.0
CVAT REST API36 versions - Latest release: 18 days ago - 4 dependent packages - 54 dependent repositories - 14.6 thousand downloads last month - 11,417 stars on GitHub - 3 maintainers
Top 7.2% on pypi.org
25 versions - Latest release: 7 months ago - 3 dependent packages - 8 dependent repositories - 14.1 thousand downloads last month - 62 stars on GitHub - 1 maintainer
xarray-dataclasses 1.7.0
xarray data creation made easy by dataclass25 versions - Latest release: 7 months ago - 3 dependent packages - 8 dependent repositories - 14.1 thousand downloads last month - 62 stars on GitHub - 1 maintainer
Top 3.5% on pypi.org
43 versions - Latest release: 13 days ago - 1 dependent package - 31 dependent repositories - 13.6 thousand downloads last month - 775 stars on GitHub - 2 maintainers
torchxrayvision 1.2.3 π°
TorchXRayVision: A library of chest X-ray datasets and models43 versions - Latest release: 13 days ago - 1 dependent package - 31 dependent repositories - 13.6 thousand downloads last month - 775 stars on GitHub - 2 maintainers
Top 10.0% on pypi.org
2 versions - Latest release: about 5 years ago - 1 dependent repositories - 13 thousand downloads last month - 1,720 stars on GitHub - 2 maintainers
mathematics-dataset 1.0.1
A synthetic dataset of school-level mathematics questions2 versions - Latest release: about 5 years ago - 1 dependent repositories - 13 thousand downloads last month - 1,720 stars on GitHub - 2 maintainers
Top 3.1% on pypi.org
57 versions - Latest release: 24 days ago - 7 dependent packages - 30 dependent repositories - 11.9 thousand downloads last month - 481 stars on GitHub - 3 maintainers
datumaro 1.6.1
Dataset Management Framework (Datumaro)57 versions - Latest release: 24 days ago - 7 dependent packages - 30 dependent repositories - 11.9 thousand downloads last month - 481 stars on GitHub - 3 maintainers
smashed 0.21.5
SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields ext...64 versions - Latest release: 8 months ago - 2 dependent packages - 1 dependent repositories - 11.4 thousand downloads last month - 30 stars on GitHub - 2 maintainers
Top 6.2% on pypi.org
56 versions - Latest release: 6 months ago - 5 dependent packages - 24 dependent repositories - 10.3 thousand downloads last month - 18 stars on GitHub - 1 maintainer
randfacts 0.21.0 π°
Package to generate random facts56 versions - Latest release: 6 months ago - 5 dependent packages - 24 dependent repositories - 10.3 thousand downloads last month - 18 stars on GitHub - 1 maintainer
Top 4.0% on pypi.org
31 versions - Latest release: 10 months ago - 7 dependent repositories - 10.1 thousand downloads last month - 8,436 stars on GitHub - 1 maintainer
doccano 1.8.4 π°
doccano, text annotation tool for machine learning practitioners31 versions - Latest release: 10 months ago - 7 dependent repositories - 10.1 thousand downloads last month - 8,436 stars on GitHub - 1 maintainer
Top 7.7% on pypi.org
2 versions - Latest release: about 3 years ago - 1 dependent repositories - 10 thousand downloads last month - 634 stars on GitHub - 1 maintainer
neuspell 1.0.0
NeuSpell: A Neural Spelling Correction Toolkit2 versions - Latest release: about 3 years ago - 1 dependent repositories - 10 thousand downloads last month - 634 stars on GitHub - 1 maintainer
Top 4.4% on pypi.org
10 versions - Latest release: 4 months ago - 3 dependent packages - 18 dependent repositories - 9.97 thousand downloads last month - 977 stars on GitHub - 1 maintainer
medmnist 3.0.1
MedMNIST: 18 MNIST-like Datasets for 2D and 3D Biomedical Image Classification10 versions - Latest release: 4 months ago - 3 dependent packages - 18 dependent repositories - 9.97 thousand downloads last month - 977 stars on GitHub - 1 maintainer
pytest-dataset 0.3.2
Plugin for loading different datasets for pytest by prefix from json or yaml files4 versions - Latest release: 9 months ago - 9.63 thousand downloads last month - 3 stars on GitHub - 1 maintainer
Top 4.9% on pypi.org
160 versions - Latest release: about 1 year ago - 1 dependent package - 3 dependent repositories - 9.33 thousand downloads last month - 690 stars on GitHub - 3 maintainers
tensorflow-io-gcs-filesystem-nightly 0.31.0.dev20230309180344
TensorFlow IO160 versions - Latest release: about 1 year ago - 1 dependent package - 3 dependent repositories - 9.33 thousand downloads last month - 690 stars on GitHub - 3 maintainers
Top 6.6% on pypi.org
43 versions - Latest release: 3 months ago - 12 dependent packages - 3 dependent repositories - 8.51 thousand downloads last month - 107 stars on GitHub - 3 maintainers
scipp 24.2.0
Multi-dimensional data arrays with labeled dimensions43 versions - Latest release: 3 months ago - 12 dependent packages - 3 dependent repositories - 8.51 thousand downloads last month - 107 stars on GitHub - 3 maintainers
arctix 0.0.5
A library to get a text summary of nested objects12 versions - Latest release: 13 days ago - 6.38 thousand downloads last month - 0 stars on GitHub - 1 maintainer
Top 5.0% on pypi.org
86 versions - Latest release: 4 months ago - 3 dependent repositories - 5.68 thousand downloads last month - 2,163 stars on GitHub - 1 maintainer
clip-retrieval 2.44.0
Easily computing clip embeddings and building a clip retrieval system with them86 versions - Latest release: 4 months ago - 3 dependent repositories - 5.68 thousand downloads last month - 2,163 stars on GitHub - 1 maintainer
Top 9.4% on pypi.org
850 versions - Latest release: about 2 months ago - 1 dependent repositories - 5.53 thousand downloads last month - 328 stars on GitHub - 1 maintainer
torchdatasets-nightly 1711929801
PyTorch based library focused on data processing and input pipelines in general.850 versions - Latest release: about 2 months ago - 1 dependent repositories - 5.53 thousand downloads last month - 328 stars on GitHub - 1 maintainer
Top 4.9% on pypi.org
12 versions - Latest release: about 2 years ago - 3 dependent repositories - 5.51 thousand downloads last month - 302 stars on GitHub - 4 maintainers
retriever 3.1.0
Data Retriever12 versions - Latest release: about 2 years ago - 3 dependent repositories - 5.51 thousand downloads last month - 302 stars on GitHub - 4 maintainers
Top 9.8% on pypi.org
12 versions - Latest release: 11 months ago - 1 dependent package - 1 dependent repositories - 5.35 thousand downloads last month - 54 stars on GitHub - 1 maintainer
synergy-dataset 1.0.3
Python package for the SYNERGY dataset12 versions - Latest release: 11 months ago - 1 dependent package - 1 dependent repositories - 5.35 thousand downloads last month - 54 stars on GitHub - 1 maintainer
zengin-code 1.1.0.20240415 π°
bank codes and branch codes for Japanese.161 versions - Latest release: about 1 month ago - 1 dependent repositories - 5.34 thousand downloads last month - 13 stars on GitHub - 2 maintainers
Top 3.5% on pypi.org
31 versions - Latest release: about 1 year ago - 2 dependent packages - 5 dependent repositories - 4.58 thousand downloads last month - 10,900 stars on GitHub - 1 maintainer
pix2tex 0.1.2
pix2tex: Using a ViT to convert images of equations into LaTeX code.31 versions - Latest release: about 1 year ago - 2 dependent packages - 5 dependent repositories - 4.58 thousand downloads last month - 10,900 stars on GitHub - 1 maintainer
Top 4.5% on pypi.org
27 versions - Latest release: 10 months ago - 4 dependent packages - 8 dependent repositories - 4.44 thousand downloads last month - 1 maintainer
sapien 2.2.2
['SAPIEN: A SimulAted Parted based Interactive ENvironment']27 versions - Latest release: 10 months ago - 4 dependent packages - 8 dependent repositories - 4.44 thousand downloads last month - 1 maintainer
Top 6.6% on pypi.org
3 versions - Latest release: 2 months ago - 7 dependent repositories - 4.28 thousand downloads last month - 3,924 stars on GitHub - 2 maintainers
flwr-datasets 0.1.0
Flower Datasets3 versions - Latest release: 2 months ago - 7 dependent repositories - 4.28 thousand downloads last month - 3,924 stars on GitHub - 2 maintainers
Top 8.5% on pypi.org
131 versions - Latest release: 13 days ago - 1 dependent package - 3 dependent repositories - 4.21 thousand downloads last month - 20 stars on GitHub - 1 maintainer
segments-ai 1.8.1
Segments.ai Python SDK131 versions - Latest release: 13 days ago - 1 dependent package - 3 dependent repositories - 4.21 thousand downloads last month - 20 stars on GitHub - 1 maintainer
Top 8.6% on pypi.org
332 versions - Latest release: 21 days ago - 1 dependent repositories - 3.97 thousand downloads last month - 1,410 stars on GitHub - 4 maintainers
fastdup 1.123
Fast tool for gaining insights from large image repositories.332 versions - Latest release: 21 days ago - 1 dependent repositories - 3.97 thousand downloads last month - 1,410 stars on GitHub - 4 maintainers
recognizer 1.4
π¦Gracefully face reCAPTCHA challenge with ultralytics YOLOv8-seg, CLIPs VIT-B/16 and CLIP-Seg/RD6...4 versions - Latest release: about 2 months ago - 1 dependent package - 3.36 thousand downloads last month - 61 stars on GitHub - 1 maintainer
Top 9.6% on pypi.org
20 versions - Latest release: about 1 year ago - 1 dependent package - 3 dependent repositories - 3.04 thousand downloads last month - 22 stars on GitHub - 1 maintainer
path-dict 4.0.0
Extends Python's dict with useful extras20 versions - Latest release: about 1 year ago - 1 dependent package - 3 dependent repositories - 3.04 thousand downloads last month - 22 stars on GitHub - 1 maintainer
lakeapi 0.14.0
API for accessing Lake crypto market data28 versions - Latest release: 25 days ago - 1 dependent repositories - 2.97 thousand downloads last month - 19 stars on GitHub - 1 maintainer
Top 5.0% on pypi.org
64 versions - Latest release: 10 months ago - 15 dependent repositories - 2.93 thousand downloads last month - 512 stars on GitHub - 2 maintainers
convokit 3.0.0
ConvoKit64 versions - Latest release: 10 months ago - 15 dependent repositories - 2.93 thousand downloads last month - 512 stars on GitHub - 2 maintainers
Top 6.3% on pypi.org
15 versions - Latest release: 12 months ago - 1 dependent package - 3 dependent repositories - 2.93 thousand downloads last month - 77 stars on GitHub - 1 maintainer
doccano-client 1.2.8
A simple client for doccano API.15 versions - Latest release: 12 months ago - 1 dependent package - 3 dependent repositories - 2.93 thousand downloads last month - 77 stars on GitHub - 1 maintainer
Top 5.8% on pypi.org
57 versions - Latest release: 6 months ago - 1 dependent package - 3 dependent repositories - 2.71 thousand downloads last month - 297 stars on GitHub - 1 maintainer
pylabel 0.1.55 π°
Transform, analyze, and visualize computer vision annotations.57 versions - Latest release: 6 months ago - 1 dependent package - 3 dependent repositories - 2.71 thousand downloads last month - 297 stars on GitHub - 1 maintainer
sgs 2.1.1
Python wrapper para o webservice do SGS - Sistema Gerenciador de Series Temporais do Banco Centra...30 versions - Latest release: over 2 years ago - 1 dependent repositories - 2.67 thousand downloads last month - 72 stars on GitHub - 1 maintainer
chrome-fingerprints 1.1
A Collection of 10.000 self-collected Chrome Fingerprints. Wrapped in a easy-to-use API, availabl...2 versions - Latest release: 6 months ago - 1 dependent package - 2.64 thousand downloads last month - 69 stars on GitHub - 1 maintainer
Top 7.4% on pypi.org
23 versions - Latest release: about 1 month ago - 1 dependent package - 3 dependent repositories - 2.59 thousand downloads last month - 139 stars on GitHub - 1 maintainer
mtdata 0.4.1
mtdata is a tool to download datasets for machine translation23 versions - Latest release: about 1 month ago - 1 dependent package - 3 dependent repositories - 2.59 thousand downloads last month - 139 stars on GitHub - 1 maintainer
cc-net 1.0.0
Tools to download and clean Common Crawl2 versions - Latest release: over 3 years ago - 1 dependent repositories - 2.53 thousand downloads last month - 866 stars on GitHub - 1 maintainer
Top 8.9% on pypi.org
38 versions - Latest release: 11 days ago - 1 dependent package - 4 dependent repositories - 2.39 thousand downloads last month - 20 stars on GitHub - 1 maintainer
audb 1.7.2
Load and publish databases in audformat38 versions - Latest release: 11 days ago - 1 dependent package - 4 dependent repositories - 2.39 thousand downloads last month - 20 stars on GitHub - 1 maintainer
ocf-datapipes 3.3.24 π°
Pytorch Datapipes built for use in Open Climate Fix's forecasting work245 versions - Latest release: 17 days ago - 3 dependent packages - 2.39 thousand downloads last month - 10 stars on GitHub - 2 maintainers
Top 9.9% on pypi.org
10 versions - Latest release: 3 months ago - 15 dependent repositories - 1.89 thousand downloads last month - 9 stars on GitHub - 2 maintainers
cfpq-data 4.0.3
Python package containing Graphs and Grammars for experimental analysis of Context-Free Path Quer...10 versions - Latest release: 3 months ago - 15 dependent repositories - 1.89 thousand downloads last month - 9 stars on GitHub - 2 maintainers
Top 3.8% on pypi.org
27 versions - Latest release: 8 months ago - 3 dependent packages - 21 dependent repositories - 1.74 thousand downloads last month - 523 stars on GitHub - 1 maintainer
cryptocmd 0.6.4 π°
Cryptocurrency historical market price data scrapper.27 versions - Latest release: 8 months ago - 3 dependent packages - 21 dependent repositories - 1.74 thousand downloads last month - 523 stars on GitHub - 1 maintainer
emnist 0.0
Extended MNIST - Python Package1 version - Latest release: about 5 years ago - 22 dependent repositories - 1.7 thousand downloads last month - 6 stars on GitHub - 1 maintainer
Top 4.2% on pypi.org
5 versions - Latest release: over 2 years ago - 1 dependent package - 15 dependent repositories - 1.67 thousand downloads last month - 626 stars on GitHub - 1 maintainer
tape-proteins 0.5
Repostory of Protein Benchmarking and Modeling5 versions - Latest release: over 2 years ago - 1 dependent package - 15 dependent repositories - 1.67 thousand downloads last month - 626 stars on GitHub - 1 maintainer
Top 4.7% on pypi.org
46 versions - Latest release: 7 months ago - 3 dependent packages - 5 dependent repositories - 1.67 thousand downloads last month - 344 stars on GitHub - 3 maintainers
mirdata 0.3.8
Common loaders for MIR datasets.46 versions - Latest release: 7 months ago - 3 dependent packages - 5 dependent repositories - 1.67 thousand downloads last month - 344 stars on GitHub - 3 maintainers
Top 3.5% on pypi.org
10 versions - Latest release: almost 2 years ago - 1 dependent package - 17 dependent repositories - 1.47 thousand downloads last month - 3,063 stars on GitHub - 1 maintainer
trdg 1.8.0 π°
TextRecognitionDataGenerator: A synthetic data generator for text recognition10 versions - Latest release: almost 2 years ago - 1 dependent package - 17 dependent repositories - 1.47 thousand downloads last month - 3,063 stars on GitHub - 1 maintainer
Top 6.8% on pypi.org
35 versions - Latest release: 18 days ago - 1 dependent repositories - 1.46 thousand downloads last month - 11,417 stars on GitHub - 3 maintainers
cvat-cli 2.13.0
Command-line client for CVAT35 versions - Latest release: 18 days ago - 1 dependent repositories - 1.46 thousand downloads last month - 11,417 stars on GitHub - 3 maintainers
Top 8.8% on pypi.org
9 versions - Latest release: almost 3 years ago - 3 dependent repositories - 1.44 thousand downloads last month - 39 stars on GitHub - 1 maintainer
wooldridge 0.4.4
Data sets from Introductory Econometrics: A Modern Approach (6th ed, J.M. Wooldridge)9 versions - Latest release: almost 3 years ago - 3 dependent repositories - 1.44 thousand downloads last month - 39 stars on GitHub - 1 maintainer
Top 9.6% on pypi.org
9 versions - Latest release: over 1 year ago - 3 dependent packages - 7 dependent repositories - 1.3 thousand downloads last month - 9 stars on GitHub - 4 maintainers
frictionless-ckan-mapper 1.0.9
A library for mapping CKAN metadata <=> Frictionless metadata.9 versions - Latest release: over 1 year ago - 3 dependent packages - 7 dependent repositories - 1.3 thousand downloads last month - 9 stars on GitHub - 4 maintainers
open-mastr 0.14.3
A package that provides an interface for downloading and processing the data of the Marktstammdat...14 versions - Latest release: about 1 month ago - 1.27 thousand downloads last month - 65 stars on GitHub - 1 maintainer
Top 10.0% on pypi.org
29 versions - Latest release: about 3 years ago - 1 dependent package - 1 dependent repositories - 1.23 thousand downloads last month - 690 stars on GitHub - 4 maintainers
tensorflow-io-plugin-gs-nightly 0.18.0.dev20210513213318
TensorFlow IO29 versions - Latest release: about 3 years ago - 1 dependent package - 1 dependent repositories - 1.23 thousand downloads last month - 690 stars on GitHub - 4 maintainers
moviechat 0.6.3
Long video understanding10 versions - Latest release: about 1 month ago - 1.13 thousand downloads last month - 408 stars on GitHub - 1 maintainer
globox 2.4.5
Globox is a package and command line interface to read and convert object detection databases (CO...20 versions - Latest release: about 1 month ago - 1 dependent repositories - 1.12 thousand downloads last month - 149 stars on GitHub - 1 maintainer
video2dataset 1.3.0
Easily create large video dataset from video urls4 versions - Latest release: 4 months ago - 1.04 thousand downloads last month - 449 stars on GitHub - 1 maintainer
Top 9.1% on pypi.org
16 versions - Latest release: 4 months ago - 2 dependent repositories - 1.02 thousand downloads last month - 270 stars on GitHub - 2 maintainers
soundata 0.1.3
Python library for loading and working with sound datasets.16 versions - Latest release: 4 months ago - 2 dependent repositories - 1.02 thousand downloads last month - 270 stars on GitHub - 2 maintainers
waymo-open-dataset-tf-2-12-0 1.6.4
Waymo Open Dataset3 versions - Latest release: about 2 months ago - 1.01 thousand downloads last month - 2,551 stars on GitHub - 1 maintainer
genomic-benchmarks 0.0.9
Genomic Benchmarks8 versions - Latest release: almost 2 years ago - 1 dependent repositories - 949 downloads last month - 87 stars on GitHub - 3 maintainers
Top 9.6% on pypi.org
32 versions - Latest release: 4 months ago - 4 dependent packages - 5 dependent repositories - 946 downloads last month - 48 stars on GitHub - 1 maintainer
crowsetta 5.0.2
A Python tool to work with any format for annotating animal vocalizations and bioacoustics data32 versions - Latest release: 4 months ago - 4 dependent packages - 5 dependent repositories - 946 downloads last month - 48 stars on GitHub - 1 maintainer
cc2dataset 1.5.0
Easily convert common crawl to image caption set using pyspark3 versions - Latest release: 11 months ago - 884 downloads last month - 292 stars on GitHub - 1 maintainer
datasetrising 1.0.4
Toolchain for creating and training Stable Diffusion models with custom datasets86 versions - Latest release: 6 months ago - 802 downloads last month - 11 stars on GitHub - 1 maintainer
torchvideo 0.0.1
PyTorch video dataset library2 versions - Latest release: about 3 years ago - 1 dependent repositories - 795 downloads last month - 82 stars on GitHub - 1 maintainer
aroma 0.0.0a7
A library to prepare asynchronous time series datasets3 versions - Latest release: about 1 year ago - 1 dependent repositories - 753 downloads last month - 1 stars on GitHub - 1 maintainer
Top 9.6% on pypi.org
74 versions - Latest release: over 1 year ago - 1 dependent package - 2 dependent repositories - 742 downloads last month - 75 stars on GitHub - 1 maintainer
tensorbay 1.24.2
Graviti TensorBay Python SDK74 versions - Latest release: over 1 year ago - 1 dependent package - 2 dependent repositories - 742 downloads last month - 75 stars on GitHub - 1 maintainer
extra-keras-datasets 1.2.0 π°
Extending the Keras Datasets module with extra ones.13 versions - Latest release: over 3 years ago - 3 dependent repositories - 737 downloads last month - 31 stars on GitHub - 1 maintainer
Top 6.7% on pypi.org
21 versions - Latest release: over 4 years ago - 2 dependent packages - 17 dependent repositories - 715 downloads last month - 98 stars on GitHub - 1 maintainer
pytreebank 0.2.7
Python package for loading Stanford Sentiment Treebank corpus21 versions - Latest release: over 4 years ago - 2 dependent packages - 17 dependent repositories - 715 downloads last month - 98 stars on GitHub - 1 maintainer
Top 5.2% on pypi.org
2 versions - Latest release: over 7 years ago - 32 dependent repositories - 708 downloads last month - 867 stars on GitHub - 1 maintainer
fuel 0.2.0
Data pipeline framework for machine learning2 versions - Latest release: over 7 years ago - 32 dependent repositories - 708 downloads last month - 867 stars on GitHub - 1 maintainer
cross 1.0.3
Tool to cross CSV/TSV datasets4 versions - Latest release: over 7 years ago - 5 dependent repositories - 700 downloads last month - 1 stars on GitHub - 1 maintainer
Top 6.3% on pypi.org
51 versions - Latest release: over 1 year ago - 11 dependent repositories - 693 downloads last month - 400 stars on GitHub - 2 maintainers
continuum 1.2.7
A clean and simple library for Continual Learning in PyTorch.51 versions - Latest release: over 1 year ago - 11 dependent repositories - 693 downloads last month - 400 stars on GitHub - 2 maintainers
datamaestro-text 2024.3.10
Datamaestro module for text-related datasets65 versions - Latest release: 3 months ago - 1 dependent package - 1 dependent repositories - 684 downloads last month - 3 stars on GitHub - 1 maintainer
starwhale-bootstrap 0.2.2b6
MLOps Platform65 versions - Latest release: almost 2 years ago - 1 dependent repositories - 672 downloads last month - 187 stars on GitHub - 1 maintainer
nlprep 0.2.1
Download and pre-processing data for nlp tasks70 versions - Latest release: almost 3 years ago - 1 dependent repositories - 635 downloads last month - 28 stars on GitHub - 1 maintainer
Related Keywords
python
167
machine-learning
109
deep-learning
96
data
86
datasets
66
pytorch
65
learning
50
tensorflow
44
data-science
42
machine
40
nlp
39
computer-vision
33
python3
32
natural-language-processing
30
machine learning
28
ai
25
csv
20
image
20
object-detection
18
annotation
17
pandas
17
llm
16
classification
16
database
15
cli
15
dataset-generation
15
json
15
data-analysis
15
benchmark
15
models
13
preprocessing
13
images
13
segmentation
13
driving
12
autonomous
12
deep
12
ml
11
numpy
11
pypi
11
library
11
text-classification
11
dataloader
11
metadata
11
huggingface
11
download
11
training
10
visualization
10
api
10
package
10
mlops
10
image-classification
10
deep learning
10
image-processing
10
data-mining
10
corpus
10
test
10
NLP
9
neural-networks
9
coco
9
python-package
9
large-language-models
9
annotation-tool
9
annotations
9
analytics
9
labeling-tool
9
science
9
labeling
9
bert
8
torch
8
computer vision
8
natural language processing
8
yolo
8
audio
8
generator
8
testing
8
streaming
8
jax
8
federated-learning
8
bioinformatics
8
dataset-manager
8
hacktoberfest
7
downloader
7
scraper
7
fake
7
data-labeling
7
datascience
7
ocr
7
keras
7
sentiment-analysis
7
statistics
7
multimodal
7
semantic-segmentation
7
scraping
7
clustering
7
image-dataset
6
big-data
6
imagenet
6
artificial-intelligence
6
pascal-voc
6
conversion
6