Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "datasets" keyword

Top 3.6% on pypi.org
torchgeo 0.5.2
TorchGeo: datasets, samplers, transforms, and pre-trained models for geospatial data
11 versions - Latest release: 2 months ago - 3 dependent packages - 4 dependent repositories - 7.03 thousand downloads last month - 2,232 stars on GitHub - 6 maintainers
sinkove 0.0.2
Library for interacting with Sinkove datasets
1 version - Latest release: 10 months ago - 15 downloads last month - 0 stars on GitHub - 2 maintainers
subsetsio 0.5.4
Easily access the Subsets data warehouse using Python.
11 versions - Latest release: 4 months ago - 62 downloads last month - 1 stars on GitHub - 2 maintainers
saf-datasets 0.6.1
Data set loading and annotation facilities for the Simple Annotation Framework
2 versions - Latest release: 1 day ago - 210 downloads last month - 0 stars on GitHub - 1 maintainer
arekit-ss 0.24.0
Low Resource Context Relation Sampler for contexts with relations for fact-checking and fine-tuni...
2 versions - Latest release: 6 months ago - 23 downloads last month - 2 stars on GitHub - 1 maintainer
Top 4.0% on pypi.org
doccano 1.8.4 ๐Ÿ’ฐ
doccano, text annotation tool for machine learning practitioners
31 versions - Latest release: 10 months ago - 7 dependent repositories - 10.4 thousand downloads last month - 8,436 stars on GitHub - 1 maintainer
Top 0.2% on pypi.org
datasets 2.19.1
HuggingFace community-driven open-source library of datasets
83 versions - Latest release: 4 days ago - 650 dependent packages - 14,962 dependent repositories - 9.46 million downloads last month - 18,474 stars on GitHub - 4 maintainers
Top 2.3% on pypi.org
hub 3.0.1
Activeloop Deep Lake
132 versions - Latest release: over 1 year ago - 2 dependent packages - 129 dependent repositories - 4.38 thousand downloads last month - 7,736 stars on GitHub - 1 maintainer
Top 1.3% on pypi.org
deeplake 3.9.4
Activeloop Deep Lake
140 versions - Latest release: 2 days ago - 27 dependent packages - 1,384 dependent repositories - 52.1 thousand downloads last month - 7,736 stars on GitHub - 5 maintainers
fdatasets 1.12.1 removed
HuggingFace/Datasets is an open library of NLP datasets.
1 version - Latest release: about 2 years ago - 14,671 stars on GitHub
hub-redirect 3.0.3
Activeloop Deep Lake
5 versions - Latest release: over 1 year ago - 1 dependent package - 28 downloads last month - 7,725 stars on GitHub - 2 maintainers
tarzan 0.1.0
high-level IO for tar based dataset
1 version - Latest release: 3 months ago - 15 downloads last month - 1,978 stars on GitHub - 2 maintainers
tabdoc 1.0.4
tabular datasets to excel,word,pdf
13 versions - Latest release: 11 months ago - 1 dependent repositories - 91 downloads last month - 1 stars on GitHub - 2 maintainers
Top 2.7% on pypi.org
rlds 0.1.8
A Python library for Reinforcement Learning Datasets.
9 versions - Latest release: about 1 year ago - 2 dependent packages - 79 dependent repositories - 8.52 thousand downloads last month - 216 stars on GitHub - 4 maintainers
redlite 0.1.3
LLM testing on steroids
56 versions - Latest release: 9 days ago - 428 downloads last month - 0 stars on GitHub - 1 maintainer
ucimlr 0.3.0
Easy access to datasets from the UCI Machine Learning Repository
9 versions - Latest release: about 4 years ago - 1 dependent repositories - 118 downloads last month - 5 stars on GitHub - 2 maintainers
Top 3.5% on pypi.org
moabb 1.0.0
Mother of All BCI Benchmarks
10 versions - Latest release: 7 months ago - 2 dependent packages - 24 dependent repositories - 5.94 thousand downloads last month - 624 stars on GitHub - 2 maintainers
visual-graph-datasets 0.15.5
Datasets for the training of graph neural networks (GNNs) and subsequent visualization of attribu...
30 versions - Latest release: about 1 month ago - 1 dependent package - 1 dependent repositories - 421 downloads last month - 0 stars on GitHub - 2 maintainers
Top 10.0% on pypi.org
simulate 0.1.2
HuggingFace community-driven open-source library of simulation environments
7 versions - Latest release: over 1 year ago - 3 dependent repositories - 123 downloads last month - 186 stars on GitHub - 4 maintainers
open-mastr 0.14.3
A package that provides an interface for downloading and processing the data of the Marktstammdat...
14 versions - Latest release: 16 days ago - 1.84 thousand downloads last month - 65 stars on GitHub - 2 maintainers
wildlife-datasets 1.0.3
Library for easier access and research of wildlife re-identification datasets
49 versions - Latest release: about 18 hours ago - 482 downloads last month - 36 stars on GitHub - 1 maintainer
Top 2.2% on pypi.org
colour-science 0.4.4 ๐Ÿ’ฐ
Colour Science for Python
23 versions - Latest release: 5 months ago - 21 dependent packages - 94 dependent repositories - 97.1 thousand downloads last month - 1,969 stars on GitHub - 1 maintainer
Top 9.2% on pypi.org
colour-datasets 0.2.5 ๐Ÿ’ฐ
Colour science datasets for use with Colour
8 versions - Latest release: 5 months ago - 1 dependent package - 4 dependent repositories - 136 downloads last month - 53 stars on GitHub - 2 maintainers
Top 3.7% on pypi.org
climetlab 0.22.2
Handling of climate/meteorological dataa.
175 versions - Latest release: 3 days ago - 5 dependent packages - 19 dependent repositories - 7.63 thousand downloads last month - 351 stars on GitHub - 2 maintainers
report-manager 0.8.0
Manage your reports
6 versions - Latest release: about 1 year ago - 1 dependent package - 35 downloads last month - 2 stars on GitHub - 2 maintainers
survival-datasets 0.1.5
Data loader for common datasets in Survival Analysis.
6 versions - Latest release: 10 months ago - 66 downloads last month - 0 stars on GitHub - 2 maintainers
tinysets 0.0.3
Collection of different datasets
2 versions - Latest release: over 3 years ago - 1 dependent repositories - 24 downloads last month - 6 stars on GitHub - 2 maintainers
safe-ds-datasets 0.18.0
Ready-to-use datasets for the Safe-DS Python library.
1 version - Latest release: 13 days ago - 187 downloads last month - 2 stars on GitHub - 2 maintainers
schemarrow 0.1.1a0 ๐Ÿ’ฐ
A library for switching pandas backend to pyarrow
2 versions - Latest release: 2 months ago - 19 downloads last month - 2 stars on GitHub - 2 maintainers
ast-monitor 0.5.0
AST-Monitor is a wearable Raspberry Pi computer for cyclists
17 versions - Latest release: about 2 months ago - 1 dependent repositories - 129 downloads last month - 6 stars on GitHub - 1 maintainer
Top 2.1% on pypi.org
cleanlab 2.6.4
The standard package for data-centric AI, machine learning with label errors, and automatically f...
29 versions - Latest release: 2 days ago - 8 dependent packages - 19 dependent repositories - 21.7 thousand downloads last month - 8,710 stars on GitHub - 5 maintainers
example-package-elisno 2.6.24
The standard package for data-centric AI, machine learning with label errors, and automatically f...
7 versions - Latest release: 2 months ago - 50 downloads last month - 8,694 stars on GitHub - 1 maintainer
tsgm 0.0.5
Time Series Generative Modelling Framework
5 versions - Latest release: about 2 months ago - 395 downloads last month - 94 stars on GitHub - 2 maintainers
bookrest 0.1.4 ๐Ÿ’ฐ
The easiest way to add a Django and DRF powered API to any project
6 versions - Latest release: about 6 years ago - 1 dependent repositories - 33 downloads last month - 8,956 stars on GitHub - 2 maintainers
datasette-core 0.22.1 ๐Ÿ’ฐ
An instant JSON API for your SQLite databases
1 version - Latest release: almost 6 years ago - 1 dependent repositories - 28 downloads last month - 8,956 stars on GitHub - 2 maintainers
Top 1.3% on pypi.org
datasette 0.64.6 ๐Ÿ’ฐ
An open source multi-tool for exploring and publishing data
147 versions - Latest release: 5 months ago - 104 dependent packages - 285 dependent repositories - 45.1 thousand downloads last month - 8,673 stars on GitHub - 1 maintainer
smartdoc15-ch1 0.8
A Python wrapper for the "computable" version of the SmartDoc 2015 - Challenge 1 dataset.
4 versions - Latest release: almost 6 years ago - 1 dependent repositories - 43 downloads last month - 6 stars on GitHub - 2 maintainers
linora 1.6.0
Simple and efficient tools for data mining and data analysis.
43 versions - Latest release: about 1 year ago - 3 dependent repositories - 348 downloads last month - 12 stars on GitHub - 1 maintainer
Top 7.7% on pypi.org
vision-datasets 1.0.12
A utility repo for vision dataset access and management.
52 versions - Latest release: 3 months ago - 27 dependent repositories - 13.7 thousand downloads last month - 17 stars on GitHub - 2 maintainers
Top 9.5% on pypi.org
squirrel-core 0.19.8
Squirrel is a Python library that enables ML teams to share, load, and transform data in a collab...
107 versions - Latest release: 4 months ago - 2 dependent repositories - 867 downloads last month - 279 stars on GitHub - 2 maintainers
jiant 2.2.0
State-of-the-art Natural Language Processing toolkit for multi-task and transfer learning built o...
6 versions - Latest release: almost 3 years ago - 1 dependent repositories - 92 downloads last month - 1,608 stars on GitHub - 2 maintainers
Top 9.0% on pypi.org
gopup 0.3.8
GoPUP database
38 versions - Latest release: over 1 year ago - 1 dependent repositories - 414 downloads last month - 2,524 stars on GitHub - 1 maintainer
detection_datasets 0.3.8
Easily load and transform datasets for object detection
14 versions - Latest release: 5 months ago - 138 downloads last month - 7 stars on GitHub - 2 maintainers
ml-git 2.9.9
ML-Git: version control for ML artefacts
11 versions - Latest release: 7 months ago - 73 downloads last month - 32 stars on GitHub - 1 maintainer
protaska-gpt 0.0.12 ๐Ÿ’ฐ
Unleash the Potential of Datasets with Intelligent Tasks, Tutorials, and Algorithm Recommendations.
13 versions - Latest release: 11 months ago - 99 downloads last month - 2 stars on GitHub - 2 maintainers
anemoi-datasets 0.1.7
A package to hold various functions to support training of ML models on ECMWF data.
6 versions - Latest release: 3 days ago - 333 downloads last month - 1 stars on GitHub - 2 maintainers
arekit 0.24.0
Library devoted to Document level Attitude and Relation Extraction for text objects with entity-l...
6 versions - Latest release: 6 months ago - 1 dependent package - 71 downloads last month - 52 stars on GitHub - 1 maintainer
mlgeo 0.0.3
Repository for Machine Learning in Geotechnics
3 versions - Latest release: over 3 years ago - 1 dependent repositories - 36 downloads last month - 10 stars on GitHub - 2 maintainers
torch-conduit 0.4.2
Lightweight framework for dataloading with PyTorch and channeling the power of PyTorch Lightning
18 versions - Latest release: 2 months ago - 2 dependent repositories - 169 downloads last month - 11 stars on GitHub - 3 maintainers
datasetsevaluator 0.0.5
A tool to automate collecting and testing against datasets on openml.org
4 versions - Latest release: almost 3 years ago - 1 dependent repositories - 38 downloads last month - 2 maintainers
Top 1.6% on pypi.org
label-studio 1.12.0
Label Studio annotation tool
184 versions - Latest release: 21 days ago - 1 dependent package - 39 dependent repositories - 38.4 thousand downloads last month - 15,269 stars on GitHub - 1 maintainer
tensorclus 0.0.2
TensorClus is a Python package for clustering of three-way tensor data
2 versions - Latest release: about 3 years ago - 1 dependent repositories - 38 downloads last month - 12 stars on GitHub - 1 maintainer
Top 7.7% on pypi.org
cihai 0.33.0
Library for CJK (chinese, japanese, korean) language data.
62 versions - Latest release: about 1 month ago - 1 dependent package - 7 dependent repositories - 593 downloads last month - 76 stars on GitHub - 1 maintainer
iiif_downloader 0.0.8
Download images from IIIF servers
8 versions - Latest release: over 4 years ago - 1 dependent repositories - 81 downloads last month - 14 stars on GitHub - 2 maintainers
adept-augmentations 0.1.1
A Python library aimed at adeptly, augmenting NLP training data.
3 versions - Latest release: about 1 year ago - 52 downloads last month - 54 stars on GitHub - 4 maintainers
cesnet-datazoo 0.1.6
A toolkit for large network traffic datasets
24 versions - Latest release: 4 days ago - 500 downloads last month - 6 stars on GitHub - 2 maintainers
dareblopy 0.0.5
dareblopy
5 versions - Latest release: over 3 years ago - 2 dependent repositories - 380 downloads last month - 102 stars on GitHub - 1 maintainer
datahammer 1.0.3
This module provides an easy way to manipulate and inspect lists of data. It was designed to han...
2 versions - Latest release: 9 months ago - 1 dependent repositories - 17 downloads last month - 0 stars on GitLab.com - 2 maintainers
columnq-cli 0.5.2
Create full-fledged APIs for slowly moving datasets without writing a single line of code.
9 versions - Latest release: 11 days ago - 1 dependent repositories - 533 downloads last month - 3,089 stars on GitHub - 1 maintainer
Top 9.1% on pypi.org
roapi 0.11.3
Create full-fledged APIs for slowly moving datasets without writing a single line of code.
9 versions - Latest release: 11 days ago - 1 dependent repositories - 1.71 thousand downloads last month - 3,089 stars on GitHub - 2 maintainers
roapi-http 0.6.0
Create full-fledged APIs for slowly moving datasets without writing a single line of code.
17 versions - Latest release: about 2 years ago - 1 dependent repositories - 427 downloads last month - 3,089 stars on GitHub - 1 maintainer
dips-plus 1.1.0
The Enhanced Database of Interacting Protein Structures for Interface Prediction
11 versions - Latest release: over 2 years ago - 1 dependent repositories - 30 downloads last month - 41 stars on GitHub - 2 maintainers
gonk-ai 0.1.4
Generic backend for dataset annotation.
4 versions - Latest release: 7 months ago - 36 downloads last month - 5 stars on GitHub - 2 maintainers
roboflow2huggingface 0.0.22
Convert Roboflow datasets into HuggingFace datasets format and upload to HuggingFace Hub.
20 versions - Latest release: about 1 year ago - 177 downloads last month - 1 maintainer
boxs 0.1
Automatically track data and artifacts
1 version - Latest release: over 2 years ago - 1 dependent repositories - 29 downloads last month - 0 stars on GitLab.com - 2 maintainers
ddf-utils 1.0.14
Commonly used functions/utilities for DDF file model.
39 versions - Latest release: over 2 years ago - 1 dependent repositories - 321 downloads last month - 2 stars on GitHub - 2 maintainers
Top 6.2% on pypi.org
datasetsforecast 0.0.8
Datasets for Time series forecasting
7 versions - Latest release: about 1 year ago - 4 dependent packages - 9 dependent repositories - 39.4 thousand downloads last month - 44 stars on GitHub - 6 maintainers
Top 9.4% on pypi.org
jmd-imagescraper 1.0.2
Image scraper for DuckDuckGo for creating deep learning datasets
5 versions - Latest release: over 3 years ago - 4 dependent repositories - 456 downloads last month - 31 stars on GitHub - 2 maintainers
Top 1.4% on pypi.org
nlp 0.4.0
HuggingFace/NLP is an open library of NLP datasets.
8 versions - Latest release: almost 4 years ago - 3 dependent packages - 104 dependent repositories - 7.9 thousand downloads last month - 18,441 stars on GitHub - 2 maintainers
Top 8.2% on pypi.org
minari 0.4.3 ๐Ÿ’ฐ
A standard format for offline reinforcement learning datasets, with popular reference datasets an...
10 versions - Latest release: 3 months ago - 1 dependent package - 1 dependent repositories - 1.08 thousand downloads last month - 218 stars on GitHub - 4 maintainers
rdatasets 0.2.8
provides over 2264 datasets as pandas dataframe from various R packages
10 versions - Latest release: 2 months ago - 2 dependent repositories - 282 downloads last month - 6 stars on GitHub - 2 maintainers
thermostat-datasets 1.1.0
Collection of NLP model explanations and accompanying analysis tools
5 versions - Latest release: 11 months ago - 1 dependent package - 1 dependent repositories - 380 downloads last month - 140 stars on GitHub - 4 maintainers
csdmpy 0.6.0
A python module for the core scientific dataset model.
20 versions - Latest release: 6 months ago - 6 dependent repositories - 1.31 thousand downloads last month - 15 stars on GitHub - 2 maintainers
thetangle 0.6.1
A minimalistic utility to explore The Tangle
3 versions - Latest release: over 1 year ago - 1 dependent repositories - 19 downloads last month - 1 stars on GitHub - 2 maintainers
datasets-server-python 0.5.0
Python SDK to access the Datasets Server
5 versions - Latest release: about 1 year ago - 8 downloads last month - 605 stars on GitHub - 2 maintainers
rnutil 0.4.4
Utility functions for the Neural Networks course for the University of Buenos Aires (UBA)
6 versions - Latest release: about 2 years ago - 1 dependent repositories - 40 downloads last month - 3 stars on GitHub - 1 maintainer
redes-neuronales-util 0.3.3
Utility functions for the Neural Networks course for the University of Buenos Aires (UBA)
3 versions - Latest release: about 2 years ago - 1 dependent repositories - 28 downloads last month - 3 stars on GitHub - 2 maintainers
zooper-premium-handbags-dataset 0.1.4
This package contains premium_handbags datasets library
1 version - Latest release: over 4 years ago - 1 dependent repositories - 4 downloads last month - 2 maintainers
text2sql 0.1.1
Convert natural language questions to SQL query
2 versions - Latest release: over 3 years ago - 1 dependent repositories - 52 downloads last month - 66 stars on GitHub - 1 maintainer
china-datasets 1.0.0.2
china-datasets ๆ˜ฏไธ€ไธชๅฟซ้€Ÿไธ‹่ฝฝไธญๆ–‡ๆ•ฐๆฎ้›†๏ผŒๅค„็†ๆ•ฐๆฎ้›†๏ผŒ็ฒพ็›Šๅปบๆจก็š„ๅŒ…ใ€‚
3 versions - Latest release: over 1 year ago - 27 downloads last month - 67 stars on GitHub - 2 maintainers
Top 0.6% on pypi.org
tensorflow-datasets 4.9.4
tensorflow/datasets is a library of datasets ready to use with TensorFlow.
33 versions - Latest release: 5 months ago - 96 dependent packages - 3,946 dependent repositories - 3.86 million downloads last month - 4,085 stars on GitHub - 8 maintainers
prospr 1.2.1
A toolbox for protein folding with Python.
59 versions - Latest release: 4 months ago - 1 dependent repositories - 1.5 thousand downloads last month - 18 stars on GitHub - 2 maintainers
data-mine 0.0.11
DataMine is a collection of datasets ready to be used for machine learning applications and not o...
11 versions - Latest release: almost 4 years ago - 1 dependent repositories - 43 downloads last month - 9 stars on GitHub - 2 maintainers
kaze 0.0.15
CLI for Managing The Data Dependency of Deep Learning Projects
15 versions - Latest release: about 2 years ago - 1 dependent repositories - 113 downloads last month - 2 maintainers
pytorch-mcrf 0.0.3
Multiple CRF implementation for PyTorch
3 versions - Latest release: almost 3 years ago - 2 dependent repositories - 104 downloads last month - 843 stars on GitHub - 2 maintainers
yellowbrick-datasets 1.0
Yellowbrick datasets management and deployment scripts.
1 version - Latest release: over 5 years ago - 2 dependent repositories - 19 downloads last month - 4 stars on GitHub - 2 maintainers
cvutil3d 0.0.1
Set of auxiliary scripts for 3D computer vision tasks
1 version - Latest release: 6 days ago - 0 stars on GitHub - 2 maintainers
hugdatafast 1.0.0
The elegant bridge between hugginface data and fastai
7 versions - Latest release: over 3 years ago - 4 dependent repositories - 103 downloads last month - 19 stars on GitHub - 2 maintainers
rs-datasets 0.5.1
Tool for autodownloading recommendation systems datasets
11 versions - Latest release: over 1 year ago - 1 dependent repositories - 276 downloads last month - 30 stars on GitHub - 1 maintainer
greekroom 0.0.1
The Greek Room will be a suite of tools supporting Biblical natural language processing.
1 version - Latest release: almost 2 years ago - 1 dependent repositories - 5 downloads last month - 2 maintainers
intake-duckdb 0.1.2
DuckDB plugin for Intake
3 versions - Latest release: about 1 year ago - 49 downloads last month - 1 stars on GitHub - 2 maintainers
cognitive-service-vision-model-customization-python-samples 0.0.6
A sample code repo for model customization using Python for Cognitive Service for Vision.
6 versions - Latest release: 28 days ago - 718 downloads last month - 66 stars on GitHub - 2 maintainers
hf-datasets 0.3.0
HuggingFace/NLP is an open library of NLP datasets.
1 version - Latest release: almost 4 years ago - 1 dependent repositories - 31 downloads last month - 18,441 stars on GitHub - 2 maintainers
almirah 0.3.0
a wardrobe for datasets
3 versions - Latest release: 17 days ago - 201 downloads last month - 0 stars on GitHub - 2 maintainers
Top 4.9% on pypi.org
retriever 3.1.0
Data Retriever
12 versions - Latest release: about 2 years ago - 3 dependent repositories - 5.68 thousand downloads last month - 302 stars on GitHub - 8 maintainers
cvutil 0.0.3
Set of auxiliary scripts for computer vision tasks
1 version - Latest release: 23 days ago - 179 downloads last month - 0 stars on GitHub - 1 maintainer
unithon 0.0.9
unithon - Python library to unify datasets
4 versions - Latest release: over 4 years ago - 1 dependent repositories - 19 downloads last month - 0 stars on GitHub - 2 maintainers
graph-datasets 0.13.1
Load graph datasets.
24 versions - Latest release: 2 months ago - 144 downloads last month - 3 stars on GitHub - 2 maintainers
anemoi-dataset 0.1.5
A package to hold various functions to support training of ML models on ECMWF data.
1 version - Latest release: 7 days ago - 1 stars on GitHub - 2 maintainers
hackernews500kindex 0.1.0
This package contains HackerNews 500K titles indexed with universal sentence encoder.
1 version - Latest release: over 4 years ago - 1 dependent repositories - 10 downloads last month - 1 maintainer