Ecosyste.ms: Packages

An open API service providing package, version, and dependency metadata for many open source software ecosystems and registries.

pypi.org "dataset" keyword

egegrouper 0.7.1
Tool for grouping EGEG examinations
11 versions - Latest release: about 6 years ago - 1 dependent repositories - 112 downloads last month - 2 maintainers
Top 7.7% on pypi.org
cihai 0.33.0
Library for CJK (Chinese, Japanese, Korean) language data.
62 versions - Latest release: about 1 month ago - 1 dependent package - 7 dependent repositories - 593 downloads last month - 76 stars on GitHub - 1 maintainer
Top 8.5% on pypi.org
segments-ai 1.7.6
Segments.ai Python SDK
129 versions - Latest release: 14 days ago - 1 dependent package - 3 dependent repositories - 4.05 thousand downloads last month - 20 stars on GitHub - 1 maintainer
daget 0.5
Download dataset via DOI or landing page url
5 versions - Latest release: 7 months ago - 62 downloads last month - 1 stars on GitHub - 2 maintainers
Top 3.5% on pypi.org
pix2tex 0.1.2
pix2tex: Using a ViT to convert images of equations into LaTeX code.
31 versions - Latest release: 12 months ago - 1 dependent package - 5 dependent repositories - 4.58 thousand downloads last month - 10,900 stars on GitHub - 2 maintainers
openfisca-uk-data 0.9.0 💰
A Python package to manage OpenFisca-UK-compatible microdata
20 versions - Latest release: about 2 years ago - 1 dependent repositories - 178 downloads last month - 0 stars on GitHub - 2 maintainers
Top 5.0% on pypi.org
convokit 3.0.0
ConvoKit
64 versions - Latest release: 9 months ago - 15 dependent repositories - 2.93 thousand downloads last month - 512 stars on GitHub - 4 maintainers
adept-augmentations 0.1.1
A Python library aimed at adeptly augmenting NLP training data.
3 versions - Latest release: about 1 year ago - 52 downloads last month - 54 stars on GitHub - 4 maintainers
Top 5.8% on pypi.org
pylabel 0.1.55 💰
Transform, analyze, and visualize computer vision annotations.
57 versions - Latest release: 5 months ago - 1 dependent package - 3 dependent repositories - 2.13 thousand downloads last month - 297 stars on GitHub - 1 maintainer
aws-json-dataset 0.1.0
Send JSON datasets to various AWS services.
1 version - Latest release: 3 months ago - 17 downloads last month - 0 stars on GitHub - 1 maintainer
visuallayer 0.0.15
Open, Clean Datasets for Computer Vision.
5 versions - Latest release: 11 months ago - 35 downloads last month - 65 stars on GitHub - 2 maintainers
qtdatasetviewer 0.0.4
Torch dataset explorer
3 versions - Latest release: about 1 year ago - 25 downloads last month - 0 stars on GitHub - 1 maintainer
Top 9.6% on pypi.org
crowsetta 5.0.2
A Python tool to work with any format for annotating animal vocalizations and bioacoustics data
32 versions - Latest release: 3 months ago - 4 dependent packages - 5 dependent repositories - 432 downloads last month - 48 stars on GitHub - 2 maintainers
oapapersloader 1.0.1
Package for working with OAPapers dataset.
2 versions - Latest release: 11 days ago - 219 downloads last month - 0 stars on GitHub - 2 maintainers
Top 6.6% on pypi.org
scipp 24.2.0
Multi-dimensional data arrays with labeled dimensions
42 versions - Latest release: 3 months ago - 6 dependent packages - 3 dependent repositories - 6.64 thousand downloads last month - 106 stars on GitHub - 3 maintainers
bigquery-test-kit 0.5.0
BigQuery test kit
2 versions - Latest release: 3 months ago - 1 dependent repositories - 445 downloads last month - 50 stars on GitHub - 2 maintainers
Top 4.7% on pypi.org
mirdata 0.3.8
Common loaders for MIR datasets.
46 versions - Latest release: 6 months ago - 3 dependent packages - 5 dependent repositories - 1.67 thousand downloads last month - 344 stars on GitHub - 6 maintainers
dao-scripts 1.2.2
A tool to download data to monitor DAO activity
25 versions - Latest release: 5 months ago - 1 dependent package - 419 downloads last month - 0 stars on GitHub - 4 maintainers
solidago 0.1.1 💰
Algorithms for Secure Algorithmic Governance
9 versions - Latest release: 13 days ago - 115 downloads last month - 314 stars on GitHub - 4 maintainers
basicprop 0.5.2
A synthetic dataset used for generative models
7 versions - Latest release: over 7 years ago - 30 downloads last month - 2 maintainers
referit 1.0.0
Python wrapper to load the ReferIt Game dataset
1 version - Latest release: over 6 years ago - 1 dependent repositories - 32 downloads last month - 389 stars on GitHub - 2 maintainers
open-mastr 0.14.3
A package that provides an interface for downloading and processing the data of the Marktstammdat...
14 versions - Latest release: 12 days ago - 1.71 thousand downloads last month - 65 stars on GitHub - 2 maintainers
tsp 1.7.3
Making permafrost data effortless
29 versions - Latest release: 8 months ago - 12 dependent repositories - 335 downloads last month - 5 stars on GitLab.com - 1 maintainer
climateserv 1.0.5
This is a package to access the ClimateSERV API (https://climateserv.servirglobal.net/)
17 versions - Latest release: about 2 months ago - 1 dependent repositories - 202 downloads last month - 6 stars on GitHub - 1 maintainer
petbenchmarks 0.0.2
Benchmarking procedure to test Process Extraction from Text approaches on the PET dataset
6 versions - Latest release: over 1 year ago - 1 dependent repositories - 30 downloads last month - 2 maintainers
download-oscar 2.1
Downloading all files of a language from the OSCAR (Open Super-large Crawled Aggregated coRpus)
4 versions - Latest release: about 2 years ago - 1 dependent repositories - 41 downloads last month - 8 stars on GitHub - 1 maintainer
propertylistings 0.1.0
Webscraping tool for archiving sales records on RightMove.
1 version - Latest release: over 2 years ago - 1 dependent repositories - 11 downloads last month - 1 stars on GitHub - 2 maintainers
ut-course-catalog 0.2.16 💰
Python package for fetching UTokyo Online Course Catalogue
19 versions - Latest release: about 1 month ago - 164 downloads last month - 0 stars on GitHub - 2 maintainers
fedscale 1.0
FedScale: Benchmarking Model and System Performance of Federated Learning
2 versions - Latest release: about 1 month ago - 123 downloads last month - 361 stars on GitHub - 2 maintainers
mhm 5.13.1
Python distribution of mHM with bindings.
3 versions - Latest release: 9 months ago - 161 downloads last month - 214 stars on GitHub - 4 maintainers
llm-dataset-converter 0.2.3
Python3 library for converting between various LLM dataset formats.
11 versions - Latest release: about 15 hours ago - 1 dependent repositories - 193 downloads last month - 4 stars on GitHub - 1 maintainer
ckanext-datarequests 1.1.0
CKAN Extension - Data Requests
11 versions - Latest release: over 5 years ago - 2 dependent repositories - 104 downloads last month - 17 stars on GitHub - 4 maintainers
gonk-ai 0.1.4
Generic backend for dataset annotation.
4 versions - Latest release: 7 months ago - 36 downloads last month - 5 stars on GitHub - 2 maintainers
Top 6.9% on pypi.org
waymo-open-dataset-tf-2-1-0 1.3.1
Waymo Open Dataset libraries.
4 versions - Latest release: almost 3 years ago - 1 dependent package - 97 dependent repositories - 287 downloads last month - 4 maintainers
dictabase 4.0.16
A database interface that mimics a Python dictionary.
41 versions - Latest release: almost 4 years ago - 2 dependent repositories - 10 downloads last month - 5 stars on GitHub - 2 maintainers
xnrl 2022.9.29
xNRL - Read NRL files into xarray Datasets nested within pandas DataFrames
1 version - Latest release: over 1 year ago - 14 downloads last month - 4 stars on GitHub - 2 maintainers
wiker 0.0.1
Library for wikipedia dataset collection
1 version - Latest release: over 1 year ago - 15 downloads last month - 0 stars on GitHub - 1 maintainer
shuttum 1.1
A utility API to easily interact with the ShutTUM dataset
3 versions - Latest release: about 6 years ago - 1 dependent repositories - 14 downloads last month - 13 stars on GitHub - 2 maintainers
answer 0.1
Brazilian Agricultural Research Corporation (EMBRAPA) fully annotated dataset for plant diseases....
1 version - Latest release: about 5 years ago - 2 dependent repositories - 108 downloads last month - 40 stars on GitHub - 1 maintainer
Top 0.7% on pypi.org
torchtext 0.18.0
Text utilities, models, transforms, and datasets for PyTorch.
33 versions - Latest release: 12 days ago - 69 dependent packages - 2,976 dependent repositories - 638 thousand downloads last month - 3,445 stars on GitHub - 6 maintainers
ddf-utils 1.0.14
Commonly used functions/utilities for DDF file model.
39 versions - Latest release: over 2 years ago - 1 dependent repositories - 321 downloads last month - 2 stars on GitHub - 2 maintainers
nidsdata 0.0.3
NIDS Dataset
1 version - Latest release: over 3 years ago - 1 dependent repositories - 22 downloads last month - 36 stars on GitHub - 1 maintainer
Top 5.0% on pypi.org
clip-retrieval 2.44.0
Easily computing clip embeddings and building a clip retrieval system with them
86 versions - Latest release: 4 months ago - 3 dependent repositories - 9.47 thousand downloads last month - 2,085 stars on GitHub - 1 maintainer
Top 9.4% on pypi.org
jmd-imagescraper 1.0.2
Image scraper for DuckDuckGo for creating deep learning datasets
5 versions - Latest release: over 3 years ago - 4 dependent repositories - 456 downloads last month - 31 stars on GitHub - 2 maintainers
myca 0.1.1
Personalized Text Classification dataset
2 versions - Latest release: over 1 year ago - 2 dependent repositories - 9 downloads last month - 0 stars on GitHub - 2 maintainers
nntools 0.1.0
Light library built to facilitate the training of neural networks with PyTorch.
2 versions - Latest release: 10 months ago - 1 dependent repositories - 18 downloads last month - 2 stars on GitHub - 2 maintainers
classixclustering 1.2.5
Fast and explainable clustering based on sorting
117 versions - Latest release: 30 days ago - 1 dependent repositories - 464 downloads last month - 81 stars on GitHub - 1 maintainer
dwcontents 1.0.0b5
Jupyter contents manager for data.world
5 versions - Latest release: about 6 years ago - 1 dependent repositories - 31 downloads last month - 1 stars on GitHub - 2 maintainers
waymo-open-dataset-tf-2-4-0 1.4.1
Waymo Open Dataset libraries.
3 versions - Latest release: over 2 years ago - 1 dependent package - 7 dependent repositories - 77 downloads last month - 6 maintainers
mahanlp 0.0.2
An NLP Library for Marathi Language
11 versions - Latest release: almost 2 years ago - 279 downloads last month - 87 stars on GitHub - 1 maintainer
demomarlib 0.0.3
An NLP Library for Marathi Language
3 versions - Latest release: over 1 year ago - 15 downloads last month - 87 stars on GitHub - 2 maintainers
randomlib 4.5
An NLP Library for Marathi Language
44 versions - Latest release: about 1 year ago - 117 downloads last month - 87 stars on GitHub - 1 maintainer
maha-nlp 0.0.4 removed
An NLP Library for Marathi
4 versions - Latest release: over 1 year ago - 19 stars on GitHub
bq-test-kit 0.4.3
BigQuery test kit
10 versions - Latest release: about 3 years ago - 1 dependent repositories - 473 downloads last month - 50 stars on GitHub - 2 maintainers
datasetsync 0.0.1
Download dataset
1 version - Latest release: over 1 year ago - 22 downloads last month - 2 maintainers
factor-table 0.2.3
data collection
3 versions - Latest release: almost 2 years ago - 1 dependent repositories - 12 downloads last month - 1 maintainer
brec 1.0.0
pypi distribution for BREC
1 version - Latest release: 9 months ago - 15 downloads last month - 10 stars on GitHub - 1 maintainer
Top 5.2% on pypi.org
waymo-open-dataset-tf-2-6-0 1.4.9
Waymo Open Dataset libraries.
8 versions - Latest release: almost 2 years ago - 3 dependent packages - 12 dependent repositories - 628 downloads last month - 2 maintainers
pygta-data-collector 0.0.1
Collecting images for GTA self-driving car project
1 version - Latest release: over 1 year ago - 9 downloads last month - 2 maintainers
chreader 0.2.1
An open-source Chinese NLP Dataset Reader library, built on allennlp & pytorch.
2 versions - Latest release: over 3 years ago - 29 downloads last month - 2 maintainers
stopes 1.0.1
Large-Scale Translation Data Mining.
2 versions - Latest release: almost 2 years ago - 1 dependent repositories - 33 downloads last month - 230 stars on GitHub - 4 maintainers
Top 9.8% on pypi.org
synergy-dataset 1.0.3
Python package for the SYNERGY dataset
12 versions - Latest release: 11 months ago - 1 dependent package - 1 dependent repositories - 5.11 thousand downloads last month - 49 stars on GitHub - 2 maintainers
tfds-aihub 0.1.0
A project for processing AIHub data into the TFRecord format
1 version - Latest release: over 2 years ago - 1 dependent repositories - 10 downloads last month - 0 stars on GitHub - 2 maintainers
simpledataqualityanalyzer 1.0.0.4
A Python package that analyzes CSV files and generates a simple HTML report with some summary sta...
2 versions - Latest release: over 3 years ago - 1 dependent repositories - 17 downloads last month - 0 stars on GitLab.com - 2 maintainers
geox 0.0.14
GeoX, Geostatic Dataset Integration Tool
14 versions - Latest release: over 1 year ago - 78 downloads last month - 2 stars on GitHub - 2 maintainers
convertme 0.1.4
Simple dataset converter in Python
5 versions - Latest release: over 1 year ago - 18 downloads last month - 2 stars on GitHub - 2 maintainers
Top 0.9% on pypi.org
pandas-datareader 0.10.0
Data readers extracted from the pandas codebase; should be compatible with recent pandas versions
22 versions - Latest release: almost 3 years ago - 49 dependent packages - 3,913 dependent repositories - 466 thousand downloads last month - 2,811 stars on GitHub - 2 maintainers
target-datadotworld 1.0.1
Singer target for data.world
5 versions - Latest release: almost 6 years ago - 1 dependent repositories - 45 downloads last month - 3 stars on GitHub - 2 maintainers
augmenting 0.0.0
An image dataset augmentation package.
1 version - Latest release: about 1 year ago - 1 dependent repositories - 27 downloads last month - 2 maintainers
achan-test 0.0.1b10
Proto IDL for Data Catalog Service
10 versions - Latest release: over 4 years ago - 91 downloads last month - 54 stars on GitHub - 1 maintainer
pycites 0.1.4
Package to download and interact with the CITES Trade Database in Python
4 versions - Latest release: about 2 years ago - 1 dependent repositories - 13 downloads last month - 3 stars on GitHub - 2 maintainers
Top 8.3% on pypi.org
waymo-open-dataset-tf-2-5-0 1.4.1
Waymo Open Dataset libraries.
2 versions - Latest release: over 2 years ago - 33 dependent repositories - 362 downloads last month - 4 maintainers
dapipe 0.2.1
Creates dataset builder objects
3 versions - Latest release: over 3 years ago - 1 dependent repositories - 37 downloads last month - 8 stars on GitHub - 2 maintainers
odsclient 0.8.4
An unofficial client for OpenDataSoft API.
13 versions - Latest release: over 2 years ago - 1 dependent repositories - 344 downloads last month - 8 stars on GitHub - 2 maintainers
brainpy-datasets 0.0.0.7
BrainPy Datasets
6 versions - Latest release: 11 months ago - 80 downloads last month - 2 stars on GitHub - 1 maintainer
pywhu3d 0.2.16
Example pywhu3d tool Package
20 versions - Latest release: 4 months ago - 102 downloads last month - 1 maintainer
tf-inputs 0.2.3
Input pipelines for TensorFlow that make sense.
5 versions - Latest release: about 5 years ago - 1 dependent repositories - 67 downloads last month - 4 stars on GitHub - 2 maintainers
Top 0.6% on pypi.org
tensorflow-datasets 4.9.4
tensorflow/datasets is a library of datasets ready to use with TensorFlow.
33 versions - Latest release: 5 months ago - 96 dependent packages - 3,946 dependent repositories - 3.86 million downloads last month - 4,085 stars on GitHub - 8 maintainers
insperareader 0.1.2
JSON parsing of Inspera Assessment files
2 versions - Latest release: over 2 years ago - 1 dependent repositories - 3 downloads last month - 0 stars on GitHub - 2 maintainers
rgmining-amazon-dataset 0.5.1
An Amazon dataset for Review Graph Mining Project
3 versions - Latest release: almost 7 years ago - 1 dependent repositories - 12 downloads last month - 6 stars on GitHub - 2 maintainers
webdata 0.0.1
Publish data on web
1 version - Latest release: over 8 years ago - 4 dependent repositories - 23 downloads last month - 1 stars on GitHub - 2 maintainers
datasetstools 1.0.3
2 versions - Latest release: over 6 years ago - 1 dependent repositories - 10 downloads last month - 2 maintainers
mlstructfp 0.5.7
Machine learning structural floor plan dataset
36 versions - Latest release: about 1 month ago - 250 downloads last month - 8 stars on GitHub - 1 maintainer
datapipeml 0.8
Framework to manipulate dataframes fluidly in a pipeline.
1 version - Latest release: about 6 years ago - 1 dependent repositories - 12 downloads last month - 7 stars on GitHub - 2 maintainers
ml-pyxis 0.4.dev0
Tool for reading and writing datasets of tensors with MessagePack and Lightning Memory-Mapped Dat...
1 version - Latest release: almost 3 years ago - 2 dependent repositories - 43 downloads last month - 114 stars on GitHub - 2 maintainers
iden 0.0.3
simple library to manage a dataset of shards to train machine learning models
9 versions - Latest release: about 2 months ago - 37.6 thousand downloads last month - 0 stars on GitHub - 2 maintainers
biobeee 0.0.5
Bioinformatics tool for performing web scraping on biological databases and pre-processing
3 versions - Latest release: 6 months ago - 27 downloads last month - 0 stars on GitLab.com - 2 maintainers
Top 2.2% on pypi.org
colour-science 0.4.4 💰
Colour Science for Python
23 versions - Latest release: 5 months ago - 21 dependent packages - 94 dependent repositories - 94.4 thousand downloads last month - 1,969 stars on GitHub - 1 maintainer
Top 1.8% on pypi.org
cvat-sdk 2.12.1
CVAT REST API
34 versions - Latest release: 7 days ago - 3 dependent packages - 54 dependent repositories - 12.4 thousand downloads last month - 11,417 stars on GitHub - 6 maintainers
kyoushi-dataset 0.2.1
Tool for labeling log data from testbeds
3 versions - Latest release: over 2 years ago - 1 dependent repositories - 23 downloads last month - 1 stars on GitHub - 2 maintainers
Top 9.2% on pypi.org
colour-datasets 0.2.5 💰
Colour science datasets for use with Colour
8 versions - Latest release: 5 months ago - 1 dependent package - 4 dependent repositories - 149 downloads last month - 52 stars on GitHub - 2 maintainers
dataset-builder-for-segmentation 0.0.3
A dataset pipeline builder for semantic image segmentation
3 versions - Latest release: over 2 years ago - 1 dependent repositories - 9 downloads last month - 3 stars on GitHub - 1 maintainer
datamaestro-ml 2021.10.5
Machine learning related datasets
9 versions - Latest release: over 2 years ago - 1 dependent repositories - 57 downloads last month - 0 stars on GitHub - 2 maintainers
apibackuper 1.0.8
apibackuper: a command-line tool and Python library for backing up APIs
7 versions - Latest release: over 1 year ago - 1 dependent repositories - 165 downloads last month - 13 stars on GitHub - 1 maintainer
Top 1.6% on pypi.org
quandl 3.7.0
Package for quandl API access
62 versions - Latest release: over 2 years ago - 8 dependent packages - 303 dependent repositories - 137 thousand downloads last month - 1,356 stars on GitHub - 6 maintainers
pandas-datareader-gdax 0.1.2 💰
GDAX data for Pandas in the style of DataReader.
2 versions - Latest release: over 6 years ago - 1 dependent repositories - 34 downloads last month - 11 stars on GitHub - 1 maintainer
dataexploration 1.0.0
Utility for exploring the dataset
1 version - Latest release: over 6 years ago - 1 dependent repositories - 8 downloads last month - 2 maintainers
vl-datasets 0.0.11
Open, Clean Datasets for Computer Vision.
11 versions - Latest release: 12 months ago - 98 downloads last month - 64 stars on GitHub - 2 maintainers
pydataencoder 1.0.5
Dataset Encoder Package
6 versions - Latest release: about 2 years ago - 1 dependent repositories - 20 downloads last month - 0 stars on GitHub - 1 maintainer
dogsandcats 0.0.4
Dataset package for dogs and cats classification problem
2 versions - Latest release: over 3 years ago - 1 dependent repositories - 17 downloads last month - 1 maintainer
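
The summary line under each package in this listing follows a regular pattern: fields separated by " - ", each either a "Latest release:" label or a count followed by a description. The following is a minimal Python sketch of how such a line could be turned into structured data, assuming only the text format shown on this page. The helper names `parse_count` and `parse_summary` and the output keys are hypothetical and are not part of any Ecosyste.ms API.

```python
import re

# Multipliers for the spelled-out magnitudes seen in the listing
# (for example "4.05 thousand" or "3.86 million").
_SCALE = {"thousand": 1_000, "million": 1_000_000}

# A field is a number (optionally followed by a magnitude word) and a label.
_FIELD = re.compile(
    r"^(?P<count>[\d.,]+(?: (?:thousand|million))?) (?P<label>.+)$"
)


def parse_count(text):
    """Convert strings such as '2,976' or '4.05 thousand' to an integer."""
    parts = text.replace(",", "").split()
    value = float(parts[0])
    if len(parts) > 1:
        value *= _SCALE[parts[1]]
    return round(value)


def _key(label):
    """Map a singular or plural label to one stable dictionary key."""
    label = label.lower()
    if label.startswith("version"):
        return "versions"
    if label.startswith("dependent package"):
        return "dependent_packages"
    if label.startswith("dependent repositor"):
        return "dependent_repositories"
    if label.startswith("downloads"):
        return "downloads_last_month"
    if label.startswith("stars"):
        return "stars"
    if label.startswith("maintainer"):
        return "maintainers"
    return label  # unknown labels are kept verbatim


def parse_summary(line):
    """Parse one summary line of the listing into a dictionary."""
    result = {}
    for part in line.split(" - "):
        part = part.strip()
        if part.startswith("Latest release:"):
            # The release age is relative text, so it is kept as a string.
            result["latest_release"] = part.split(":", 1)[1].strip()
            continue
        match = _FIELD.match(part)
        if match:
            result[_key(match.group("label"))] = parse_count(match.group("count"))
    return result


# Usage with the segments-ai summary line from the listing above.
example = (
    "129 versions - Latest release: 14 days ago - 1 dependent package - "
    "3 dependent repositories - 4.05 thousand downloads last month - "
    "20 stars on GitHub - 1 maintainer"
)
print(parse_summary(example))
```

Because the counts in the listing are rounded ("4.05 thousand"), the parsed download figures are approximations, and the relative release ages depend on when the page was captured.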