Ecosyste.ms: Packages

An open API service providing package, version, and dependency metadata for many open source software ecosystems and registries.

pypi.org "data-curation" keyword

Top 2.1% on pypi.org
cleanlab 2.6.4
The standard package for data-centric AI, machine learning with label errors, and automatically f...
29 versions - Latest release: 1 day ago - 8 dependent packages - 19 dependent repositories - 21.7 thousand downloads last month - 8,710 stars on GitHub - 5 maintainers
example-package-elisno 2.6.24
The standard package for data-centric AI, machine learning with label errors, and automatically f...
7 versions - Latest release: 2 months ago - 50 downloads last month - 8,694 stars on GitHub - 1 maintainer
tmtk 0.5.8
A toolkit for ETL curation for the tranSMART data warehouse.
26 versions - Latest release: almost 4 years ago - 1 dependent repository - 164 downloads last month - 6 stars on GitHub - 10 maintainers
Top 9.0% on pypi.org
renumics-spotlight 1.6.8
Visualize and maintain datasets to develop and understand data-driven algorithms.
53 versions - Latest release: about 2 months ago - 3 dependent packages - 1 dependent repository - 1.66 thousand downloads last month - 1,014 stars on GitHub - 2 maintainers
sliceguard 0.0.35
A library for detecting critical data slices in structured and unstructured data based on feature...
33 versions - Latest release: 5 months ago - 1 dependent package - 1 dependent repository - 210 downloads last month - 51 stars on GitHub - 2 maintainers
cleanlab-studio 2.0.4
Client interface for all things Cleanlab Studio
79 versions - Latest release: 2 days ago - 1 dependent repository - 3.22 thousand downloads last month - 21 stars on GitHub - 5 maintainers
selfclean 0.0.22
A holistic self-supervised data cleaning strategy to detect irrelevant samples, near duplicates a...
21 versions - Latest release: 8 days ago - 618 downloads last month - 9 stars on GitHub - 2 maintainers
learn2clean 0.2.1
Python Library for Data Preprocessing with Reinforcement Learning.
1 version - Latest release: about 5 years ago - 1 dependent repository - 21 downloads last month - 43 stars on GitHub - 1 maintainer
synrbl 0.0.10
Synthesis Rebalancing Framework for Computational Chemistry
10 versions - Latest release: 8 days ago - 756 downloads last month - 5 stars on GitHub - 2 maintainers
cubids-bond-fork 0.1.0
BIDS On Disk Editor
1 version - Latest release: over 2 years ago - 1 dependent package - 1 dependent repository - 73 downloads last month - 18 stars on GitHub - 1 maintainer
ldcoolp-figshare 0.3.2
Python tool using the Figshare API for data curation
6 versions - Latest release: almost 3 years ago - 2 dependent repositories - 38 downloads last month - 3 stars on GitHub - 4 maintainers
Top 2.2% on pypi.org
fiftyone-db 1.1.2
FiftyOne DB
21 versions - Latest release: 2 months ago - 1 dependent package - 36 dependent repositories - 62.8 thousand downloads last month - 6,627 stars on GitHub - 4 maintainers
cleanlab-cli 0.1.14
Command line interface for all things Cleanlab Studio
16 versions - Latest release: over 1 year ago - 129 downloads last month - 20 stars on GitHub - 6 maintainers
metamapper 0.1
metamapper
1 version - Latest release: over 3 years ago - 1 dependent repository - 6 downloads last month - 75 stars on GitHub - 1 maintainer
docta.ai 0.2
Docta.ai
3 versions - Latest release: 9 months ago - 15 downloads last month - 2,799 stars on GitHub - 1 maintainer
Top 7.2% on pypi.org
fiftyone-desktop 0.33.7
FiftyOne Desktop
60 versions - Latest release: 23 days ago - 1 dependent package - 1 dependent repository - 1.33 thousand downloads last month - 6,627 stars on GitHub - 6 maintainers
fiftyone-db-ubuntu2204 0.4.0
FiftyOne DB
1 version - Latest release: 12 months ago - 6.7 thousand downloads last month - 6,627 stars on GitHub - 2 maintainers
fiftyone-db-ubuntu2004 0.4.0
FiftyOne DB
1 version - Latest release: over 1 year ago - 1 dependent repository - 15 downloads last month - 6,627 stars on GitHub - 1 maintainer
fiftyone-db-debian9 0.4.0
FiftyOne DB
6 versions - Latest release: over 1 year ago - 1 dependent repository - 10 downloads last month - 6,627 stars on GitHub - 2 maintainers
fiftyone-db-ubuntu1604 0.3.0
FiftyOne DB
5 versions - Latest release: over 3 years ago - 1 dependent repository - 15 downloads last month - 6,619 stars on GitHub - 2 maintainers
fiftyone-eval-only 0.14.3
FiftyOne, for evaluation only.
1 version - Latest release: over 2 years ago - 1 dependent repository - 6 downloads last month - 6,625 stars on GitHub - 2 maintainers
fiftyone-db-rhel7 0.4.0
FiftyOne DB
3 versions - Latest release: over 1 year ago - 1 dependent repository - 17 downloads last month - 6,625 stars on GitHub - 1 maintainer
cubids 1.1.0
Curation of BIDS (CuBIDS): A sanity-preserving software package for processing BIDS datasets.
10 versions - Latest release: 27 days ago - 23 downloads last month - 18 stars on GitHub - 4 maintainers
Top 8.6% on pypi.org
fastdup 1.120
Fast tool for gaining insights from large image repositories.
329 versions - Latest release: about 1 month ago - 1 dependent repository - 6.37 thousand downloads last month - 1,392 stars on GitHub - 4 maintainers
Related Keywords
data-cleaning (15), data-centric-ai (15), machine-learning (14), data-science (13), python (12), data-quality (12), computer-vision (11), visualization (11), deep-learning (11), image-classification (11), active-learning (10), object-detection (9), unstructured-data (9), vector-search (8), developer-tools (8), artificial-intelligence (8), outlier-detection (5), data-validation (4), data-labeling (4), data-profiling (4), noisy-labels (4), data-analysis (3), data-visualization (2), exploratory-data-analysis (2), python-package (2), neuroscience-methods (2), neuroscience (2), cleanlab (2), annotations (2), automl (2), neuroinformatics (2), llm (2), neuroimaging (2), model-deployment (2), natural-language-processing (2), structured-data (2), text-classification (2), data (2), data-organization (2), neuroimaging-data-science (2), weak-supervision (2), out-of-distribution-detection (2), llms (2), labeling (2), datasets (2), dataquality (2), machine_learning (2), data_cleaning (2), confident_learning (2), classification (2), weak_supervision (2), learning_with_noisy_labels (2), unsupervised_learning (2), dataops (2), annotation (2), datacentric (2), datacentric_ai (2), visualization-tools (1), visual-search (1), novelty-detection (1), image-similarity (1), image-processing (1), image-duplicate-detection (1), image-classfication (1), figshare (1), ld-cool-p (1), image-analysis (1), image (1), dataset (1), data-augmentation (1), data-catalog (1), data-discovery (1), data-warehouse (1), rlhf (1), django (1), metadata (1), language-model (1), data-diagnosis (1), data-centric-machine-learning (1), curation (1), Data diagnosis (1), metamapper (1), schema-inspection (1), bond (1), concept tree (1), data-modeling (1), eda (1), jupyter-notebook (1), data-exploration (1), video (1), data curation (1), timeseries (1), meshes (1), machine learning (1), images (1), hacktoberfest (1), data science (1), pandas (1), ai (1), audio (1)