Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "data-curation" keyword

urlgenie 1.0.0
Tool to make URL extraction, generalization, validation, and filtration easy.
1 version - Latest release: 1 day ago - 78 downloads last month - 0 stars on GitHub - 1 maintainer
ldcoolp-figshare 0.3.2
Python tool using the Figshare API for data curation
6 versions - Latest release: almost 3 years ago - 2 dependent repositories - 38 downloads last month - 3 stars on GitHub - 2 maintainers
synrbl 0.0.19
Synthesis Rebalancing Framework for Computational Chemistry
19 versions - Latest release: 3 days ago - 1.66 thousand downloads last month - 5 stars on GitHub - 1 maintainer
tmtk 0.5.8
A toolkit for ETL curation for the tranSMART data warehouse.
26 versions - Latest release: almost 4 years ago - 1 dependent repositories - 164 downloads last month - 6 stars on GitHub - 5 maintainers
selfclean 0.0.22
A holistic self-supervised data cleaning strategy to detect irrelevant samples, near duplicates a...
21 versions - Latest release: 19 days ago - 618 downloads last month - 9 stars on GitHub - 1 maintainer
cubids-bond-fork 0.1.0
BIDS On Disk Editor
1 version - Latest release: over 2 years ago - 1 dependent package - 1 dependent repositories - 73 downloads last month - 18 stars on GitHub - 1 maintainer
cubids 1.1.0
Curation of BIDS (CuBIDS): A sanity-preserving software package for processing BIDS datasets.
10 versions - Latest release: about 1 month ago - 116 downloads last month - 18 stars on GitHub - 2 maintainers
cleanlab-cli 0.1.14
Command line interface for all things Cleanlab Studio
16 versions - Latest release: over 1 year ago - 129 downloads last month - 20 stars on GitHub - 3 maintainers
cleanlab-studio 2.0.4
Client interface for all things Cleanlab Studio
79 versions - Latest release: 13 days ago - 1 dependent repositories - 2.95 thousand downloads last month - 21 stars on GitHub - 4 maintainers
learn2clean 0.2.1
Python Library for Data Preprocessing with Reinforcement Learning.
1 version - Latest release: about 5 years ago - 1 dependent repositories - 21 downloads last month - 43 stars on GitHub - 1 maintainer
sliceguard 0.0.35
A library for detecting critical data slices in structured and unstructured data based on feature...
33 versions - Latest release: 6 months ago - 1 dependent package - 1 dependent repositories - 210 downloads last month - 51 stars on GitHub - 1 maintainer
metamapper 0.1
metamapper
1 version - Latest release: over 3 years ago - 1 dependent repositories - 6 downloads last month - 75 stars on GitHub - 1 maintainer
Top 9.0% on pypi.org
renumics-spotlight 1.6.8
Visualize and maintain datasets to develop and understand data-driven algorithms.
53 versions - Latest release: 2 months ago - 4 dependent packages - 1 dependent repositories - 2.34 thousand downloads last month - 1,016 stars on GitHub - 1 maintainer
Top 8.6% on pypi.org
fastdup 1.123
Fast tool for gaining insights from large image repositories.
332 versions - Latest release: 14 days ago - 1 dependent repositories - 3.97 thousand downloads last month - 1,410 stars on GitHub - 4 maintainers
docta.ai 0.2
Docta.ai
3 versions - Latest release: 9 months ago - 31 downloads last month - 2,953 stars on GitHub - 1 maintainer
fiftyone-eval-only 0.14.3
FiftyOne, for evaluation only.
1 version - Latest release: over 2 years ago - 1 dependent repositories - 16 downloads last month - 6,627 stars on GitHub - 1 maintainer
fiftyone-db-ubuntu2004 0.4.0
FiftyOne DB
1 version - Latest release: over 1 year ago - 1 dependent repositories - 27 downloads last month - 6,627 stars on GitHub - 1 maintainer
fiftyone-db-rhel7 0.4.0
FiftyOne DB
3 versions - Latest release: over 1 year ago - 1 dependent repositories - 37 downloads last month - 6,627 stars on GitHub - 1 maintainer
Top 2.2% on pypi.org
fiftyone-db 1.1.2
FiftyOne DB
21 versions - Latest release: 2 months ago - 1 dependent package - 36 dependent repositories - 54.2 thousand downloads last month - 6,627 stars on GitHub - 3 maintainers
fiftyone-db-debian9 0.4.0
FiftyOne DB
6 versions - Latest release: over 1 year ago - 1 dependent repositories - 43 downloads last month - 6,627 stars on GitHub - 2 maintainers
fiftyone-db-ubuntu2204 0.4.0
FiftyOne DB
1 version - Latest release: about 1 year ago - 7.1 thousand downloads last month - 6,627 stars on GitHub - 1 maintainer
Top 7.2% on pypi.org
fiftyone-desktop 0.33.7
FiftyOne Desktop
60 versions - Latest release: about 1 month ago - 1 dependent package - 1 dependent repositories - 1.9 thousand downloads last month - 6,627 stars on GitHub - 4 maintainers
fiftyone-db-ubuntu1604 0.3.0
FiftyOne DB
5 versions - Latest release: over 3 years ago - 1 dependent repositories - 35 downloads last month - 6,627 stars on GitHub - 2 maintainers
Top 2.1% on pypi.org
cleanlab 2.6.4
The standard package for data-centric AI, machine learning with label errors, and automatically f...
29 versions - Latest release: 12 days ago - 11 dependent packages - 19 dependent repositories - 26.2 thousand downloads last month - 8,808 stars on GitHub - 4 maintainers
example-package-elisno 2.6.24
The standard package for data-centric AI, machine learning with label errors, and automatically f...
7 versions - Latest release: 3 months ago - 65 downloads last month - 8,808 stars on GitHub - 1 maintainer
Related Keywords
data-cleaning 16 data-centric-ai 15 machine-learning 14 data-science 13 data-quality 12 python 12 computer-vision 11 visualization 11 deep-learning 11 image-classification 11 active-learning 10 unstructured-data 9 object-detection 9 artificial-intelligence 8 vector-search 8 developer-tools 8 outlier-detection 5 noisy-labels 4 data-labeling 4 data-profiling 4 data-validation 4 data-analysis 3 data 2 data-visualization 2 text-classification 2 structured-data 2 exploratory-data-analysis 2 llm 2 model-deployment 2 natural-language-processing 2 automl 2 annotations 2 cleanlab 2 datasets 2 dataquality 2 labeling 2 llms 2 out-of-distribution-detection 2 weak-supervision 2 dataops 2 annotation 2 datacentric 2 datacentric_ai 2 unsupervised_learning 2 learning_with_noisy_labels 2 weak_supervision 2 classification 2 confident_learning 2 data_cleaning 2 machine_learning 2 data-organization 2 neuroimaging 2 neuroimaging-data-science 2 neuroinformatics 2 neuroscience 2 neuroscience-methods 2 python-package 2 image-duplicate-detection 1 image-classfication 1 image-analysis 1 image 1 dataset 1 data-augmentation 1 video 1 timeseries 1 meshes 1 images 1 image-processing 1 image-similarity 1 novelty-detection 1 visual-search 1 visualization-tools 1 Data diagnosis 1 curation 1 data-centric-machine-learning 1 data-diagnosis 1 language-model 1 rlhf 1 learn2clean 1 bond 1 self-supervised-learning 1 jupyter-notebook 1 data-modeling 1 concept tree 1 arborist 1 etl 1 transmart 1 unbalanced-reactions 1 rules 1 rebalancing 1 reaction-databases 1 maximum-common-subgraph 1 ld-cool-p 1 figshare 1 url-generalization 1 data-sanitization 1 data-processing 1 data-cleansing 1 generalization 1 url-parsing 1