Ecosyste.ms: Packages
An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.
pypi.org "data-curation" keyword
tmtk 0.5.8
A toolkit for ETL curation for the tranSMART data warehouse.26 versions - Latest release: almost 4 years ago - 1 dependent repositories - 164 downloads last month - 6 stars on GitHub - 5 maintainers
Top 2.1% on pypi.org
30 versions - Latest release: 24 days ago - 11 dependent packages - 19 dependent repositories - 31.3 thousand downloads last month - 8,853 stars on GitHub - 4 maintainers
cleanlab 2.6.4
The standard package for data-centric AI, machine learning with label errors, and automatically f...30 versions - Latest release: 24 days ago - 11 dependent packages - 19 dependent repositories - 31.3 thousand downloads last month - 8,853 stars on GitHub - 4 maintainers
Top 8.6% on pypi.org
332 versions - Latest release: 26 days ago - 1 dependent repositories - 3.97 thousand downloads last month - 1,410 stars on GitHub - 4 maintainers
fastdup 1.123
Fast tool for gaining insights from large image repositories.332 versions - Latest release: 26 days ago - 1 dependent repositories - 3.97 thousand downloads last month - 1,410 stars on GitHub - 4 maintainers
Top 7.2% on pypi.org
61 versions - Latest release: about 2 months ago - 1 dependent package - 1 dependent repositories - 1.65 thousand downloads last month - 6,627 stars on GitHub - 4 maintainers
fiftyone-desktop 0.33.7
FiftyOne Desktop61 versions - Latest release: about 2 months ago - 1 dependent package - 1 dependent repositories - 1.65 thousand downloads last month - 6,627 stars on GitHub - 4 maintainers
cleanlab-studio 2.0.4
Client interface for all things Cleanlab Studio81 versions - Latest release: 25 days ago - 1 dependent repositories - 2.21 thousand downloads last month - 21 stars on GitHub - 4 maintainers
Top 2.2% on pypi.org
22 versions - Latest release: 3 months ago - 1 dependent package - 36 dependent repositories - 52.5 thousand downloads last month - 6,627 stars on GitHub - 3 maintainers
fiftyone-db 1.1.2
FiftyOne DB22 versions - Latest release: 3 months ago - 1 dependent package - 36 dependent repositories - 52.5 thousand downloads last month - 6,627 stars on GitHub - 3 maintainers
cleanlab-cli 0.1.14
Command line interface for all things Cleanlab Studio16 versions - Latest release: over 1 year ago - 79 downloads last month - 21 stars on GitHub - 3 maintainers
fiftyone-db-ubuntu1604 0.3.0
FiftyOne DB5 versions - Latest release: over 3 years ago - 1 dependent repositories - 35 downloads last month - 6,627 stars on GitHub - 2 maintainers
fiftyone-db-debian9 0.4.0
FiftyOne DB6 versions - Latest release: over 1 year ago - 1 dependent repositories - 43 downloads last month - 6,627 stars on GitHub - 2 maintainers
ldcoolp-figshare 0.3.2
Python tool using the Figshare API for data curation6 versions - Latest release: almost 3 years ago - 2 dependent repositories - 13 downloads last month - 3 stars on GitHub - 2 maintainers
cubids 1.1.0
Curation of BIDS (CuBIDS): A sanity-preserving software package for processing BIDS datasets.10 versions - Latest release: about 2 months ago - 116 downloads last month - 18 stars on GitHub - 2 maintainers
selfclean 0.0.22
A holistic self-supervised data cleaning strategy to detect irrelevant samples, near duplicates a...21 versions - Latest release: about 1 month ago - 285 downloads last month - 9 stars on GitHub - 1 maintainer
sliceguard 0.0.35
A library for detecting critical data slices in structured and unstructured data based on feature...33 versions - Latest release: 6 months ago - 1 dependent package - 1 dependent repositories - 210 downloads last month - 51 stars on GitHub - 1 maintainer
Top 9.0% on pypi.org
53 versions - Latest release: 2 months ago - 4 dependent packages - 1 dependent repositories - 2.99 thousand downloads last month - 1,020 stars on GitHub - 1 maintainer
renumics-spotlight 1.6.8
Visualize and maintain datasets to develop and understand data-driven algorithms.53 versions - Latest release: 2 months ago - 4 dependent packages - 1 dependent repositories - 2.99 thousand downloads last month - 1,020 stars on GitHub - 1 maintainer
learn2clean 0.2.1
Python Library for Data Preprocessing with Reinforcement Learning.1 version - Latest release: about 5 years ago - 1 dependent repositories - 21 downloads last month - 43 stars on GitHub - 1 maintainer
urlgenie 1.0.0
Python package to make URL extraction, generalization, validation, and filtration easy.6 versions - Latest release: 13 days ago - 299 downloads last month - 3 stars on GitHub - 1 maintainer
metamapper 0.1
metamapper1 version - Latest release: over 3 years ago - 1 dependent repositories - 8 downloads last month - 75 stars on GitHub - 1 maintainer
fiftyone-eval-only 0.14.3
FiftyOne, for evaluation only.1 version - Latest release: over 2 years ago - 1 dependent repositories - 16 downloads last month - 6,627 stars on GitHub - 1 maintainer
fiftyone-db-ubuntu2204 0.4.0
FiftyOne DB1 version - Latest release: about 1 year ago - 7.1 thousand downloads last month - 6,627 stars on GitHub - 1 maintainer
fiftyone-db-ubuntu2004 0.4.0
FiftyOne DB1 version - Latest release: over 1 year ago - 1 dependent repositories - 27 downloads last month - 6,627 stars on GitHub - 1 maintainer
fiftyone-db-rhel7 0.4.0
FiftyOne DB3 versions - Latest release: over 1 year ago - 1 dependent repositories - 37 downloads last month - 6,627 stars on GitHub - 1 maintainer
synrbl 0.0.19
Synthesis Rebalancing Framework for Computational Chemistry20 versions - Latest release: 14 days ago - 1.82 thousand downloads last month - 5 stars on GitHub - 1 maintainer
docta.ai 0.2
Docta.ai3 versions - Latest release: 10 months ago - 31 downloads last month - 2,953 stars on GitHub - 1 maintainer
example-package-elisno 2.6.24
The standard package for data-centric AI, machine learning with label errors, and automatically f...7 versions - Latest release: 3 months ago - 45 downloads last month - 8,846 stars on GitHub - 1 maintainer
cubids-bond-fork 0.1.0
BIDS On Disk Editor1 version - Latest release: over 2 years ago - 1 dependent package - 1 dependent repositories - 73 downloads last month - 18 stars on GitHub - 1 maintainer
Related Keywords
data-cleaning
16
data-centric-ai
15
machine-learning
14
data-science
13
python
12
data-quality
12
visualization
11
image-classification
11
computer-vision
11
deep-learning
11
active-learning
10
object-detection
9
unstructured-data
9
artificial-intelligence
8
developer-tools
8
vector-search
8
outlier-detection
5
data-profiling
4
data-labeling
4
data-validation
4
noisy-labels
4
data-analysis
3
model-deployment
2
natural-language-processing
2
structured-data
2
text-classification
2
neuroscience-methods
2
exploratory-data-analysis
2
llm
2
data-visualization
2
automl
2
annotations
2
python-package
2
cleanlab
2
neuroinformatics
2
data
2
neuroimaging-data-science
2
neuroimaging
2
data-organization
2
neuroscience
2
machine_learning
2
data_cleaning
2
confident_learning
2
classification
2
weak_supervision
2
learning_with_noisy_labels
2
unsupervised_learning
2
datacentric_ai
2
datacentric
2
annotation
2
dataops
2
dataquality
2
datasets
2
labeling
2
llms
2
out-of-distribution-detection
2
weak-supervision
2
url-generalization
1
data-sanitization
1
data-processing
1
data-cleansing
1
generalization
1
url-parsing
1
reinforcement-learning
1
data-preprocessing
1
data-cleaning-pipeline
1
automated
1
pipeline
1
preprocessing
1
learn2clean
1
bond
1
rlhf
1
language-model
1
data-diagnosis
1
data-centric-machine-learning
1
curation
1
Data diagnosis
1
unbalanced-reactions
1
rules
1
rebalancing
1
reaction-databases
1
maximum-common-subgraph
1
schema-inspection
1
metamapper
1
metadata
1
django
1
data-warehouse
1
data-discovery
1
data-catalog
1
video
1
visual-search
1
novelty-detection
1
image-similarity
1
image-processing
1
image-duplicate-detection
1
image-classfication
1
image-analysis
1
image
1
dataset
1
data-augmentation
1