Ecosyste.ms: Packages
An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.
pypi.org "dedupe" keyword
Top 1.8% on pypi.org
102 versions - Latest release: 2 months ago - 3 dependent packages - 83 dependent repositories - 12 thousand downloads last month - 10,494 stars on GitHub - 1 maintainer
borgbackup 1.2.8 💰
Deduplicated, encrypted, authenticated and compressed backups102 versions - Latest release: 2 months ago - 3 dependent packages - 83 dependent repositories - 12 thousand downloads last month - 10,494 stars on GitHub - 1 maintainer
Top 7.6% on pypi.org
8 versions - Latest release: about 4 years ago - 1 dependent package - 17 dependent repositories - 1.14 thousand downloads last month - 30 stars on GitHub - 1 maintainer
django-super-deduper 0.1.4
Utilities for deduping Django model instances8 versions - Latest release: about 4 years ago - 1 dependent package - 17 dependent repositories - 1.14 thousand downloads last month - 30 stars on GitHub - 1 maintainer
Top 1.9% on pypi.org
23 versions - Latest release: about 5 years ago - 6 dependent packages - 59 dependent repositories - 2.13 million downloads last month - 916 stars on GitHub - 1 maintainer
recordlinkage 0.13.2 💰
A record linkage toolkit for linking and deduplication23 versions - Latest release: about 5 years ago - 6 dependent packages - 59 dependent repositories - 2.13 million downloads last month - 916 stars on GitHub - 1 maintainer
Top 1.6% on pypi.org
174 versions - Latest release: over 1 year ago - 7 dependent packages - 132 dependent repositories - 67.3 thousand downloads last month - 3,989 stars on GitHub - 2 maintainers
dedupe 2.0.23
A python library for accurate and scaleable data deduplication and entity-resolution174 versions - Latest release: over 1 year ago - 7 dependent packages - 132 dependent repositories - 67.3 thousand downloads last month - 3,989 stars on GitHub - 2 maintainers
mail-deduplicate 7.3.0 💰
📧 CLI to deduplicate mails from mail boxes.18 versions - Latest release: 7 months ago - 1 dependent repositories - 75 downloads last month - 159 stars on GitHub - 1 maintainer
yadupe 1.1.0
Recursively scan one or more given directories for duplicate files.4 versions - Latest release: over 2 years ago - 1 dependent repositories - 22 downloads last month - 0 stars on GitHub - 1 maintainer
maildir-deduplicate 2.2.0 💰
Deduplicate mails from a set of maildir folders.11 versions - Latest release: almost 4 years ago - 2 dependent repositories - 104 downloads last month - 159 stars on GitHub - 1 maintainer
Top 8.8% on pypi.org
3 versions - Latest release: 5 months ago - 1 dependent repositories - 1.27 thousand downloads last month - 900 stars on GitHub - 1 maintainer
zingg 0.4.0
Zingg Entity Resolution, Data Mastering and Deduplication3 versions - Latest release: 5 months ago - 1 dependent repositories - 1.27 thousand downloads last month - 900 stars on GitHub - 1 maintainer
mobio-dedupe-sdk-test 1.0.33
Mobio dedupe SDK31 versions - Latest release: 6 months ago - 67 downloads last month - 1 maintainer
oagdedupe 0.2.1
oagdedupe is a Python library for scalable entity resolution, using active learning to learn bloc...3 versions - Latest release: over 1 year ago - 10 downloads last month - 2 stars on GitHub - 1 maintainer
dedupe-fork-eccovia 2.0.13
A python library for accurate and scaleable data deduplication and entity-resolution2 versions - Latest release: over 1 year ago - 41 downloads last month - 3,987 stars on GitHub - 1 maintainer
dedupe-fh 1.9.7
A python library for accurate and scaleable data deduplication and entity-resolution1 version - Latest release: almost 5 years ago - 1 dependent repositories - 32 downloads last month - 394 stars on GitHub - 1 maintainer
Top 9.5% on pypi.org
28 versions - Latest release: almost 5 years ago - 3 dependent repositories - 227 downloads last month - 334 stars on GitHub - 1 maintainer
imgdupes 0.1.1
CLI tool to dedupe images based on perceptual hash28 versions - Latest release: almost 5 years ago - 3 dependent repositories - 227 downloads last month - 334 stars on GitHub - 1 maintainer
csvdedupe2 0.2.7
Command line tools for deduplicating and merging csv files7 versions - Latest release: about 2 years ago - 1 dependent repositories - 69 downloads last month - 401 stars on GitHub - 1 maintainer
Top 9.7% on pypi.org
20 versions - Latest release: over 4 years ago - 7 dependent repositories - 191 downloads last month - 401 stars on GitHub - 2 maintainers
csvdedupe 0.1.20
Command line tools for deduplicating and merging csv files20 versions - Latest release: over 4 years ago - 7 dependent repositories - 191 downloads last month - 401 stars on GitHub - 2 maintainers
superdeduper 0.1.7
A simple interface to datamade/dedupe to make probabilistic record linkage easy.7 versions - Latest release: about 7 years ago - 1 dependent repositories - 73 downloads last month - 42 stars on GitHub - 1 maintainer
pgdedupe 0.2.1
A simple interface to datamade/dedupe to make probabilistic record linkage easy.2 versions - Latest release: about 7 years ago - 1 dependent repositories - 26 downloads last month - 42 stars on GitHub - 1 maintainer
Related Keywords
python
10
record-linkage
9
entity-resolution
8
deduplication
7
cli
4
python-library
3
deduplicate
3
cleanup
2
mbox
2
maildir
2
email
2
mail
2
CLI
2
dedupe-library
2
de-duplicating
2
datamade
2
clustering
2
mailbox
2
postgresql
2
machine-learning
2
database
2
record linkage
2
babyl
2
data-cleaning
2
mh
2
mmdf
2
csv-files
2
identity
1
identity-resolution
1
masterdata
1
fuzzymatch
1
fuzzy-matching
1
etl
1
dataquality
1
datalake
1
dataengineering
1
data-transformations
1
data-transformation
1
backup
1
ml
1
modern-data-stack
1
spark
1
mobio
1
contact spam
1
entity resolution
1
blocking
1
image
1
perceptual
1
hash
1
perceptual-hashes
1
perceptual-hashing
1
superdeduper
1
pgdedupe
1
borgbackup
1
c
1
compression
1
cython
1
encryption
1
python-3
1
ssh
1
django
1
data-matching
1
privacy
1
similarity
1
string-distance
1
utrecht-university
1
Babyl
1
MH
1
MMDF
1
dedup
1
duplicate-files
1
duplicatefilefinder
1
duplicates
1
remove-duplicate-files
1
remove-duplicates
1
search-duplicates
1
Entity Resolution
1
data mastering
1
identity resolution
1
analytics
1
analytics-engineering
1
data-science
1