An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "duplicate-detection" keyword

View the packages on the pypi.org package registry that are tagged with the "duplicate-detection" keyword.

Top 2.4% on pypi.org
nomic 3.4.1
The official Nomic python client.
132 versions - Latest release: 3 months ago - 32 dependent packages - 761 dependent repositories - 28.8 thousand downloads last month - 1,634 stars on GitHub - 1 maintainer
dupe-eraser 1.0.0
Command-line tools to automate the deletion of duplicate files.
1 version - Latest release: almost 8 years ago - 1 dependent repository - 38 downloads last month - 13 stars on GitHub - 1 maintainer
pydupes 0.6.1
A duplicate file finder that may be faster in environments with millions of files and terabytes o...
10 versions - Latest release: over 3 years ago - 1 dependent repository - 310 downloads last month - 3 stars on GitHub - 1 maintainer
ytmusic-deleter 3.1.0 💰
Easily delete your YouTube Music library
70 versions - Latest release: 3 days ago - 1 dependent repository - 1.27 thousand downloads last month - 135 stars on GitHub - 1 maintainer
er-evaluation 2.3.0 💰
An End-to-End Evaluation Framework for Entity Resolution Systems.
9 versions - Latest release: over 1 year ago - 1 dependent package - 1 dependent repository - 370 downloads last month - 9 stars on GitHub - 1 maintainer
imageduplicatefinder 0.6.0
Simple duplicate finder for images; matches on names and then compares image hashes.
1 version - Latest release: over 2 years ago - 57 downloads last month - 0 stars on GitHub - 1 maintainer
findsame 0.1.2
Find duplicate files and directories using hashes and a Merkle tree
3 versions - Latest release: over 5 years ago - 1 dependent repository - 142 downloads last month - 4 stars on GitHub - 1 maintainer
py-image-dedup 2.0.0 💰
A library to find duplicate images and delete unwanted ones
2 versions - Latest release: over 4 years ago - 1 dependent repository - 113 downloads last month - 150 stars on GitHub - 1 maintainer
doc2term 0.1
A fast NLP tokenizer that detects tokens and removes duplicates and punctuation
1 version - Latest release: almost 4 years ago - 1 dependent repository - 33 downloads last month - 2 stars on GitHub - 1 maintainer
thebear 0.6.0
Bear - the decluttering deduplicator
6 versions - Latest release: over 5 years ago - 1 dependent repository - 293 downloads last month - 4 stars on GitHub - 1 maintainer
audiomatch 0.2.2
A small command-line tool to find similar audio files
11 versions - Latest release: over 2 years ago - 1 dependent repository - 443 downloads last month - 147 stars on GitHub - 1 maintainer
pyjedai 0.2.4
An open-source library that builds powerful end-to-end Entity Resolution workflows.
23 versions - Latest release: 18 days ago - 1.06 thousand downloads last month - 76 stars on GitHub - 3 maintainers
Top 9.7% on pypi.org
videohash 3.0.1
Python package for Near Duplicate Video Detection (Perceptual Video Hashing) - Get a 64-bit compa...
24 versions - Latest release: almost 3 years ago - 1 dependent repository - 22.5 thousand downloads last month - 305 stars on GitHub - 1 maintainer
simages 23.0.7
Find similar images in a dataset
17 versions - Latest release: almost 2 years ago - 1 dependent repository - 375 downloads last month - 23 stars on GitHub - 1 maintainer
deplicate 1.2.3
Advanced Duplicate File Finder for Python. Nothing is impossible to solve.
11 versions - Latest release: over 7 years ago - 1 dependent repository - 496 downloads last month - 77 stars on GitHub - 1 maintainer
dhunter 1.3.0
The hunter - fast and easy file duplicate finder utility.
4 versions - Latest release: over 5 years ago - 1 dependent repository - 170 downloads last month - 0 stars on GitHub - 1 maintainer
imgdups 0.1.6
Very fast two-folder image duplicate finder programmed with pickle and cv2
5 versions - Latest release: over 1 year ago - 177 downloads last month - 2 stars on GitHub - 1 maintainer
searchdups 1.0.0
Searches for duplicate files in folders (recursively, if needed)
1 version - Latest release: about 2 years ago - 30 downloads last month - 0 stars on GitHub - 1 maintainer
pathlesstaken 1.0.2rc1 💰
Library and executable for identifying anomalous file path strings
1 version - Latest release: almost 3 years ago - 1 dependent repository - 50 downloads last month - 25 stars on GitHub - 1 maintainer
demystify-digipres 2.0.0 💰
Engine for the analysis of DROID and Siegfried file format reports
7 versions - Latest release: 12 months ago - 1 dependent repository - 191 downloads last month - 25 stars on GitHub - 1 maintainer
dupeutil 1.0.0
A command-line program written in Python for detecting and removing duplicate files
1 version - Latest release: about 3 years ago - 1 dependent repository - 50 downloads last month - 0 stars on GitHub - 1 maintainer
duplicate-finder 1.4.0
Package to find duplicate files in and across folders
5 versions - Latest release: over 4 years ago - 1 dependent repository - 302 downloads last month - 25 stars on GitHub - 1 maintainer
Related Keywords
python (12), duplicates (8), duplicate-files (4), deduplication (4), duplicate (4), files (3), duplicatefilefinder (3), fuzzy-matching (2), duplicate-images (2), entity-resolution (2), python3 (2), command-line (2), command-line-tool (2), commandline (2), digital-preservation (2), archives (2), code4lib (2), cli (2), collection-profiling (2), images (2), pronom (2), digipres (2), format-analysis (2), duplicate-video-finder (1), duplicate-videos (1), ffmpeg (1), find-similar-videos-by-content (1), ndvd (1), ndvr (1), near-duplicate-video (1), near-duplicate-video-clip-detection (1), video-deduplication (1), video-similarity-search (1), visual-claim (1), video diff (1), video (1), compare videos (1), near duplicate video (1), video hashing (1), perceptual video hashing (1), NDVD (1), near duplicate video detection (1), videohash (1), machine-learning (1), entity-matching (1), data-matching (1), data-disambigation (1), link-discovery (1), dupeutil (1), filename-analysis (1), string-analysis (1), file-analysis (1), python-script (1), command (1), sysadmin (1), command line (1), deduplicate (1), file (1), windows (1), unix (1), scanning (1), pypi (1), purge-duplicate-files (1), multi-filtering (1), macosx (1), finder (1), duplication-finder (1), duplicates-removed (1), deplicate (1), dups (1), duplicatefinder (1), similarity-detection (1), autoencoder (1), similar data (1), preprocessing (1), photos (1), remover (1), imagehash (1), deletion (1), image (1), statistics (1), record-linkage (1), ml-testing (1), ml-evaluation (1), matching (1), inventor-name-disambiguation (1), evaluation (1), disambiguation (1), data-science (1), author-name-disambiguation (1), er_evaluation (1), youtubemusic (1), youtube (1), sort (1), playlists (1), playlist-manager (1), music (1), library (1), delete (1), click (1)