Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "duplicate-detection" keyword

deplicate 1.2.3
Advanced Duplicate File Finder for Python. Nothing is impossible to solve.
11 versions - Latest release: over 6 years ago - 1 dependent repositories - 86 downloads last month - 76 stars on GitHub - 1 maintainer
findsame 0.1.2
Find duplicate files and directories using hashes and a Merkle tree
3 versions - Latest release: over 4 years ago - 1 dependent repositories - 21 downloads last month - 4 stars on GitHub - 1 maintainer
py-image-dedup 2.0.0 💰
A library to find duplicate images and delete unwanted ones
2 versions - Latest release: over 3 years ago - 1 dependent repositories - 61 downloads last month - 149 stars on GitHub - 1 maintainer
audiomatch 0.2.2
A small command-line tool to find similar audio files
11 versions - Latest release: over 1 year ago - 1 dependent repositories - 123 downloads last month - 135 stars on GitHub - 1 maintainer
imgdups 0.1.6
Very fast two folder image duplicate finder programmed with pickle and cv2
5 versions - Latest release: 6 months ago - 45 downloads last month - 2 stars on GitHub - 1 maintainer
imageduplicatefinder 0.6.0
Simple duplication finder for Images, matches on names and then compares image hashes.
1 version - Latest release: almost 2 years ago - 12 downloads last month - 0 stars on GitHub - 1 maintainer
demystify-digipres 2.0.0 💰
engine for the analysis of DROID and Siegfried file format reports
7 versions - Latest release: 18 days ago - 1 dependent repositories - 132 downloads last month - 23 stars on GitHub - 1 maintainer
pathlesstaken 1.0.2rc1 💰
Library and executable for identifying anomalous file path strings
1 version - Latest release: almost 2 years ago - 1 dependent repositories - 16 downloads last month - 23 stars on GitHub - 1 maintainer
Top 9.7% on pypi.org
videohash 3.0.1
Python package for Near Duplicate Video Detection (Perceptual Video Hashing) - Get a 64-bit compa...
24 versions - Latest release: almost 2 years ago - 1 dependent repositories - 3.46 thousand downloads last month - 250 stars on GitHub - 1 maintainer
thebear 0.6.0
Bear - the decluttering deduplicator
6 versions - Latest release: over 4 years ago - 1 dependent repositories - 61 downloads last month - 4 stars on GitHub - 1 maintainer
simages 23.0.7
Find similar images in a dataset
17 versions - Latest release: 11 months ago - 1 dependent repositories - 129 downloads last month - 20 stars on GitHub - 1 maintainer
duplicate-finder 1.4.0
Package to find duplicate files in and across folders
5 versions - Latest release: over 3 years ago - 1 dependent repositories - 134 downloads last month - 22 stars on GitHub - 1 maintainer
dupe-eraser 1.0.0
Command-line tools to automate the deletion of duplicates files.
1 version - Latest release: almost 7 years ago - 1 dependent repositories - 10 downloads last month - 12 stars on GitHub - 1 maintainer
doc2term 0.1
A fast NLP tokenizer that detects tokens and remove duplications and punctuations
1 version - Latest release: about 3 years ago - 1 dependent repositories - 8 downloads last month - 1 stars on GitHub - 1 maintainer
dhunter 1.3.0
The hunter - fast and easy file duplicate finder utility.
4 versions - Latest release: almost 5 years ago - 1 dependent repositories - 49 downloads last month - 0 stars on GitHub - 1 maintainer
pyjedai 0.1.7
An open-source library that builds powerful end-to-end Entity Resolution workflows.
16 versions - Latest release: 30 days ago - 311 downloads last month - 62 stars on GitHub - 2 maintainers
dupeutil 1.0.0
A command-line program written in Python for detecting and removing duplicate files
1 version - Latest release: about 2 years ago - 1 dependent repositories - 14 downloads last month - 0 stars on GitHub - 1 maintainer
er-evaluation 2.3.0 💰
An End-to-End Evaluation Framework for Entity Resolution Systems.
9 versions - Latest release: 6 months ago - 1 dependent package - 1 dependent repositories - 103 downloads last month - 9 stars on GitHub - 1 maintainer
searchdups 1.0.0
Searches for duplicate files in folders (recursively, if needed)
1 version - Latest release: over 1 year ago - 16 downloads last month - 0 stars on GitHub - 1 maintainer
pydupes 0.6.1
A duplicate file finder that may be faster in environments with millions of files and terabytes o...
10 versions - Latest release: over 2 years ago - 1 dependent repositories - 57 downloads last month - 3 stars on GitHub - 1 maintainer
Related Keywords
python 10 duplicates 8 duplicate 4 duplicate-files 4 deduplication 4 files 3 duplicatefilefinder 3 command-line-tool 2 commandline 2 duplicate-images 2 pronom 2 fuzzy-matching 2 format-analysis 2 digital-preservation 2 digipres 2 python3 2 collection-profiling 2 entity-resolution 2 code4lib 2 command-line 2 archives 2 images 2 cli 2 text-processing 1 audio 1 standarization 1 punctuation 1 NLP 1 ndvr 1 near-duplicate-video 1 near-duplicate-video-clip-detection 1 video-deduplication 1 video-similarity-search 1 visual-claim 1 cli-app 1 clutterremoval 1 photos 1 preprocessing 1 similar data 1 autoencoder 1 similarity-detection 1 file-management 1 tokenizer 1 duplication 1 python-script 1 command 1 sysadmin 1 command line 1 statistics 1 record-linkage 1 ml-testing 1 ml-evaluation 1 matching 1 inventor-name-disambiguation 1 evaluation 1 disambiguation 1 data-science 1 author-name-disambiguation 1 er_evaluation 1 dupeutil 1 machine-learning 1 entity-matching 1 data-matching 1 data-disambigation 1 link-discovery 1 deduplicate 1 file 1 nlp-library 1 nlp 1 doc2term 1 text-tokenizing 1 detection 1 python-3 1 image-comparison 1 image-analysis 1 hacktoberfest 1 find-duplicates 1 dedup 1 merkletree 1 file-hashing 1 multiprocessing 1 multithreading 1 hash 1 merkle-tree 1 windows 1 unix 1 scanning 1 pypi 1 purge-duplicate-files 1 multi-filtering 1 macosx 1 finder 1 duplication-finder 1 duplicates-removed 1 deplicate 1 dups 1 duplicatefinder 1 ndvd 1 find-similar-videos-by-content 1 ffmpeg 1