Ecosyste.ms: Packages
An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.
pypi.org "duplicates" keyword
listset 0.1.0
remove duplicates from lists1 version - Latest release: almost 6 years ago - 1 dependent repositories - 17 downloads last month - 0 stars on GitHub - 1 maintainer
removedupes 1.0.2
Remove all duplicate files from a directory, fast3 versions - Latest release: about 6 years ago - 1 dependent repositories - 23 downloads last month - 0 stars on GitHub - 1 maintainer
arrayhascher 0.11
Fast hash in 2D Arrays (Numpy/Pandas/lists/tuples)2 versions - Latest release: 5 months ago - 2 dependent packages - 33 downloads last month - 1 stars on GitHub - 1 maintainer
duplicateindexer 0.10
Find duplicates in multiple lists and return their indices and values.1 version - Latest release: 8 months ago - 9 downloads last month - 0 stars on GitHub - 1 maintainer
stridesduplicatefinder 0.10
Calculate overlapping values between two arrays and return the results as a DataFrame1 version - Latest release: 8 months ago - 18 downloads last month - 0 stars on GitHub - 1 maintainer
screwduplicates 0.11
provides a simple and efficient way to remove duplicates from an iterable (even with unhashable e...2 versions - Latest release: 11 months ago - 18 downloads last month - 0 stars on GitHub - 1 maintainer
tagfile 0.1.0
Search, index and tag your files and find duplicates1 version - Latest release: about 1 year ago - 15 downloads last month - 3 stars on GitHub - 1 maintainer
drop-duplicates-nested-list 0.10
Drops duplicates from nested list1 version - Latest release: over 1 year ago - 7 downloads last month - 0 stars on GitHub - 1 maintainer
a-pandas-ex-duplicates-to-df 0.10
Creates a DataFrame/Series from duplicates1 version - Latest release: over 1 year ago - 12 downloads last month - 0 stars on GitHub - 1 maintainer
imageduplicatefinder 0.6.0
Simple duplication finder for Images, matches on names and then compares image hashes.1 version - Latest release: almost 2 years ago - 12 downloads last month - 0 stars on GitHub - 1 maintainer
find-duplicate-contacts 0.0.3
Find duplicate contacts in vCard files1 version - Latest release: almost 2 years ago - 12 downloads last month - 0 stars on GitHub - 1 maintainer
yadupe 1.1.0
Recursively scan one or more given directories for duplicate files.4 versions - Latest release: over 2 years ago - 1 dependent repositories - 32 downloads last month - 0 stars on GitHub - 1 maintainer
umitools 0.3.4
A toolset for handling sequencing data with unique molecular identifiers (UMIs)3 versions - Latest release: about 6 years ago - 2 dependent repositories - 40 downloads last month - 12 stars on GitHub - 1 maintainer
thebear 0.6.0
Bear - the decluttering deduplicator6 versions - Latest release: over 4 years ago - 1 dependent repositories - 61 downloads last month - 4 stars on GitHub - 1 maintainer
simages 23.0.7
Find similar images in a dataset17 versions - Latest release: 11 months ago - 1 dependent repositories - 129 downloads last month - 20 stars on GitHub - 1 maintainer
pyunique 0.0.10 💰
Pyunique helps you get rid of duplicate files10 versions - Latest release: about 1 year ago - 1 dependent repositories - 64 downloads last month - 0 stars on GitHub - 1 maintainer
Top 8.6% on pypi.org
2 versions - Latest release: 9 months ago - 2 dependent repositories - 300 downloads last month - 237 stars on GitHub - 2 maintainers
python_hashes 0.1dev
Library of interesting (non-cryptographic) hashes in pure Python.2 versions - Latest release: 9 months ago - 2 dependent repositories - 300 downloads last month - 237 stars on GitHub - 2 maintainers
hashdb2 1.0
HashDb2 provides a simple method for executing commands based on matched files3 versions - Latest release: about 7 years ago - 1 dependent repositories - 13 downloads last month - 1 stars on GitHub - 1 maintainer
findsame 0.1.2
Find duplicate files and directories using hashes and a Merkle tree3 versions - Latest release: over 4 years ago - 1 dependent repositories - 26 downloads last month - 4 stars on GitHub - 1 maintainer
findd 0.9.3
Find duplicate files, based on size and hashvalues.13 versions - Latest release: about 3 years ago - 2 dependent repositories - 86 downloads last month - 3 stars on GitHub - 1 maintainer
fdedup 0.0.8
Command line tool to find file duplicates.7 versions - Latest release: over 9 years ago - 2 dependent repositories - 23 downloads last month - 2 maintainers
dupe-eraser 1.0.0
Command-line tools to automate the deletion of duplicates files.1 version - Latest release: almost 7 years ago - 1 dependent repositories - 10 downloads last month - 12 stars on GitHub - 1 maintainer
dhunter 1.3.0
The hunter - fast and easy file duplicate finder utility.4 versions - Latest release: almost 5 years ago - 1 dependent repositories - 49 downloads last month - 0 stars on GitHub - 1 maintainer
deplicate-cli 0.1.1
Command Line Interface for deplicate.2 versions - Latest release: almost 7 years ago - 1 dependent repositories - 38 downloads last month - 1 stars on GitHub - 1 maintainer
deplicate 1.2.3
Advanced Duplicate File Finder for Python. Nothing is impossible to solve.11 versions - Latest release: over 6 years ago - 1 dependent repositories - 104 downloads last month - 76 stars on GitHub - 1 maintainer
deduplicator 1.0.0
Rapid file deduplication utility for Unix systems1 version - Latest release: about 6 years ago - 1 dependent repositories - 15 downloads last month - 3 stars on GitHub - 1 maintainer
company-name-matching 0.4.3
Returns a score of 2 companies to be the same40 versions - Latest release: over 2 years ago - 391 downloads last month - 1 maintainer
company-name-matching2 0.0.6
Returns a score of 2 companies to be the same4 versions - Latest release: over 2 years ago - 49 downloads last month - 1 maintainer
closely 19.0.2
Closely find closest pairs of points, eg duplicates, in a dataset7 versions - Latest release: almost 5 years ago - 1 dependent package - 4 dependent repositories - 105 downloads last month - 1 stars on GitHub - 1 maintainer
removedup 1.0.6
Remove duplicates from parallel corpora7 versions - Latest release: 5 months ago - 1.52 thousand downloads last month - 5 stars on GitHub - 1 maintainer
dropduplicatesplanb 0.11
Drops duplicates in DataFrames with tedious dtypes2 versions - Latest release: 4 months ago - 21 downloads last month - 0 stars on GitHub - 1 maintainer
citex 0.2.2
Tools to manage large BibTex libraries9 versions - Latest release: over 7 years ago - 81 downloads last month - 4 stars on GitHub - 1 maintainer
searchdups 1.0.0
Searches for duplicate files in folders (recursively, if needed)1 version - Latest release: over 1 year ago - 16 downloads last month - 0 stars on GitHub - 1 maintainer
Related Keywords
python
10
duplicate-detection
8
hash
6
duplicatefilefinder
4
duplicate-files
4
list
4
files
4
duplicate
4
cli
3
find
3
matching
3
remove
3
cleaning
2
drop
2
dataframe
2
names
2
filter
2
companies
2
dups
2
deduplication
2
file
2
python3
2
nested
2
locate
2
fast
2
numpy
2
similar data
2
deduplicate
2
set
2
preprocessing
1
deplicate
1
duplicatefinder
1
duplicates-removed
1
duplication-finder
1
file-management
1
finder
1
macosx
1
autoencoder
1
similarity-detection
1
bloom-filter
1
geohashes
1
hash-functions
1
hashes-implemented
1
nilsimsa
1
simhash
1
comparison
1
same
1
identical
1
merkle-tree
1
multithreading
1
multiprocessing
1
file-hashing
1
merkletree
1
database-assisted
1
python-script
1
commandline
1
command-line-tool
1
command-line
1
command
1
sysadmin
1
command line
1
reference
1
citation
1
cite
1
latex
1
bibtex
1
endnote
1
dataframes
1
llm
1
clean
1
similarity
1
points
1
neighbors
1
closest-pair-of-points
1
closest pairs
1
geometry
1
mathematics
1
backup
1
hardlinks
1
windows
1
unix
1
scanning
1
pypi
1
purge-duplicate-files
1
multi-filtering
1
listset
1
checksum
1
database
1
harddisk
1
space
1
pandas
1
DataFrame
1
Series
1
series
1
image
1
duplicate-images
1
deletion
1
imagehash
1
remover
1
duplication-detection
1