Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

conda-forge.org : pyminhash

MinHashing is an efficient way to find similar records in a dataset based on Jaccard similarity. PyMinHash implements minhashing for pandas DataFrames. See the project README or the example notebook to get started. Developed by [Frits Hermans](https://www.linkedin.com/in/frits-hermans-data-scientist/). PyPI: [https://pypi.org/project/PyMinHash/](https://pypi.org/project/PyMinHash/)
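To illustrate the idea behind the description above, here is a minimal sketch of MinHash-based Jaccard estimation using only the Python standard library. It is not PyMinHash's implementation or API. The function names (`shingles`, `jaccard`, `minhash_signature`, `estimate_jaccard`), the choice of character 3-grams, and the use of salted BLAKE2b as the hash family are all assumptions made for this example.

```python
import hashlib


def shingles(text, k=3):
    """Return the set of character k-grams of a lowercased string (illustrative choice)."""
    text = text.lower()
    if len(text) < k:
        return {text}
    return {text[i:i + k] for i in range(len(text) - k + 1)}


def jaccard(a, b):
    """Exact Jaccard similarity: |A intersect B| / |A union B|."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def _hash(seed, token):
    """One member of a hash family, selected by `seed` (assumed: salted BLAKE2b)."""
    digest = hashlib.blake2b(
        token.encode("utf-8"),
        digest_size=8,
        salt=seed.to_bytes(8, "little"),
    ).digest()
    return int.from_bytes(digest, "little")


def minhash_signature(tokens, n_hashes=128):
    """For each hash function, keep the minimum hash value over all tokens."""
    return [min(_hash(seed, t) for t in tokens) for seed in range(n_hashes)]


def estimate_jaccard(sig_a, sig_b):
    """The fraction of matching signature positions estimates the Jaccard similarity."""
    matches = sum(x == y for x, y in zip(sig_a, sig_b))
    return matches / len(sig_a)


# Compare two near-duplicate records.
a = shingles("Frits Hermans")
b = shingles("Fritz Hermans")
exact = jaccard(a, b)
approx = estimate_jaccard(minhash_signature(a), minhash_signature(b))
print(f"exact={exact:.3f} estimated={approx:.3f}")
```

The efficiency gain comes from comparing short fixed-length signatures instead of full token sets. A practical implementation would typically also group signatures into bands so that only likely matches are compared, which this sketch omits.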

purl: pkg:conda/pyminhash
License: MIT
Latest release: about 2 years ago
First release: over 2 years ago
Dependent packages: 1
Stars: 8 on GitHub
Forks: 3 on GitHub
See more repository details: repos.ecosyste.ms
Last synced: 19 days ago

deduplipy 0.7.9
4 versions - Latest release: about 2 years ago - 64 stars on GitHub