Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

conda-forge.org : pyminhash : 0.1.3

MinHashing is a very efficient way of finding similar records in a dataset based on Jaccard similarity. PyMinHash implements efficient minhashing for Pandas dataframes. See the instructions below or look at the example notebook to get started. Developed by [Frits Hermans](https://www.linkedin.com/in/frits-hermans-data-scientist/). PyPI: [https://pypi.org/project/PyMinHash/](https://pypi.org/project/PyMinHash/)
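To make the link between MinHashing and Jaccard similarity concrete, here is a minimal pure-Python sketch of the general technique. It is not PyMinHash's implementation and does not use its API; the function names (`shingles`, `jaccard`, `minhash_signature`, `estimate_similarity`), the character 3-gram tokenisation, and the choice of 128 salted BLAKE2b hashes are all assumptions made for illustration.

```python
import hashlib


def shingles(text, n=3):
    """Return the set of character n-grams of a lower-cased string (assumed tokenisation)."""
    text = text.lower()
    return {text[i:i + n] for i in range(max(len(text) - n + 1, 1))}


def jaccard(a, b):
    """Exact Jaccard similarity |A & B| / |A | B| of two sets."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def _hash(item, seed):
    """64-bit hash of a string; the seed (used as a salt) selects the hash function."""
    salt = seed.to_bytes(8, "little")
    digest = hashlib.blake2b(item.encode("utf-8"), digest_size=8, salt=salt).digest()
    return int.from_bytes(digest, "little")


def minhash_signature(items, num_hashes=128):
    """For each of num_hashes hash functions, keep the minimum hash over the set."""
    return [min(_hash(item, seed) for item in items) for seed in range(num_hashes)]


def estimate_similarity(sig_a, sig_b):
    """Fraction of signature positions that agree, an estimate of Jaccard similarity."""
    matches = sum(1 for x, y in zip(sig_a, sig_b) if x == y)
    return matches / len(sig_a)


if __name__ == "__main__":
    a = shingles("Richard Feynman, Pasadena")
    b = shingles("Richard P. Feynman, Pasadena CA")
    print("exact Jaccard:   ", jaccard(a, b))
    print("MinHash estimate:", estimate_similarity(minhash_signature(a), minhash_signature(b)))
```

For any single hash function, the probability that two sets share the same minimum hash equals their Jaccard similarity, so the fraction of matching signature positions approximates it. The efficiency comes from comparing short fixed-length signatures instead of full sets; the estimate becomes tighter as the number of hash functions grows.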

purl: pkg:conda/pyminhash@0.1.3
Related tag: v0.1.3
