proxy.golang.org : github.com/minishlab/semhash
Fast Semantic Text Deduplication & Filtering
Registry
-
Source
- Documentation
- JSON
- codemeta.json
purl: pkg:golang/github.com/minishlab/semhash
Keywords:
datasets
, deduplication
, model2vec
, preprocessing
, semantic-deduplication
, vicinity
License: MIT
Latest release: 3 months ago
First release: about 1 year ago
Stars: 810 on GitHub
Forks: 50 on GitHub
See more repository details: repos.ecosyste.ms
Last synced: 23 days ago