Ecosyste.ms: Packages
An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.
pypi.org "data-matching" keyword
entity-embed 0.0.6
Transform entities like companies, products, etc. into vectors to support scalable Record Linkage...6 versions - Latest release: almost 3 years ago - 1 dependent repositories - 58 downloads last month - 139 stars on GitHub - 1 maintainer
Top 1.9% on pypi.org
23 versions - Latest release: about 5 years ago - 6 dependent packages - 59 dependent repositories - 1.61 million downloads last month - 913 stars on GitHub - 1 maintainer
recordlinkage 0.13.2 💰
A record linkage toolkit for linking and deduplication23 versions - Latest release: about 5 years ago - 6 dependent packages - 59 dependent repositories - 1.61 million downloads last month - 913 stars on GitHub - 1 maintainer
Top 3.9% on pypi.org
130 versions - Latest release: about 2 months ago - 3 dependent packages - 4 dependent repositories - 104 thousand downloads last month - 1,072 stars on GitHub - 4 maintainers
splink 3.9.14
Fast probabilistic data linkage at scale130 versions - Latest release: about 2 months ago - 3 dependent packages - 4 dependent repositories - 104 thousand downloads last month - 1,072 stars on GitHub - 4 maintainers
Top 5.0% on pypi.org
5 versions - Latest release: almost 2 years ago - 13 dependent repositories - 159 thousand downloads last month - 280 stars on GitHub - 1 maintainer
fuzzymatcher 0.0.6
Fuzzy match two pandas dataframes based on one or more common fields5 versions - Latest release: almost 2 years ago - 13 dependent repositories - 159 thousand downloads last month - 280 stars on GitHub - 1 maintainer
textmatch 1.0.1
Find fuzzy matches between datasets.2 versions - Latest release: 4 months ago - 1 dependent package - 50 downloads last month - 7 stars on GitHub - 1 maintainer
Top 6.3% on pypi.org
26 versions - Latest release: 4 months ago - 11 dependent repositories - 7.47 thousand downloads last month - 175 stars on GitHub - 1 maintainer
csvmatch 2.0.1
Find fuzzy matches between CSV files.26 versions - Latest release: 4 months ago - 11 dependent repositories - 7.47 thousand downloads last month - 175 stars on GitHub - 1 maintainer
pyjedai 0.1.7
An open-source library that builds powerful end-to-end Entity Resolution workflows.16 versions - Latest release: 30 days ago - 311 downloads last month - 62 stars on GitHub - 2 maintainers
Related Keywords
entity-resolution
6
fuzzy-matching
5
record-linkage
5
deduplication
4
python
3
entity-matching
2
machine-learning
2
duckdb
1
em-algorithm
1
spark
1
uk-gov-data-science
1
matching
1
fuzzy
1
probabalistic
1
recordlinking
1
fuzzymatching
1
probabalistic-matching
1
pypi
1
csv
1
link-discovery
1
data-disambigation
1
duplicate-detection
1
deduplicate-data
1
data-science
1
utrecht-university
1
string-distance
1
similarity
1
python-library
1
privacy
1
dedupe
1
representation-learning
1
pytorch
1
embeddings
1
deep-learning
1
approximate-nearest-neighbors
1
embedding
1
entity resolution
1
record linkage
1