Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "entity-resolution" keyword

pvrhinodemo 3.0.2
Rhino Speech-to-Intent engine demos.
35 versions - Latest release: 4 months ago - 1 dependent repository - 362 downloads last month - 593 stars on GitHub - 1 maintainer
Top 5.3% on pypi.org
pvrhino 3.0.2
Rhino Speech-to-Intent engine.
25 versions - Latest release: 4 months ago - 2 dependent packages - 3 dependent repositories - 1.54 thousand downloads last month - 593 stars on GitHub - 1 maintainer
dedupe-fh 1.9.7
A Python library for accurate and scalable data deduplication and entity-resolution
1 version - Latest release: almost 5 years ago - 1 dependent repository - 32 downloads last month - 394 stars on GitHub - 1 maintainer
Top 8.8% on pypi.org
zingg 0.4.0
Zingg Entity Resolution, Data Mastering and Deduplication
3 versions - Latest release: 5 months ago - 1 dependent repository - 1.15 thousand downloads last month - 890 stars on GitHub - 1 maintainer
mismo 0.1.0
The SQL/Ibis powered sklearn of record linkage
1 version - Latest release: 11 months ago - 18 downloads last month - 11 stars on GitHub - 1 maintainer
deduplipy 0.7.10
End-to-end deduplication solution
23 versions - Latest release: about 1 year ago - 1 dependent repository - 372 downloads last month - 71 stars on GitHub - 1 maintainer
textmatch 1.0.1
Find fuzzy matches between datasets.
2 versions - Latest release: 3 months ago - 1 dependent package - 50 downloads last month - 7 stars on GitHub - 1 maintainer
erexplain 0.1.2
Library to explain models for Entity Resolution
2 versions - Latest release: almost 4 years ago - 1 dependent repository - 25 downloads last month - 2 stars on GitHub - 1 maintainer
Top 3.9% on pypi.org
splink 3.9.14
Fast probabilistic data linkage at scale
128 versions - Latest release: about 2 months ago - 3 dependent packages - 4 dependent repositories - 104 thousand downloads last month - 1,072 stars on GitHub - 4 maintainers
Top 1.6% on pypi.org
dedupe 2.0.23
A Python library for accurate and scalable data deduplication and entity-resolution
174 versions - Latest release: about 1 year ago - 7 dependent packages - 132 dependent repositories - 74.7 thousand downloads last month - 3,987 stars on GitHub - 2 maintainers
dedupe-fork-eccovia 2.0.13
A Python library for accurate and scalable data deduplication and entity-resolution
2 versions - Latest release: over 1 year ago - 54 downloads last month - 3,987 stars on GitHub - 1 maintainer
Top 5.4% on pypi.org
rltk 2.0.0a20
Record Linkage ToolKit
20 versions - Latest release: over 2 years ago - 4 dependent packages - 9 dependent repositories - 330 downloads last month - 103 stars on GitHub - 1 maintainer
Top 1.9% on pypi.org
recordlinkage 0.13.2 💰
A record linkage toolkit for linking and deduplication
23 versions - Latest release: about 5 years ago - 6 dependent packages - 59 dependent repositories - 1.14 million downloads last month - 909 stars on GitHub - 1 maintainer
moviegraphbenchmark 1.1.0
Benchmark datasets for Entity Resolution on Knowledge Graphs containing information about movies,...
4 versions - Latest release: 2 months ago - 1 dependent package - 1 dependent repository - 40 downloads last month - 7 stars on GitHub - 1 maintainer
oasis 0.1.3
Optimal Asymptotic Sequential Importance Sampling
2 versions - Latest release: almost 3 years ago - 1 dependent package - 3 dependent repositories - 66 downloads last month - 14 stars on GitHub - 1 maintainer
eche 0.2.1
Little helper for handling entity clusters
3 versions - Latest release: about 2 months ago - 1 dependent package - 489 downloads last month - 1 star on GitHub - 1 maintainer
csvdedupe2 0.2.7
Command line tools for deduplicating and merging csv files
7 versions - Latest release: about 2 years ago - 1 dependent repository - 69 downloads last month - 401 stars on GitHub - 1 maintainer
Top 9.7% on pypi.org
csvdedupe 0.1.20
Command line tools for deduplicating and merging csv files
20 versions - Latest release: about 4 years ago - 7 dependent repositories - 191 downloads last month - 401 stars on GitHub - 2 maintainers
nlu-by-samed 5.1.4
John Snow Labs NLU provides state of the art algorithms for NLP&NLU with 20000+ of pretrained mod...
2 versions - Latest release: 3 months ago - 17 downloads last month - 814 stars on GitHub - 1 maintainer
table-extractor-new 5.1.0
John Snow Labs NLU provides state of the art algorithms for NLP&NLU with 20000+ of pretrained mod...
1 version - Latest release: 5 months ago - 27 downloads last month - 814 stars on GitHub - 1 maintainer
nlu-ocr-shailesh 5.0.0
John Snow Labs NLU provides state of the art algorithms for NLP&NLU with 20000+ of pretrained mod...
1 version - Latest release: 7 months ago - 19 downloads last month - 814 stars on GitHub - 1 maintainer
johnsnowlabs-my-mehmet 4.4.25
The John Snow Labs Library gives you access to all of John Snow Labs Enterprise And Open Source p...
1 version - Latest release: 11 months ago - 22 downloads last month - 814 stars on GitHub - 1 maintainer
nlu-by-ckl 5.0.2rc1
John Snow Labs NLU provides state of the art algorithms for NLP&NLU with 20000+ of pretrained mod...
15 versions - Latest release: 7 months ago - 2 dependent packages - 140 downloads last month - 814 stars on GitHub - 1 maintainer
johnsnowlabs-by-kshitiz 5.0.1
The John Snow Labs Library gives you access to all of John Snow Labs Enterprise And Open Source p...
31 versions - Latest release: 9 months ago - 45 downloads last month - 814 stars on GitHub - 1 maintainer
linktransformer 0.1.14
A friendly way to do link, aggregate, cluster and de-duplicate dataframes using large language mo...
15 versions - Latest release: about 1 month ago - 484 downloads last month - 69 stars on GitHub - 2 maintainers
shailesh-text-gen 4.2.1
John Snow Labs NLU provides state of the art algorithms for NLP&NLU with 10000+ of pretrained mod...
1 version - Latest release: 10 months ago - 18 downloads last month - 814 stars on GitHub - 1 maintainer
shailesh-bart 4.2.1
John Snow Labs NLU provides state of the art algorithms for NLP&NLU with 10000+ of pretrained mod...
1 version - Latest release: 10 months ago - 20 downloads last month - 814 stars on GitHub - 1 maintainer
shailesh 4.2.1
John Snow Labs NLU provides state of the art algorithms for NLP&NLU with 10000+ of pretrained mod...
1 version - Latest release: 10 months ago - 18 downloads last month - 814 stars on GitHub - 1 maintainer
databricks-arc 0.1.18
ARC: data linking solution for Databricks with Splink
14 versions - Latest release: 7 months ago - 796 downloads last month - 33 stars on GitHub - 3 maintainers
johnsnowlabs-tmp 4.4.25
The John Snow Labs Library gives you access to all of John Snow Labs Enterprise And Open Source p...
13 versions - Latest release: 11 months ago - 176 downloads last month - 814 stars on GitHub - 1 maintainer
oagdedupe 0.2.1
oagdedupe is a Python library for scalable entity resolution, using active learning to learn bloc...
3 versions - Latest release: over 1 year ago - 19 downloads last month - 2 stars on GitHub - 1 maintainer
qlink 0.1a1
Entity Resolution and Record Linkage library
1 version - Latest release: almost 7 years ago - 1 dependent repository - 20 downloads last month - 7 stars on GitHub - 1 maintainer
nlu-spark23 1.1.1rc2
John Snow Labs NLU provides state of the art algorithms for NLP&NLU with hundreds of pretrained m...
1 version - Latest release: over 3 years ago - 1 dependent repository - 17 downloads last month - 814 stars on GitHub - 2 maintainers
forayer 0.4.4
First aid utilities for knowledge graph exploration with an entity centric approach
12 versions - Latest release: about 1 year ago - 1 dependent repository - 80 downloads last month - 6 stars on GitHub - 1 maintainer
deduper 0.0.7
OneFlorida De-duplication Software
9 versions - Latest release: about 6 years ago - 1 dependent repository - 89 downloads last month - 12 stars on GitHub - 1 maintainer
cymbology 0.2.3
Financial identifier validation.
3 versions - Latest release: over 6 years ago - 1 dependent repository - 1.33 thousand downloads last month - 13 stars on GitHub - 1 maintainer
Top 6.3% on pypi.org
csvmatch 2.0.1
Find fuzzy matches between CSV files.
26 versions - Latest release: 3 months ago - 11 dependent repositories - 7.47 thousand downloads last month - 175 stars on GitHub - 1 maintainer
johnsnowlabs-for-databricks-by-ckl 5.1.8rc16
The John Snow Labs Library gives you access to all of John Snow Labs Enterprise And Open Source p...
43 versions - Latest release: 6 months ago - 300 downloads last month - 814 stars on GitHub - 1 maintainer
johnsnowlabs-by-ckl 5.0.29
The John Snow Labs Library gives you access to all of John Snow Labs Enterprise And Open Source p...
49 versions - Latest release: 8 months ago - 300 downloads last month - 814 stars on GitHub - 1 maintainer
pyjedai 0.1.7
An open-source library that builds powerful end-to-end Entity Resolution workflows.
16 versions - Latest release: 22 days ago - 311 downloads last month - 62 stars on GitHub - 2 maintainers
johnsnowlabs-for-databricks 5.3.5
The John Snow Labs Library gives you access to all of John Snow Labs Enterprise And Open Source p...
135 versions - Latest release: 4 days ago - 2.28 thousand downloads last month - 818 stars on GitHub - 2 maintainers
Top 2.9% on pypi.org
nlu 5.3.1
John Snow Labs NLU provides state of the art algorithms for NLP&NLU with 20000+ of pretrained mod...
127 versions - Latest release: 16 days ago - 13 dependent packages - 8 dependent repositories - 18.2 thousand downloads last month - 814 stars on GitHub - 2 maintainers
aj-zsl-nlu 4.2.0
John Snow Labs NLU provides state of the art algorithms for NLP&NLU with 10000+ of pretrained mod...
1 version - Latest release: 10 months ago - 34 downloads last month - 814 stars on GitHub - 1 maintainer
sylloge 0.3.0
Small library to simplify collecting and loading of entity alignment benchmark datasets
5 versions - Latest release: about 2 months ago - 1 dependent repository - 46 downloads last month - 5 stars on GitHub - 1 maintainer
graphlet 0.1.1
Graphlet AI Knowledge Graph Factory
1 version - Latest release: almost 2 years ago - 12 downloads last month - 27 stars on GitHub - 1 maintainer
kiez 0.5.0
Hubness reduced nearest neighbor search for entity alignment with knowledge graph embeddings
13 versions - Latest release: 4 months ago - 1 dependent repository - 133 downloads last month - 22 stars on GitHub - 1 maintainer
merge-machine 0.1.5
A library for extreme fuzzy tabular data matching that relies on Elasticsearch
5 versions - Latest release: over 6 years ago - 1 dependent repository - 19 downloads last month - 35 stars on GitHub - 1 maintainer
Top 8.0% on pypi.org
anonlink 0.15.3
Anonymous linkage using cryptographic hashes and bloom filters
37 versions - Latest release: about 1 year ago - 1 dependent package - 3 dependent repositories - 4.09 thousand downloads last month - 59 stars on GitHub - 5 maintainers
er-evaluation 2.3.0 💰
An End-to-End Evaluation Framework for Entity Resolution Systems.
9 versions - Latest release: 6 months ago - 1 dependent package - 1 dependent repository - 103 downloads last month - 9 stars on GitHub - 1 maintainer
entity-embed 0.0.6
Transform entities like companies, products, etc. into vectors to support scalable Record Linkage...
6 versions - Latest release: almost 3 years ago - 1 dependent repository - 38 downloads last month - 138 stars on GitHub - 1 maintainer
floof 0.1.11
A library for fuzzy matching
19 versions - Latest release: 7 months ago - 699 downloads last month - 5 stars on GitHub - 1 maintainer
Related Keywords
record-linkage (23), natural-language-understanding (18), nlu (18), transformers (17), sentiment-classifier (16), seq2seq (16), sentiment-analysis (16), spell-checker (16), streamlit (16), t5 (16), text-classification (16), text-summarization (16), text-translation (16), sentence-embeddings (16), pandas (16), named-entity-recognition (16), lemmatizer (16), language-detection (16), dependency-parsing (16), bert-embedding (16), NLP (16), spark (13), deduplication (12), python (11), NLU (10), development (10), entity resolution (9), fuzzy-matching (8), dedupe (8), data-matching (6), Labs (6), Snow (6), John (6), Medical (6), Legal (6), Finance (6), OCR (6), Spark (6), record linkage (6), data-science (4), knowledge graph (4), machine-learning (4), knowledge-graph (4), entity-matching (3), python-library (3), entity-alignment (3), clustering (3), etl (2), similarity (2), de-duplicating (2), dedupe-library (2), evaluation (2), cli (2), csv-files (2), deep-learning (2), csv (2), duplicate-detection (2), embedding (2), vui (2), voice-user-interface (2), voice-ui (2), voice-recognition (2), voice-control (2), voice-commands (2), voice-command-control (2), voice-command (2), voice-assistant (2), spoken-language-understanding (2), speech-recognition (2), slu (2), slot-filling (2), on-device (2), intent-inference (2), natural language understanding (2), speech recognition (2), voice control (2), voice commands (2), Speech-to-Intent (2), duckdb (2), datamade (2), approximate nearest neighbor search (1), hubness (1), graphs (1), graph-analytics (1), graph-algorithms (1), gnns (1), big-data (1), ai (1), pyspark (1), network (1), graph (1), motif (1), graphlet (1), entity alignment (1), datasets (1), uk-gov-data-science (1), data-disambigation (1), link-discovery (1), em-algorithm (1), validation (1)