Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "information-extraction" keyword

nlpcube 0.3.1.2
Natural Language Processing Toolkit with support for tokenization, sentence splitting, lemmatizat...
22 versions - Latest release: 12 months ago - 1 dependent repositories - 135 downloads last month - 551 stars on GitHub - 4 maintainers
fastner 0.1.3
Finetune transformer-based models for the Named Entity Recognition task in a simple and fast way.
21 versions - Latest release: over 1 year ago - 24 downloads last month - 1 stars on GitHub - 1 maintainer
cprex 0.3.0
Chemical Properties Relation Extraction
1 version - Latest release: 3 months ago - 15 downloads last month - 0 stars on GitHub - 1 maintainer
parstdex 1.3.1
Persian time and date marker extractor
14 versions - Latest release: almost 2 years ago - 2 dependent repositories - 75 downloads last month - 25 stars on GitHub - 2 maintainers
Top 9.3% on pypi.org
snorkel-metal 0.5.0
A system for quickly generating training data with multi-task weak supervision
21 versions - Latest release: about 5 years ago - 10 dependent repositories - 184 downloads last month - 420 stars on GitHub - 2 maintainers
Top 9.0% on pypi.org
adaseq 0.6.6
AdaSeq: An All-in-One Library for Developing State-of-the-Art Sequence Understanding Models
11 versions - Latest release: 7 months ago - 1 dependent repositories - 334 downloads last month - 367 stars on GitHub - 1 maintainer
Top 2.5% on pypi.org
fcsparser 0.2.8
A python package for reading raw fcs files
14 versions - Latest release: 8 months ago - 9 dependent packages - 45 dependent repositories - 12.9 thousand downloads last month - 1,541 stars on GitHub - 1 maintainer
Top 4.8% on pypi.org
kor 1.0.1
Extract information with LLMs from text
19 versions - Latest release: 4 months ago - 1 dependent package - 8 dependent repositories - 26.7 thousand downloads last month - 1,521 stars on GitHub - 1 maintainer
Top 3.8% on pypi.org
htmldate 1.8.1 💰
Fast and robust extraction of original and updated publication dates from URLs and web pages.
54 versions - Latest release: about 2 months ago - 5 dependent packages - 50 dependent repositories - 1.1 million downloads last month - 114 stars on GitHub - 1 maintainer
netmedpy
NetMedPy evaluates network localization (statistical analysis of the largest connected component/...
2 versions - 236 downloads last month - 0 stars on GitHub - 1 maintainer
slotminer 0.9.2
python package for slot extraction (information extraciton) from texts.
3 versions - Latest release: about 2 years ago - 1 dependent repositories - 13 downloads last month - 16 stars on GitHub - 1 maintainer
cogie 0.1.6
CogIE: An Information Extraction Toolkit for Bridging Text and CogNet
7 versions - Latest release: almost 2 years ago - 1 dependent repositories - 43 downloads last month - 64 stars on GitHub - 1 maintainer
puggle 0.2.13
A Python package for working with the outputs of Information Extraction models and tools such as ...
20 versions - Latest release: 2 months ago - 47 downloads last month - 2 stars on GitHub - 1 maintainer
lanno 0.1.6
Let Large Language Models Serve As Data Annotators.
8 versions - Latest release: about 1 year ago - 54 downloads last month - 26 stars on GitHub - 1 maintainer
tieval 0.1.2
A framework for evaluation and development of temporal-aware models.
11 versions - Latest release: 3 months ago - 1 dependent repositories - 41 downloads last month - 15 stars on GitHub - 1 maintainer
brevia 0.0.28
Extensible API and framework to build your Retrieval Augmented Generation (RAG) and Information E...
28 versions - Latest release: 26 days ago - 1.03 thousand downloads last month - 22 stars on GitHub - 1 maintainer
snowball-extractor 1.0.5
Snowball: Extracting Relations from Large Plain-Text Collections
5 versions - Latest release: about 1 year ago - 32 downloads last month - 176 stars on GitHub - 1 maintainer
naimai 1.0.0
Python library to help with scientific literature research
7 versions - Latest release: over 1 year ago - 26 downloads last month - 24 stars on GitHub - 1 maintainer
Top 3.8% on pypi.org
medcat 1.11.0
Concept annotation tool for Electronic Health Records
327 versions - Latest release: about 1 month ago - 2 dependent packages - 21 dependent repositories - 4.98 thousand downloads last month - 405 stars on GitHub - 3 maintainers
jointtsmodel 1.6
jointtsmodel - library of joint topic-sentiment models
7 versions - Latest release: almost 4 years ago - 1 dependent repositories - 37 downloads last month - 32 stars on GitHub - 1 maintainer
xpotato 0.1.5
XAI human-in-the-loop information extraction framework
15 versions - Latest release: about 1 year ago - 2 dependent repositories - 132 downloads last month - 46 stars on GitHub - 1 maintainer
geoparsepy 2.1.4
Geoparsing library to extract and disambiguate locations from text, using OSM database for very h...
7 versions - Latest release: almost 4 years ago - 1 dependent repositories - 59 downloads last month - 54 stars on GitHub - 1 maintainer
gliclass
Generalist and Lightweight Model for Text Classification
2 versions - 686 stars on GitHub - 1 maintainer
rnn4ie 0.1.0
Chinese Information Extraction
1 version - Latest release: almost 3 years ago - 1 dependent repositories - 10 downloads last month - 0 stars on GitHub - 1 maintainer
dlkp 0.0.1
A deep learning library for keyphrase extraction and generation
1 version - Latest release: over 2 years ago - 1 dependent repositories - 12 downloads last month - 25 stars on GitHub - 2 maintainers
Top 1.0% on pypi.org
paddlenlp 2.8.0
Easy-to-use and powerful NLP library with Awesome model zoo, supporting wide-range of NLP tasks f...
90 versions - Latest release: about 2 months ago - 15 dependent packages - 438 dependent repositories - 42.3 thousand downloads last month - 11,391 stars on GitHub - 1 maintainer
simple-ner 0.8.1
rule based NER
29 versions - Latest release: about 3 years ago - 1 dependent repositories - 87 downloads last month - 40 stars on GitHub - 2 maintainers
soton-corenlppy 1.0.0
NLP library providing support for information extraction applications
1 version - Latest release: almost 4 years ago - 1 dependent repositories - 43 downloads last month - 2 stars on GitHub - 1 maintainer
Top 10.0% on pypi.org
holmes-extractor 4.2.1
Information extraction from English and German texts based on predicate logic
16 versions - Latest release: about 1 year ago - 2 dependent repositories - 186 downloads last month - 130 stars on GitHub - 2 maintainers
ytu 0.2.0 💰
A library to extract information from YouTube URLs.
2 versions - Latest release: almost 5 years ago - 1 dependent repositories - 23 downloads last month - 0 stars on GitHub - 1 maintainer
pyopenie 0.2.0
Python wrapper for OpenIE5
1 version - Latest release: over 4 years ago - 1 dependent package - 3 dependent repositories - 36 downloads last month - 41 stars on GitHub - 1 maintainer
pdf4py 0.1.0
A PDF parser written in Python3 with no external dependencies.
3 versions - Latest release: about 4 years ago - 1 dependent repositories - 19 downloads last month - 55 stars on GitHub - 1 maintainer
extr 0.0.44
Named Entity Recognition (NER) and Relation Extraction (RE) library using Regular Expressions
44 versions - Latest release: about 1 year ago - 1 dependent package - 1 dependent repositories - 261 downloads last month - 8 stars on GitHub - 1 maintainer
ontogpt 0.3.11
OntoGPT
27 versions - Latest release: about 2 months ago - 1 dependent repositories - 490 downloads last month - 530 stars on GitHub - 2 maintainers
targeted-sum 1.0.9
A package for targeted summarization
10 versions - Latest release: over 1 year ago - 45 downloads last month - 87 stars on GitHub - 1 maintainer
llano 0.1.8
Let Large Language Models Serve As Data Annotators.
2 versions - Latest release: about 1 year ago - 20 downloads last month - 26 stars on GitHub - 1 maintainer
quantulum 0.1.16
Extract quantities from unstructured text.
17 versions - Latest release: 10 months ago - 1 dependent package - 4 dependent repositories - 73 downloads last month - 119 stars on GitHub - 1 maintainer
gliner 0.1.13
Generalist model for NER (Extract any entity types from texts)
18 versions - Latest release: about 1 month ago - 3 dependent packages - 66.3 thousand downloads last month - 686 stars on GitHub - 1 maintainer
Top 8.8% on pypi.org
pytorch-ie 0.30.3
State-of-the-art Information Extraction in PyTorch
54 versions - Latest release: about 2 months ago - 3 dependent packages - 2 dependent repositories - 244 downloads last month - 65 stars on GitHub - 1 maintainer
Top 3.5% on pypi.org
yargy 0.16.0
Rule-based facts extraction for Russian language
15 versions - Latest release: 11 months ago - 2 dependent packages - 56 dependent repositories - 10 thousand downloads last month - 307 stars on GitHub - 2 maintainers
Top 6.0% on pypi.org
abbreviations 0.2.5
Python3 implementation of the Schwartz-Hearst algorithm for extracting abbreviation-definition pairs
9 versions - Latest release: over 4 years ago - 1 dependent package - 8 dependent repositories - 8.18 thousand downloads last month - 85 stars on GitHub - 1 maintainer
pyeditdistance 1.0.1
A pure, minimalist, no-dependency Python library of various edit distances.
7 versions - Latest release: almost 2 years ago - 1 dependent package - 1.35 thousand downloads last month - 0 stars on GitHub - 1 maintainer
pyfmr 0.0.2
A python wrapper for FMR
2 versions - Latest release: over 5 years ago - 1 dependent repositories - 24 downloads last month - 2 stars on GitHub - 1 maintainer
faster-tokenizers 0.1.1
PaddleNLP Faster Tokenizer Library written in C++
2 versions - Latest release: about 2 years ago - 1 dependent repositories - 43 downloads last month - 11,391 stars on GitHub - 1 maintainer
patterns-finder 1.0.1
Simple, Fast, Powerful and Easily extensible python package
2 versions - Latest release: over 1 year ago - 1 dependent repositories - 18 downloads last month - 23 stars on GitHub - 1 maintainer
Top 3.3% on pypi.org
fast-tokenizer-python 1.0.2
PaddleNLP Fast Tokenizer Library written in C++
4 versions - Latest release: over 1 year ago - 2 dependent packages - 14 dependent repositories - 1.15 thousand downloads last month - 11,391 stars on GitHub - 1 maintainer
tool-helpers 0.1.1
Data tool helpers for PaddleNLP pre-training.
2 versions - Latest release: almost 2 years ago - 1 dependent package - 10.4 thousand downloads last month - 11,391 stars on GitHub - 1 maintainer
saber 0.1.0
Saber: Sequence Annotator for Biomedical Entities and Relations
1 version - Latest release: over 5 years ago - 2 dependent repositories - 27 downloads last month - 103 stars on GitHub - 1 maintainer
Top 8.6% on pypi.org
paddle-pipelines 0.6.2
Paddle-Pipelines: An End to End Natural Language Proceessing Development Kit Based on PaddleNLP
11 versions - Latest release: 6 months ago - 1 dependent repositories - 129 downloads last month - 11,391 stars on GitHub - 1 maintainer
faster-tokenizer 0.2.0
PaddleNLP Faster Tokenizer Library written in C++
7 versions - Latest release: over 1 year ago - 1 dependent package - 187 downloads last month - 11,391 stars on GitHub - 1 maintainer
cnn4ie 0.1.9
Chinese Information Extraction
10 versions - Latest release: over 2 years ago - 1 dependent repositories - 38 downloads last month - 6 stars on GitHub - 1 maintainer
bent 0.0.62
BENT: Biomedical Entity Annotator
53 versions - Latest release: 25 days ago - 1 dependent repositories - 2.01 thousand downloads last month - 9 stars on GitHub - 1 maintainer
huspacy-nightly 0.11.0.dev261 💰
HuSpaCy: industrial strength Hungarian natural language processing
126 versions - Latest release: 5 months ago - 1 dependent repositories - 303 downloads last month - 142 stars on GitHub - 1 maintainer
Top 4.7% on pypi.org
numerizer 0.2.3
Python module for converting natural language numbers into ints and floats.
10 versions - Latest release: about 1 year ago - 1 dependent package - 10 dependent repositories - 24.2 thousand downloads last month - 210 stars on GitHub - 1 maintainer
Top 6.6% on pypi.org
huspacy 0.11.0 💰
HuSpaCy: industrial strength Hungarian natural language processing
21 versions - Latest release: 8 months ago - 1 dependent package - 6 dependent repositories - 933 downloads last month - 142 stars on GitHub - 1 maintainer
eznlp 0.2.4
Easy Natural Language Processing
5 versions - Latest release: about 1 year ago - 1 dependent repositories - 31 downloads last month - 120 stars on GitHub - 1 maintainer
Top 8.7% on pypi.org
deduce 3.0.2
Deduce: de-identification method for Dutch medical text
24 versions - Latest release: 4 months ago - 3 dependent repositories - 1.93 thousand downloads last month - 46 stars on GitHub - 1 maintainer
Top 5.0% on pypi.org
geotext 0.4.0
Geotext extracts countriy and city mentions from text
4 versions - Latest release: almost 6 years ago - 69 dependent repositories - 47.3 thousand downloads last month - 128 stars on GitHub - 1 maintainer
aymara 0.4.1
Python bindings to the LIMA linguistic analyzer
22 versions - Latest release: almost 2 years ago - 1 dependent repositories - 303 downloads last month - 102 stars on GitHub - 1 maintainer
Top 3.5% on pypi.org
eventregistry 8.6.1
A package that can be used to query information in Event Registry (http://eventregistry.org/)
49 versions - Latest release: almost 5 years ago - 5 dependent packages - 26 dependent repositories - 2.72 thousand downloads last month - 221 stars on GitHub - 1 maintainer
fuzzy-search 2.1.0
Tool for fuzzy searching in texts with historical language use and OCR/HTR errors
24 versions - Latest release: 6 months ago - 1 dependent package - 1 dependent repositories - 668 downloads last month - 19 stars on GitHub - 1 maintainer
pawpaw 1.0.0rc7
High Performance Text Processing & Segmentation Framework
15 versions - Latest release: 5 months ago - 116 downloads last month - 13 stars on GitHub - 1 maintainer
chemdataextractor-c 1.0.0
A toolkit for extracting chemical information from the scientific literature.
1 version - Latest release: about 1 year ago - 34 downloads last month - 278 stars on GitHub - 1 maintainer
Top 6.3% on pypi.org
chemdataextractor 1.3.0
A toolkit for extracting chemical information from the scientific literature.
8 versions - Latest release: over 7 years ago - 14 dependent repositories - 884 downloads last month - 278 stars on GitHub - 1 maintainer
Top 6.8% on pypi.org
deepke 2.2.7
DeepKE is a knowledge extraction toolkit for knowledge graph construction supporting low-resource...
111 versions - Latest release: 9 months ago - 2 dependent repositories - 761 downloads last month - 2,876 stars on GitHub - 6 maintainers
zensols.medcat 1.3.0
Concept annotation tool for Electronic Health Records
1 version - Latest release: over 1 year ago - 1 dependent package - 19 downloads last month - 405 stars on GitHub - 1 maintainer
Top 3.3% on pypi.org
snips-nlu 0.20.2
Snips Natural Language Understanding library
33 versions - Latest release: over 4 years ago - 1 dependent package - 35 dependent repositories - 2.93 thousand downloads last month - 3,867 stars on GitHub - 2 maintainers
pyphishtanklookup 1.3.2
Python CLI and module for PhishtankLookup
7 versions - Latest release: 4 months ago - 3 dependent repositories - 675 downloads last month - 0 stars on GitHub - 1 maintainer
pynutshell 1.0.2
An unsupervised text summarization and information retrieval library under the hood using natural...
3 versions - Latest release: over 3 years ago - 1 dependent repositories - 30 downloads last month - 13 stars on GitHub - 1 maintainer
kidx-nlu 0.0.1a7
Kidx NLU a natural language parser for bots
7 versions - Latest release: about 5 years ago - 1 dependent repositories - 74 downloads last month - 2,904 stars on GitHub - 1 maintainer
cathodedataextractor 0.0.4
A document-level information extraction pipeline for layered cathode materials for sodium-ion bat...
4 versions - Latest release: 3 months ago - 18 downloads last month - 3 stars on GitHub - 1 maintainer