Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "extraction" keyword

articleparse 0.2.1 💰
Heuristic text extraction from news articles
3 versions - Latest release: over 6 years ago - 12 downloads last month - 9 stars on GitHub - 1 maintainer
conextraction 0.0.3
Extract the conversation
3 versions - Latest release: over 1 year ago - 30 downloads last month - 1 maintainer
turcy 0.0.42
A package for German Open Information Extraction
15 versions - Latest release: about 1 year ago - 1 dependent repositories - 150 downloads last month - 2 stars on GitHub - 1 maintainer
coconlp 0.0.13
Python implementation of many nlp algorithms
12 versions - Latest release: about 5 years ago - 1 dependent repositories - 123 downloads last month - 1 maintainer
aether.python 1.3.0
A python library with Aether Python functionality
22 versions - Latest release: over 3 years ago - 8 dependent repositories - 91 downloads last month - 0 stars on GitHub - 1 maintainer
Top 1.7% on pypi.org
keybert 0.8.4
KeyBERT performs keyword extraction with state-of-the-art transformer models.
17 versions - Latest release: 4 months ago - 20 dependent packages - 105 dependent repositories - 114 thousand downloads last month - 3,261 stars on GitHub - 1 maintainer
ftw.crawler 1.4.0
Crawl sites, extract text and metadata, index it in Solr
6 versions - Latest release: over 6 years ago - 2 dependent repositories - 11 downloads last month - 3 stars on GitHub - 12 maintainers
Top 9.6% on pypi.org
tableh 0.0.01
Tableh, taking the "Matt Damon - Oscar Winning actor" out of "Mahhttt Dahhmonnn."
1 version - Latest release: 10 months ago - 2 dependent repositories - 1 maintainer
newsman 1.1.0
A tool for web news scraping.
2 versions - Latest release: over 4 years ago - 1 dependent repositories - 15 downloads last month - 0 stars on GitHub - 1 maintainer
gracelib 1.0.9
A GRACE L1B data extraction library
1 version - Latest release: over 6 years ago - 1 dependent repositories - 6 downloads last month - 1 maintainer
hundate 1.0.4
NLP module for Hungarian date-entity recognition and translation to specific date values
5 versions - Latest release: over 2 years ago - 1 dependent repositories - 31 downloads last month - 3 stars on GitHub - 1 maintainer
Top 1.9% on pypi.org
adversarial-robustness-toolbox 1.17.1
Toolbox for adversarial machine learning.
58 versions - Latest release: 4 months ago - 7 dependent packages - 126 dependent repositories - 24.8 thousand downloads last month - 4,498 stars on GitHub - 2 maintainers
scrapedia 0.1.0
A scraper used for the extraction of Brazilian soccer historical data from the webpage futpedia.gl...
1 version - Latest release: over 4 years ago - 1 dependent repositories - 11 downloads last month - 0 stars on GitHub - 1 maintainer
onesheet 0.1.5
Easily access metadata for image, video, sound, and document file.
35 versions - Latest release: almost 9 years ago - 1 dependent repositories - 67 downloads last month - 3 stars on GitHub - 1 maintainer
geoextract 0.3.1
Extraction of locations from plain text
1 version - Latest release: over 5 years ago - 3 dependent repositories - 11 downloads last month - 8 stars on GitHub - 2 maintainers
credslayer 0.1.3
Extract credentials and other useful info from network captures
4 versions - Latest release: over 1 year ago - 1 dependent repositories - 146 downloads last month - 53 stars on GitHub - 1 maintainer
Top 8.8% on pypi.org
etk 2.2.8
extraction toolkit
27 versions - Latest release: over 2 years ago - 1 dependent package - 3 dependent repositories - 148 downloads last month - 78 stars on GitHub - 2 maintainers
textpipeliner 0.3.1
textpipeliner - library for extracting specific words from sentences of a document
5 versions - Latest release: over 7 years ago - 3 dependent repositories - 29 downloads last month - 68 stars on GitHub - 1 maintainer
trtimeextractor 0.1.2
Time Extractor NLP project - locate dates and times in text documents
3 versions - Latest release: over 1 year ago - 18 downloads last month - 0 stars on GitHub - 1 maintainer
unicontent 0.5.2
Python module to extract structured metadata from URL, DOI or ISBN
6 versions - Latest release: about 7 years ago - 1 dependent repositories - 80 downloads last month - 11 stars on GitHub - 1 maintainer
palimpzest
Palimpzest is a system which enables anyone to process AI-powered analytical queries simply by de...
4 versions - 87 downloads last month - 10 stars on GitHub - 1 maintainer
pyisotools 2.4.6
Simple python library for extracting and rebuilding ISOs
41 versions - Latest release: over 1 year ago - 1 dependent repositories - 212 downloads last month - 10 stars on GitHub - 1 maintainer
artesian-sdk 3.1.3
Library provides read access to the Artesian API
50 versions - Latest release: 2 months ago - 1 dependent repositories - 335 downloads last month - 2 stars on GitHub - 7 maintainers
adaptkeybert 0.0.2 💰
AdaptKeyBERT extended keyphrase extraction with zero-shot and few-shot semi-supervised domain ada...
3 versions - Latest release: over 1 year ago - 73 downloads last month - 21 stars on GitHub - 1 maintainer
vltk 1.0.4
The Vision-Language Toolkit (VLTK)
5 versions - Latest release: almost 3 years ago - 1 dependent repositories - 42 downloads last month - 0 stars on GitHub - 1 maintainer
apstrim 3.0.0
Logger and extractor of time-series data (e.g. EPICS PVs or liteServer LDOs).
21 versions - Latest release: 8 months ago - 1 dependent repositories - 174 downloads last month - 0 stars on GitHub - 1 maintainer
pydomainextractor 0.13.9
A blazingly fast domain extraction library written in Rust
34 versions - Latest release: 2 months ago - 1 dependent repositories - 483 downloads last month - 64 stars on GitHub - 1 maintainer
aiida-tbextraction 0.2.0b1
AiiDA Plugin for extracting tight-binding models
2 versions - Latest release: over 4 years ago - 68 downloads last month - 1 maintainer
eddytools 0.2.0
Event Data Discovery tool
5 versions - Latest release: almost 6 years ago - 2 dependent repositories - 29 downloads last month - 4 stars on GitHub - 1 maintainer
sia-app 1.1.0
Application to facilitate the download, exploration and visual analysis of oceanographic data.
1 version - Latest release: 12 months ago - 9 downloads last month - 0 stars on GitHub - 1 maintainer
siaextractlib 0.2.2
Provides an easy-to-use API for downloading oceanographic data.
1 version - Latest release: almost 1 year ago - 1 dependent package - 1 dependent repositories - 9 downloads last month - 0 stars on GitHub - 1 maintainer
ximage 0.3.1
xarray-based tools for image/video processing
9 versions - Latest release: almost 4 years ago - 1 dependent package - 1 dependent repositories - 60 downloads last month - 8 stars on GitHub - 1 maintainer
Top 3.8% on pypi.org
krwordrank 1.0.3
KR-WordRank: Korean Unsupervised Word/Keyword Extractor
8 versions - Latest release: almost 4 years ago - 3 dependent packages - 28 dependent repositories - 711 downloads last month - 339 stars on GitHub - 1 maintainer
pdftext 0.3.7
Extract structured text from pdfs quickly
16 versions - Latest release: 25 days ago - 1 dependent package - 5.7 thousand downloads last month - 208 stars on GitHub - 1 maintainer
petact 0.1.2
A package extraction tool
3 versions - Latest release: almost 6 years ago - 1 dependent package - 35 dependent repositories - 1.96 thousand downloads last month - 0 stars on GitHub - 1 maintainer
Top 4.6% on pypi.org
efel 5.6.24
Electrophys Feature Extract Library (eFEL)
372 versions - Latest release: 20 days ago - 9 dependent packages - 27 dependent repositories - 19.2 thousand downloads last month - 63 stars on GitHub - 3 maintainers
aiatools 0.5.1
Tools for extracting information from App Inventor AIA files
14 versions - Latest release: 7 months ago - 69 downloads last month - 11 stars on GitHub - 1 maintainer
Top 9.1% on pypi.org
torchextractor 0.3.0
Pytorch feature extraction made simple
2 versions - Latest release: about 3 years ago - 1 dependent package - 1 dependent repositories - 27.8 thousand downloads last month - 99 stars on GitHub - 1 maintainer
doxstractor 0.1.1
Doxstractor extracts structured data from text in an easily configurable way.
9 versions - Latest release: about 1 month ago - 120 downloads last month - 4 stars on GitHub - 1 maintainer
Top 1.2% on pypi.org
tika 2.6.0 💰
Apache Tika Python library
35 versions - Latest release: over 1 year ago - 33 dependent packages - 528 dependent repositories - 355 thousand downloads last month - 1,426 stars on GitHub - 1 maintainer
gimie 0.6.1
Extract structured metadata from git repositories.
7 versions - Latest release: 7 months ago - 1 dependent repositories - 59 downloads last month - 4 stars on GitHub - 3 maintainers
xextract 0.1.8
Extract structured data from HTML and XML documents like a boss.
17 versions - Latest release: about 4 years ago - 1 dependent package - 4 dependent repositories - 1.14 thousand downloads last month - 50 stars on GitHub - 1 maintainer
docspotter 0.3
DocSpotter is a Python library designed to extract specific information from document images by c...
3 versions - Latest release: 2 months ago - 17 downloads last month - 1 stars on GitHub - 1 maintainer
pydoxtools 0.8.0
This library contains a set of tools in order to extract and synthesize structured information fr...
12 versions - Latest release: 4 months ago - 87 downloads last month - 55 stars on GitHub - 1 maintainer
pcu-keyphrase 2.0
Keyphrase extraction for PCU project
2 versions - Latest release: over 5 years ago - 1 dependent repositories - 15 downloads last month - 1 stars on GitHub - 1 maintainer
df-extract 0.0.2
DecisionFacts Extraction Library extracts content from PDF, PPTX, Docx, png, jpg., and convert as...
3 versions - Latest release: 8 months ago - 1 dependent package - 1 dependent repositories - 58 downloads last month - 14 stars on GitHub - 1 maintainer
extract-drugs 1.3.0
A CLI for extracting drugs from text records
6 versions - Latest release: about 1 month ago - 138 downloads last month - 3 stars on GitHub - 1 maintainer
scrapfly-sdk 0.8.16
Scrapfly SDK for Scrapfly
38 versions - Latest release: about 1 month ago - 2 dependent repositories - 13.1 thousand downloads last month - 18 stars on GitHub - 1 maintainer
eatiht 0.1.14
A simple tool used to extract an article's text in html documents.
14 versions - Latest release: about 9 years ago - 11 dependent repositories - 40 downloads last month - 435 stars on GitHub - 1 maintainer
tf-idf 0.0.0
An implementation of TF-IDF for keyword extraction.
1 version - Latest release: 10 months ago - 1 dependent repositories - 1 stars on GitHub - 1 maintainer
reporter-utils 0.1.1
Shared utilities for data extraction.
2 versions - Latest release: 2 months ago - 2 dependent packages - 18 downloads last month - 0 stars on GitHub - 1 maintainer
diso 0.1.2
Differentiable Iso-Surface Extraction Package
12 versions - Latest release: about 2 months ago - 1 dependent package - 1.19 thousand downloads last month - 1 maintainer
etl-m-ibrahim-khalil
A simple etl package to extract data from a website and load it to a database
2 versions - 134 downloads last month - 1 maintainer
Top 7.1% on pypi.org
eyecite 2.6.3 💰
Tool for extracting legal citations from text strings.
21 versions - Latest release: about 2 months ago - 1 dependent package - 3 dependent repositories - 3 thousand downloads last month - 113 stars on GitHub - 1 maintainer
htmldata 1.1.1
Extract and modify HTML/CSS URLs, translate HTML documents <-> list data structures.
7 versions - Latest release: 10 months ago - 2 dependent repositories - 1 maintainer
citation-extractor 1.6.3
A tool to extract canonical references from text.
3 versions - Latest release: over 6 years ago - 1 dependent repositories - 13 downloads last month - 20 stars on GitHub - 1 maintainer
ibex-1d 1.3.2 💰
Image 1D Barcode EXtractor - Detect and Extract 1D Barcode(s) in Photographs
3 versions - Latest release: over 1 year ago - 18 downloads last month - 3 stars on GitHub - 1 maintainer
fintonic-ocr-handler 0.6
Library for processing OCR errors
4 versions - Latest release: over 1 year ago - 13 downloads last month - 1 maintainer
noun-hound 1.0.0
Finds nouns and noun phrases in any given text.
1 version - Latest release: over 8 years ago - 2 dependent repositories - 8 downloads last month - 1 maintainer
sopex 0.1
Library and CLI to extract the subject, predicate and object for a given english sentence
1 version - Latest release: over 11 years ago - 2 dependent repositories - 10 downloads last month - 9 stars on GitHub - 1 maintainer
llama-index-packs-amazon-product-extraction 0.1.3
llama-index packs amazon_product_extraction integration
4 versions - Latest release: 3 months ago - 20 downloads last month - 3,395 stars on GitHub - 1 maintainer
autoit-ripper 1.1.2
Extract AutoIt scripts embedded in PE binaries
5 versions - Latest release: 3 months ago - 1 dependent repositories - 187 downloads last month - 145 stars on GitHub - 1 maintainer
teklia-line-image-extractor 0.2.9
A tool for extracting a text line image from the contour with different methods
13 versions - Latest release: 3 months ago - 1 dependent package - 1 dependent repositories - 572 downloads last month - 0 stars on GitLab.com - 1 maintainer
jsonflow 0.1.1
Pandoc (Python Library)
4 versions - Latest release: 8 months ago - 19 downloads last month - 113 stars on GitHub - 1 maintainer
Top 3.4% on pypi.org
emot 3.1
Emoji and Emoticons detection package for Python
5 versions - Latest release: almost 3 years ago - 4 dependent packages - 41 dependent repositories - 50.7 thousand downloads last month - 188 stars on GitHub - 1 maintainer
article-extraction 0.3.0
Article text extraction library
1 version - Latest release: about 1 year ago - 1 dependent package - 35 downloads last month - 5 stars on GitHub - 1 maintainer
pyang-module-catalog-plugin 0.2
A pyang plugin to extract OpenConfig module catalog data from YANG modules
2 versions - Latest release: about 7 years ago - 1 dependent repositories - 11 downloads last month - 0 stars on GitHub - 1 maintainer
osf-eimtc 0.1.53
A Framework for Encrypted Internet and Malicious Traffic Classification.
55 versions - Latest release: 26 days ago - 1 dependent repositories - 513 downloads last month - 8 stars on GitHub - 4 maintainers
pyaca 0.3.1 💰
scripts accompanying the book An Introduction to Audio Content Analysis by Alexander Lerch
11 versions - Latest release: over 2 years ago - 1 dependent repositories - 210 downloads last month - 148 stars on GitHub - 1 maintainer
huhuseg 0.6.1
Simple Chinese segmentator, keywords extractor and other examples
13 versions - Latest release: about 6 years ago - 1 dependent repositories - 58 downloads last month - 8 stars on GitHub - 1 maintainer
gse 0.1.9
extract metadata and dataset from GEO Series Matrix format data
3 versions - Latest release: over 10 years ago - 2 dependent repositories - 27 downloads last month - 1 maintainer
shegox-oidc-test 0.3.4
Python client library for convenient usage of SAP Business Document Processing services
1 version - Latest release: 8 months ago - 17 downloads last month - 19 stars on GitHub - 1 maintainer
sap-business-document-processing 0.4.1
Python client library for convenient usage of SAP Business Document Processing services
9 versions - Latest release: 26 days ago - 1 dependent repositories - 1.75 thousand downloads last month - 19 stars on GitHub - 3 maintainers
thingsvision 2.6.6
Extracting image features from state-of-the-art neural networks for Computer Vision made easy
210 versions - Latest release: 17 days ago - 1 dependent repositories - 3.34 thousand downloads last month - 141 stars on GitHub - 1 maintainer
Top 9.5% on pypi.org
xpath 0.0.1
A python library to extract objects from an object tree.
1 version - Latest release: 10 months ago - 3 dependent repositories - 1 maintainer
ovos-skill-installer 0.0.5
Mycroft skill installer from .zip or .tar.gz urls
4 versions - Latest release: about 3 years ago - 6 dependent packages - 1 dependent repositories - 3.01 thousand downloads last month - 0 stars on GitHub - 2 maintainers
scrapyz 0.3.3
Scrape Easy
6 versions - Latest release: almost 9 years ago - 2 dependent repositories - 70 downloads last month - 188 stars on GitHub - 1 maintainer
torex 0.1.1
Torrent extraction automation
2 versions - Latest release: over 8 years ago - 2 dependent repositories - 11 downloads last month - 0 stars on GitHub - 1 maintainer
Top 6.2% on pypi.org
skeletor 1.3.0
Python 3 library to extract skeletons from 3D meshes
8 versions - Latest release: about 2 months ago - 2 dependent packages - 5 dependent repositories - 902 downloads last month - 175 stars on GitHub - 1 maintainer
fuzzpyxl 0.0.4
Helper functions to easily search for Excel-Cells by value, color, formatting or else
2 versions - Latest release: almost 2 years ago - 25 downloads last month - 0 stars on GitHub - 1 maintainer
newsworker 1.0.1
Advanced news feeds extractor and finder library. Helps to automatically extract news from websi...
1 version - Latest release: almost 6 years ago - 1 dependent repositories - 124 downloads last month - 76 stars on GitHub - 1 maintainer
giveme5w1h 1.0.18 💰
Extraction of the journalistic five W and one H questions (5W1H) from news articles.
15 versions - Latest release: almost 3 years ago - 3 dependent repositories - 119 downloads last month - 500 stars on GitHub - 1 maintainer
thepipe-api 0.3.5
Automate information extraction for multimodal LLMs.
24 versions - Latest release: 16 days ago - 1.5 thousand downloads last month - 537 stars on GitHub - 1 maintainer
rakun2 0.25
RaKUn 2.0; Better faster stronger lighter
7 versions - Latest release: over 1 year ago - 1 dependent repositories - 481 downloads last month - 61 stars on GitHub - 1 maintainer
cloudsdp 0.1.11
11 versions - Latest release: 10 months ago - 94 downloads last month - 0 stars on GitHub - 1 maintainer
docile-benchmark 0.3.4
Tools to work with the DocILE dataset and benchmark
5 versions - Latest release: 18 days ago - 747 downloads last month - 106 stars on GitHub - 1 maintainer
Top 9.7% on pypi.org
libextract 0.0.12
A HT/XML web scraping tool
4 versions - Latest release: almost 9 years ago - 5 dependent repositories - 23 downloads last month - 500 stars on GitHub - 1 maintainer
iepy 0.9.6
Information Extraction framework in Python
7 versions - Latest release: over 7 years ago - 3 dependent repositories - 49 downloads last month - 903 stars on GitHub - 1 maintainer
Top 5.7% on pypi.org
extractcode 31.0.0 💰
A mostly universal archive extractor using 7zip, libarchive and the Python standard library for r...
9 versions - Latest release: about 2 years ago - 1 dependent package - 28 dependent repositories - 16.2 thousand downloads last month - 31 stars on GitHub - 4 maintainers
sound-extraction 2.1.2
Slice and segment your audio files easily with open source Python program. Our tool enables you t...
10 versions - Latest release: 11 months ago - 93 downloads last month - 4 stars on GitHub - 1 maintainer
evidencer 0.2
Framework for modular information extraction.
2 versions - Latest release: almost 5 years ago - 1 dependent repositories - 16 downloads last month - 0 stars on GitHub - 1 maintainer
Top 6.6% on pypi.org
parsr-client 3.2.3
Python client for Parsr - Transforms PDF, Documents and Images into Enriched Structured Data
10 versions - Latest release: almost 4 years ago - 1 dependent package - 11 dependent repositories - 219 downloads last month - 5,634 stars on GitHub - 1 maintainer
newspaper4k 0.9.3
Simplified python article discovery & extraction.
5 versions - Latest release: 3 months ago - 9.48 thousand downloads last month - 306 stars on GitHub - 1 maintainer
carpenter 1.0.2
A utility library which repairs and analyzes tabular data
3 versions - Latest release: over 9 years ago - 37 downloads last month - 2 stars on GitHub - 1 maintainer
Top 5.8% on pypi.org
audiotools 0.1.0 removed 💰
Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications
1 version - Latest release: over 7 years ago - 11 dependent repositories - 190 downloads last month - 5,550 stars on GitHub - 1 maintainer
Top 6.0% on pypi.org
woob 3.3.1
Woob, Web Outside Of Browsers
9 versions - Latest release: over 1 year ago - 2 dependent packages - 4 dependent repositories - 23.6 thousand downloads last month - 110 stars on GitLab.com - 2 maintainers
Top 6.0% on pypi.org
stanford-openie 1.3.2 💰
Minimalist wrapper around Stanford OpenIE
8 versions - Latest release: 5 months ago - 3 dependent packages - 14 dependent repositories - 751 downloads last month - 616 stars on GitHub - 1 maintainer
wordspy 2.0.0
Get words for any language.
2 versions - Latest release: 4 months ago - 22 downloads last month - 1 stars on GitHub - 1 maintainer
rake 1.0
('Rapid Automatic Keywords Extraction', 'Just a Practice')
1 version - Latest release: over 10 years ago - 7 dependent repositories - 263 downloads last month - 7 stars on GitHub - 1 maintainer
pythonrlsa 1.0.0
Python Run Length Smoothing Algorithm for Document Processing
3 versions - Latest release: over 3 years ago - 1 dependent repositories - 357 downloads last month - 27 stars on GitHub - 1 maintainer