Top 6.2% forks on pypi.org
pypi.org : eatiht
A simple tool used to extract an article's text in html documents.
Registry
-
Source
- Documentation
- JSON
- codemeta.json
purl: pkg:pypi/eatiht
Keywords:
extract
, extracted
, extracting
, extraction
, filter
, filtered
, filtering
, out
, remove
, removed
, removing
, removal
, text
, textbody
, body
, content
, contents
, sentence
, sentences
, word
, words
, boilerplate
, boilerpipe
, delete
, tag
, tags
, unsupervised
, classification
, machine
, learning
, algorithm
, opening
, closing
, main
, article
, html
, hypertext
, Rodrigo
, Palacios
, rodrigo
, palacios
, im-rodrigo
, im_rodrigo
, rodricios
License: MIT
Latest release: over 10 years ago
First release: almost 11 years ago
Dependent repositories: 11
Downloads: 97 last month
Stars: 432 on GitHub
Forks: 43 on GitHub
See more repository details: repos.ecosyste.ms
Last synced: 12 days ago
im-rodrigo
6 packages1,378 downloads