pypi.org : doc2term
A fast NLP tokenizer that detects tokens and remove duplications and punctuations
Registry
-
Source
- Documentation
- JSON
purl: pkg:pypi/doc2term
Keywords:
tokenizer
, NLP
, punctuation
, standarization
, duplicate-detection
, text-processing
, text-tokenizing
, doc2term
, nlp
, nlp-library
License: Apache-2.0
Latest release: almost 4 years ago
First release: almost 4 years ago
Dependent repositories: 1
Downloads: 33 last month
Stars: 2 on GitHub
Forks: 2 on GitHub
See more repository details: repos.ecosyste.ms
Last synced: 6 days ago