An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org : doc2term

A fast NLP tokenizer that detects tokens and remove duplications and punctuations

Registry - Source - Documentation - JSON
purl: pkg:pypi/doc2term
Keywords: tokenizer , NLP , punctuation , standarization , duplicate-detection , text-processing , text-tokenizing , doc2term , nlp , nlp-library
License: Apache-2.0
Latest release: almost 4 years ago
First release: almost 4 years ago
Dependent repositories: 1
Downloads: 33 last month
Stars: 2 on GitHub
Forks: 2 on GitHub
See more repository details: repos.ecosyste.ms
Last synced: 6 days ago

    Loading...
    Readme
    Loading...