An open API service providing package, version and dependency metadata for many open source software ecosystems and registries.

nuget.org: stanford.nlp.segmenter

Tokenization of raw text is a standard pre-processing step for many NLP tasks. For English, tokenization usually involves punctuation splitting and separation of some affixes like possessives. Other languages require more extensive token pre-processing, which is usually called segmentation.
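
To make the description above concrete, here is a minimal sketch of English tokenization that splits punctuation and separates the possessive suffix. It is a hand-written regular expression for illustration only, not the rule set used by the Stanford tools or by this package; the function name `tokenize_english` and the pattern are assumptions introduced here.

```python
import re

# Illustrative pattern only (an assumption, not the Stanford tokenizer's rules).
# Alternatives are tried left to right at each position:
#   1. the possessive suffix 's, kept as its own token
#   2. a run of word characters (letters, digits, underscore)
#   3. any single character that is neither a word character nor whitespace
TOKEN_PATTERN = re.compile(r"'s\b|\w+|[^\w\s]", re.IGNORECASE)


def tokenize_english(text: str) -> list[str]:
    """Split raw English text into word, possessive, and punctuation tokens."""
    return TOKEN_PATTERN.findall(text)


if __name__ == "__main__":
    print(tokenize_english("John's dog barked, loudly."))
    # prints ['John', "'s", 'dog', 'barked', ',', 'loudly', '.']
```

A whitespace-and-punctuation approach like this does not carry over to languages written without spaces between words, which is where the more extensive segmentation mentioned above comes in.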

purl: pkg:nuget/stanford.nlp.segmenter
Keywords: nlp, stanford, segmenter, tokenization, splitting, IKVM, dotnet, fsharp, recompiled-packages, stanford-nlp
License: MIT
Latest release: over 4 years ago
First release: almost 12 years ago
Downloads: 26,575 total
Stars: 607 on GitHub
Forks: 121 on GitHub
Total Commits: 208
Committers: 11
Average commits per author: 18.909
Development Distribution Score (DDS): 0.404
More commit stats: commits.ecosyste.ms
See more repository details: repos.ecosyste.ms
Funding links: https://www.buymeacoffee.com/sergeytihon
Last synced: 5 days ago

4.2.0.2: published almost 3 years ago
4.2.0: published over 4 years ago
3.9.2: published about 6 years ago
3.9.1: published over 7 years ago
3.8.0: published almost 8 years ago
3.7.0.1: published over 8 years ago
3.7.0: published over 8 years ago
3.6.0: published over 9 years ago
3.5.2.1: published almost 10 years ago
3.5.2: published about 10 years ago
3.5.1: published over 10 years ago
3.5.0: published over 10 years ago
3.4.0: published about 11 years ago
3.3.1.1: published about 11 years ago
3.3.0: published over 11 years ago
3.2.0: published almost 12 years ago