An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

python-ucto

This is a Python binding to the tokenizer Ucto. Tokenisation is one of the first step in almost any Natural Language Processing task, yet it is not always as trivial a task as it appears to be. This binding makes the power of the ucto tokeniser available to Python. Ucto itself is a regular-expression based, extensible, and advanced tokeniser written in C++ (https://languagemachines.github.io/ucto).

Ecosystem
pypi.org
Latest Release
0.6.10
2 months ago
Versions
25
Downloads
527 last month
Dependent Packages
1
Dependent Repos
4
Links
Registry pypi.org
Source Repository
Docs Documentation
JSON API View JSON
CodeMeta codemeta.json
Package Details
PURL pkg:pypi/python-ucto
spec
License GPL-3.0-only
First Release about 11 years ago
Last Synced 10 days ago
Repository
Stars 29 on GitHub
Forks 5 on GitHub
Commits 138
Committers 1
Avg per Author 138.0
DDS 0.0
Rankings on pypi.org
Dependent repos Top 7.5%