python-ucto
This is a Python binding to the tokeniser Ucto. Tokenisation is one of the first steps in almost any Natural Language Processing task, yet it is not always as trivial as it appears. This binding makes the power of the Ucto tokeniser available to Python. Ucto itself is an advanced, extensible, regular-expression-based tokeniser written in C++ (https://languagemachines.github.io/ucto).
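To illustrate why tokenisation is less trivial than splitting on whitespace, here is a minimal sketch of the regular-expression-based approach using only the Python standard library. It does not use python-ucto or reproduce Ucto's actual rules; the pattern and the `tokenize` helper are hypothetical and exist only for illustration.

```python
import re

# Illustrative pattern only: these are NOT Ucto's rules, just a small
# ordered set of alternatives showing the general regex-based idea.
TOKEN_PATTERN = re.compile(
    r"""
    (?:[A-Za-z]\.){2,}        # abbreviations with internal periods, e.g. U.S.
  | \d+(?:[.,]\d+)*           # numbers with decimal or thousands separators
  | \w+(?:['’-]\w+)*          # words, keeping internal apostrophes and hyphens
  | [^\w\s]                   # any other single non-space symbol
    """,
    re.VERBOSE,
)


def tokenize(text):
    """Return the tokens of `text` (hypothetical helper, not the binding's API)."""
    return TOKEN_PATTERN.findall(text)


sentence = "The U.S. price is $3.50, isn't it?"
print(sentence.split())    # naive split leaves '$3.50,' and 'it?' glued together
print(tokenize(sentence))  # rule-based split separates punctuation from words
```

The order of the alternatives matters: abbreviations and numbers are tried before plain words so that their internal periods are not split off as punctuation. A production tokeniser such as Ucto relies on much larger, language-specific rule sets.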
Ecosystem
pypi.org
Latest Release
2 months ago
0.6.10
Versions
25
Downloads
527 last month
Dependent Packages
1
Dependent Repos
4
Links
| Registry | pypi.org |
| Source | Repository |
| Docs | Documentation |
| JSON API | View JSON |
| CodeMeta | codemeta.json |
Package Details
| PURL | pkg:pypi/python-ucto (spec) |
| License | GPL-3.0-only |
| First Release | about 11 years ago |
| Last Synced | 10 days ago |
Repository
| Stars | 29 on GitHub |
| Forks | 5 on GitHub |
| Commits | 138 |
| Committers | 1 |
| Avg per Author | 138.0 |
| DDS | 0.0 |
Rankings on pypi.org
Dependent repos
Top 7.5%