proxy.golang.org : github.com/proycon/python-ucto : v0.3.0
This is a Python binding to the tokenizer Ucto. Tokenisation is one of the first step in almost any Natural Language Processing task, yet it is not always as trivial a task as it appears to be. This binding makes the power of the ucto tokeniser available to Python. Ucto itself is regular-expression based, extensible, and advanced tokeniser written in C++ (http://ilk.uvt.nl/ucto).
Registry -
Documentation -
Download -
JSON
purl: pkg:golang/github.com/proycon/python-ucto@v0.3.0
Published:
Indexed:
Related tag:
v0.3.0
Loading...
Readme
Loading...