{"@context":"https://w3id.org/codemeta/3.0","@type":"SoftwareSourceCode","identifier":"pkg:golang/github.com/proycon/python-ucto","name":"github.com/proycon/python-ucto","description":"This is a Python binding to the tokenizer Ucto. Tokenisation is one of the first step in almost any Natural Language Processing task, yet it is not always as trivial a task as it appears to be. This binding makes the power of the ucto tokeniser available to Python. Ucto itself is regular-expression based, extensible, and advanced tokeniser written in C++ (http://ilk.uvt.nl/ucto).","version":"v0.6.10","softwareVersion":"v0.6.10","codeRepository":"https://github.com/proycon/python-ucto","issueTracker":"https://github.com/proycon/python-ucto/issues","programmingLanguage":{"@type":"ComputerLanguage","name":"Cython"},"dateCreated":"2015-12-07","dateModified":"2026-02-02","datePublished":"2026-02-02","copyrightYear":2015,"downloadUrl":"https://proxy.golang.org/github.com/proycon/python-ucto/@v/v0.6.10.zip","softwareHelp":{"@type":"WebSite","url":"https://pkg.go.dev/github.com/proycon/python-ucto#section-documentation"},"applicationCategory":"go","runtimePlatform":"go","developmentStatus":"active","sameAs":["https://pkg.go.dev/github.com/proycon/python-ucto"],"https://www.w3.org/ns/activitystreams#likes":29,"https://forgefed.org/ns#forks":5}