proxy.golang.org : github.com/proycon/python-ucto : v0.5.0
This is a Python binding to the tokenizer Ucto. Tokenisation is one of the first step in almost any Natural Language Processing task, yet it is not always as trivial a task as it appears to be. This binding makes the power of the ucto tokeniser available to Python. Ucto itself is regular-expression based, extensible, and advanced tokeniser written in C++ (http://ilk.uvt.nl/ucto).
      Registry - 
      Documentation - 
      Download -
    JSON
    
    purl: pkg:golang/github.com/proycon/python-ucto@v0.5.0
    
Published: 
    
Indexed: 
      
Related tag:
        v0.5.0
  
Loading...
    Readme
      Loading...