An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

github.com/blevesearch/segment

Package segment is a library for performing Unicode Text Segmentation as described in Unicode Standard Annex #29 (http://www.unicode.org/reports/tr29/). Currently only segmentation at Word Boundaries is supported. The functionality is exposed in two ways:

1. You can use a bufio.Scanner with the SplitWords implementation of SplitFunc. The SplitWords function identifies the appropriate word boundaries in the input text, and the Scanner returns tokens at those boundaries.
2. Sometimes you would also like information returned about the type of token. To do this we have introduced a new type named Segmenter. It works just like Scanner, but additionally returns a token type.

Ecosystem: proxy.golang.org
Latest Release: v0.9.1 (about 3 years ago)
Versions: 3
Dependent Packages: 557
Dependent Repos: 1,358
Package Details
PURL: pkg:golang/github.com/blevesearch/segment
License: Apache-2.0
Namespace: github.com/blevesearch
First Release: over 9 years ago
Last Synced: 4 days ago

Repository
Stars: 75 on GitHub
Forks: 16 on GitHub
Docker Dependents: 421
Docker Downloads: 5,498,979,086

Rankings on proxy.golang.org
Overall: Top 2.2%
Dependent packages: Top 0.2%
Dependent repos: Top 0.2%
Forks: Top 5.4%
Docker downloads: Top 0.1%