An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

proxy.golang.org "text-processing" keyword

View the packages on the proxy.golang.org package registry that are tagged with the "text-processing" keyword.

Top 1.4% on proxy.golang.org
github.com/abadojack/whatlanggo v1.0.1
Package whatlanggo detects natural languages and scripts (writing systems). Languages are repre...
2 versions - Latest release: over 6 years ago - 167 dependent packages - 224 dependent repositories - 578 stars on GitHub
github.com/npillmayer/cords v0.1.1
Package cords offers a versatile string enhancement to ease handling of texts. Cords (or sometim...
2 versions - Latest release: almost 4 years ago - 1 dependent package - 1 dependent repository - 0 stars on GitHub
Top 6.6% on proxy.golang.org
github.com/pymupdf/PyMuPDF v1.12.6
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipula...
5 versions - Latest release: over 7 years ago - 8,167 stars on GitHub
Top 6.6% on proxy.golang.org
github.com/pymupdf/pymupdf v1.12.6
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipula...
5 versions - Latest release: over 7 years ago - 8,167 stars on GitHub
github.com/rsdoiel/pdtk v0.0.19
This is a pandoc preprocessor toolkit based on my experiment pdtmpl
6 versions - Latest release: 7 months ago - 0 stars on GitHub
github.com/opencars/translit v0.1.2
👨‍🏫 Transliterate text from/to Cyrillic
3 versions - Latest release: almost 6 years ago - 8 dependent packages - 5 dependent repositories - 0 stars on GitHub
Top 8.3% on proxy.golang.org
github.com/digineo/texd v0.7.0
texd wraps TeX in a web API
15 versions - Latest release: about 1 year ago - 12 stars on GitHub
Top 2.0% on proxy.golang.org
github.com/pemistahl/lingua-go v1.4.0
Package lingua accurately detects the natural language of written text, be it long or short. Its...
17 versions - Latest release: about 2 years ago - 31 dependent packages - 14 dependent repositories - 862 stars on GitHub
Top 8.2% on proxy.golang.org
toolman.org/text/interp v0.3.1
A simple library for interpolating string variables into a body of text.
3 versions - Latest release: over 4 years ago - 0 stars on GitHub
Top 8.2% on proxy.golang.org
github.com/yujiahaol68/fcompl v0.1.3
A fast phrase completion library using a trie. Also provides path compression for lower memory...
4 versions - Latest release: over 6 years ago - 2 stars on GitHub
Top 8.2% on proxy.golang.org
github.com/raypereda/shuffle v0.0.0-20230308193521-c6818b3499c1
This command reads an input text file, and writes shuffled lines. Output is set to standard output.
2 versions - Latest release: over 2 years ago - 2 stars on GitHub
Top 9.0% on proxy.golang.org
github.com/toolmanorg/text-interp v0.3.1
A simple library for interpolating string variables into a body of text.
3 versions - Latest release: over 4 years ago - 0 stars on GitHub
Top 8.2% on proxy.golang.org
github.com/AllenDang/PipeIt v0.0.0-20220113132018-849da234d858
PipeIt is a text transformation, conversion, cleansing and extraction tool.
1 version - Latest release: over 3 years ago - 74 stars on GitHub
Top 5.6% on proxy.golang.org
github.com/PyThaiNLP/pythainlp v5.1.2+incompatible
Thai natural language processing in Python
72 versions - Latest release: 5 months ago - 1,073 stars on GitHub
Top 5.6% on proxy.golang.org
github.com/pythainlp/pythainlp v5.1.2+incompatible
Thai natural language processing in Python
72 versions - Latest release: 5 months ago - 1,070 stars on GitHub
Top 5.1% on proxy.golang.org
github.com/cogcomp/cogcomp-nlpy v1.6.0
CogComp's light-weight Python NLP annotators
4 versions - Latest release: about 7 years ago - 116 stars on GitHub
Top 5.7% on proxy.golang.org
github.com/mit-lcp/bloatectomy v0.0.12
A python package for removing duplicate text in clinical notes or other documents
2 versions - Latest release: over 5 years ago - 36 stars on GitHub
Top 9.0% on proxy.golang.org
github.com/tsubasaogawa/regex-replacing-tee v0.0.1
main
2 versions - Latest release: almost 2 years ago - 0 stars on GitHub
Top 5.0% on proxy.golang.org
github.com/birchb1024/frangipanni v0.5.0
Program to convert lines of text into a tree structure.
6 versions - Latest release: over 4 years ago - 1,186 stars on GitHub
Top 8.2% on proxy.golang.org
github.com/nicolasassi/gomtch v1.1.0
Find text even if it doesn't want to be found
2 versions - Latest release: about 4 years ago - 28 stars on GitHub
Top 5.7% on proxy.golang.org
github.com/MIT-LCP/bloatectomy v0.0.12
A python package for removing duplicate text in clinical notes or other documents
2 versions - Latest release: over 5 years ago - 36 stars on GitHub
Top 6.7% on proxy.golang.org
github.com/whitfin/bytelines v2.5.0+incompatible
Read input lines as byte slices for high efficiency
9 versions - Latest release: over 1 year ago - 66 stars on GitHub
Top 5.5% on proxy.golang.org
github.com/ga1az/pathdigest v0.1.0
A command-line tool written in Go that analyzes Git repositories, local directories, or individua...
3 versions - Latest release: 4 months ago - 5 stars on GitHub
Top 9.5% on proxy.golang.org
github.com/shsms/mime v0.3.0
mime is a scripting tool for text processing, inspired by Emacs Keyboard Macros.
16 versions - Latest release: over 2 years ago - 7 stars on GitHub
Top 6.8% on proxy.golang.org
github.com/npillmayer/uax v0.2.0
Package uax is about Unicode Annexes and their algorithms. From the Unicode Consortium: A Unico...
2 versions - Latest release: almost 4 years ago - 2 dependent packages - 7 dependent repositories - 7 stars on GitHub
github.com/veggiemonk/inbtw v1.3.0
Small Go utility to extract the text in between two tags
4 versions - Latest release: over 2 years ago - 0 stars on GitHub
Top 5.5% on proxy.golang.org
github.com/BytesParadise/libasciidoc v0.8.0
A Golang library for processing Asciidoc files.
9 versions - Latest release: almost 3 years ago - 205 stars on GitHub
Top 6.7% on proxy.golang.org
github.com/derek73/python-nameparser v1.1.3
A simple Python module for parsing human names into their individual components
38 versions - Latest release: about 2 years ago - 686 stars on GitHub
Top 5.1% on proxy.golang.org
github.com/CogComp/cogcomp-nlpy v1.6.0
CogComp's light-weight Python NLP annotators
4 versions - Latest release: about 7 years ago - 116 stars on GitHub
Top 8.2% on proxy.golang.org
github.com/jaycechant/drivel v0.1.0
A tool that makes your article like drivel.
1 version - Latest release: over 5 years ago - 1 star on GitHub
Top 3.8% on proxy.golang.org
github.com/sobhe/hazm v0.10.0
Persian NLP Toolkit
8 versions - Latest release: over 1 year ago - 1,281 stars on GitHub
Top 8.2% on proxy.golang.org
github.com/LanguageMachines/frog v0.27.2
Frog is an integration of memory-based natural language processing (NLP) modules developed for Du...
19 versions - Latest release: over 2 years ago - 78 stars on GitHub
Top 9.8% on proxy.golang.org
github.com/thalesfsp/sypl v1.19.20
Package sypl provides a Simple Yet Powerful Logger built on top of the Golang logger. A sypl logg...
35 versions - Latest release: 10 months ago - 13 dependent packages - 3 dependent repositories - 2 stars on GitHub
Top 8.3% on proxy.golang.org
github.com/catatsuy/purl v0.2.2 💰
Streamlining Text Processing
12 versions - Latest release: 26 days ago - 221 stars on GitHub
Top 5.4% on proxy.golang.org
github.com/bytesparadise/libASCIIDoc v0.8.0
Package libasciidoc is an open source Go library that converts Asciidoc content into HTML.
9 versions - Latest release: almost 3 years ago - 168 stars on GitHub
Top 6.0% on proxy.golang.org
github.com/JayceChant/drivel v0.1.0
A tool that makes your article like drivel.
1 version - Latest release: over 5 years ago - 1 star on GitHub
Top 8.2% on proxy.golang.org
github.com/balacode/cmdx v1.1.4
Package cmdx contains the cmdx (cx) command which amalgamates various useful command-line utiliti...
6 versions - Latest release: over 3 years ago - 9 stars on GitHub
Top 2.9% on proxy.golang.org
github.com/bytesparadise/libasciidoc v0.8.0
Package libasciidoc is an open source Go library that converts Asciidoc content into HTML.
9 versions - Latest release: almost 3 years ago - 10 dependent packages - 12 dependent repositories - 168 stars on GitHub
Top 8.2% on proxy.golang.org
github.com/mycroftai/lingua-franca v0.4.2
Mycroft's multilingual text parsing and formatting library
1 version - Latest release: over 4 years ago - 77 stars on GitHub
Top 8.2% on proxy.golang.org
github.com/MycroftAI/lingua-franca v0.4.2
Mycroft's multilingual text parsing and formatting library
1 version - Latest release: over 4 years ago - 77 stars on GitHub
Top 8.2% on proxy.golang.org
github.com/maxim2266/trw v0.7.0
Package trw wraps around various text processing functions from the standard Go library to allow ...
4 versions - Latest release: over 1 year ago - 9 stars on GitHub
Top 5.5% on proxy.golang.org
github.com/acarl005/stripAnsi
A little Go package for removing ANSI color escape codes from strings.
Latest release: 14 days ago - 126 stars on GitHub
Top 8.2% on proxy.golang.org
github.com/chmln/sd v1.0.0
Intuitive find & replace CLI (sed alternative)
9 versions - Latest release: almost 2 years ago - 6,323 stars on GitHub
Top 3.1% on proxy.golang.org
github.com/petar-dambovaliev/aho-corasick v0.0.0-20230725210150-fb29fc3c913e
Efficient string matching in Golang via the Aho-Corasick algorithm.
2 versions - Latest release: about 2 years ago - 69 dependent packages - 85 dependent repositories - 37 stars on GitHub
Top 6.2% on proxy.golang.org
github.com/orsinium-labs/stopwords v1.0.2
🙅 Go package for detecting and removing stopwords from text.
3 versions - Latest release: about 2 months ago - 6 stars on GitHub
Top 6.7% on proxy.golang.org
github.com/proycon/python-ucto v0.6.9
This is a Python binding to the tokenizer Ucto. Tokenisation is one of the first steps in almost a...
22 versions - Latest release: 10 months ago - 29 stars on GitHub
Top 5.7% on proxy.golang.org
github.com/Automattic/go-search-replace v0.0.0-20221025050708-b7a3be7cbcb3
🚀 Search & replace URLs in WordPress SQL files.
1 version - Latest release: almost 3 years ago - 103 stars on GitHub
Top 5.7% on proxy.golang.org
github.com/automattic/go-search-replace v0.0.0-20210224172752-175fa9987943
🚀 Search & replace URLs in WordPress SQL files.
1 version - Latest release: over 4 years ago - 103 stars on GitHub
github.com/rsdoiel/pttk v0.0.19
This is a pandoc preprocessor toolkit based on my experiment pdtmpl
6 versions - Latest release: 7 months ago - 0 stars on GitHub
Top 8.6% on proxy.golang.org
github.com/wmentor/lemmas v0.0.6
Text analysis for Russian
6 versions - Latest release: about 4 years ago - 1 star on GitHub
Top 7.6% on proxy.golang.org
github.com/aereal/go-text-hatena
Hatena Notation (はてな記法) Parser written in Go
Latest release: 16 days ago - 17 stars on GitHub
Top 5.6% on proxy.golang.org
github.com/AlirezaTheH/perke v0.4.4
A keyphrase extractor for Persian
7 versions - Latest release: over 2 years ago - 69 stars on GitHub
Top 5.6% on proxy.golang.org
github.com/alirezatheh/perke v0.4.4
A keyphrase extractor for Persian
7 versions - Latest release: over 2 years ago - 69 stars on GitHub
Top 5.7% on proxy.golang.org
gbenson.net/go/strcase v1.0.1
Package strcase converts strings to various cases.
2 versions - Latest release: 5 months ago - 0 stars on GitHub
Top 3.7% on proxy.golang.org
github.com/01walid/goarabic v0.0.1
Package goarabic contains utility functions for working with Arabic strings.
1 version - Latest release: over 10 years ago - 6 dependent packages - 3 dependent repositories - 113 stars on GitHub
Top 7.6% on proxy.golang.org
github.com/sstadick/hck v0.11.4
A sharp cut(1) clone.
51 versions - Latest release: 7 months ago - 722 stars on GitHub
Top 8.2% on proxy.golang.org
github.com/assafmo/xioc v1.1.12
Extract indicators of compromise from text, including "escaped" ones.
10 versions - Latest release: over 5 years ago - 160 stars on GitHub
Top 8.2% on proxy.golang.org
github.com/ofabricio/calm v0.0.0-20220307115940-dce9d52bb68f
This is a library to scan, parse and tokenize text.
10 versions - Latest release: over 3 years ago - 1 star on GitHub
Top 6.7% on proxy.golang.org
github.com/proycon/pynlpl v1.2.9
PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contai...
38 versions - Latest release: over 6 years ago - 477 stars on GitHub
github.com/jamesainslie/antimoji v0.9.16
Dealing with emoji slop from AI
26 versions - Latest release: 25 days ago - 0 stars on GitHub
Top 8.2% on proxy.golang.org
github.com/divanvisagie/runify v0.1.0
Convert text into runes
9 versions - Latest release: over 1 year ago - 1 star on GitHub
Top 4.5% on proxy.golang.org
github.com/qiniu/text v1.9.2
Qiniu Text Processing Libraries for Go
5 versions - Latest release: over 5 years ago - 13 dependent packages - 4 dependent repositories - 26 stars on GitHub
Top 7.6% on proxy.golang.org
github.com/nchern/cli-tools/pmail
This repo contains a set of handy command line tools
Latest release: 22 days ago - 0 stars on GitHub
github.com/wmentor/tokens v1.0.7
Text to tokens
8 versions - Latest release: almost 2 years ago - 4 dependent packages - 3 dependent repositories - 2 stars on GitHub
Top 6.1% on proxy.golang.org
github.com/drgsn/filefusion v0.1.4
FileFusion is a powerful file concatenation tool designed specifically for Large Language Model (...
9 versions - Latest release: 9 months ago - 6 stars on GitHub
Top 6.0% on proxy.golang.org
github.com/saucelabs/sypl v1.5.14
Package sypl provides a Simple Yet Powerful Logger built on top of the Golang logger. A sypl logg...
45 versions - Latest release: about 3 years ago - 4 dependent packages - 2 dependent repositories - 8 stars on GitHub
Top 8.2% on proxy.golang.org
github.com/rtuin/textprocessing v0.0.0-20170504124411-bbd23e8b60fd
Package textprocessing is a golang wrapper / client for the text-processing.com APIs.
1 version - Latest release: over 8 years ago - 2 stars on GitHub
Top 7.6% on proxy.golang.org
github.com/palumacil/flesch-index v1.0.0
A readability scoring tool written in Go.
1 version - Latest release: over 5 years ago - 0 stars on GitHub
Top 5.9% on proxy.golang.org
github.com/gagolews/stringx v0.2.9
Drop-in replacements for base R string functions powered by stringi
10 versions - Latest release: 9 months ago - 28 stars on GitHub
Top 5.4% on proxy.golang.org
github.com/agentstation/tokenizer v0.0.8
High-performance tokenizer implementations in Go with unified CLI. Features Llama 3 tokenizer wit...
8 versions - Latest release: 27 days ago - 0 stars on GitHub
Top 8.2% on proxy.golang.org
github.com/VladimirMarkelov/fb2text v0.0.0-20190131040324-9be8135c4d78
Convert FB2 book to text file with defined text width
1 version - Latest release: over 6 years ago - 6 stars on GitHub
Top 5.3% on proxy.golang.org
github.com/goplus/bpl v1.1.7
Binary Processing Language
7 versions - Latest release: over 3 years ago - 146 stars on GitHub
Top 5.6% on proxy.golang.org
github.com/roshan-research/hazm v0.10.0
Persian NLP Toolkit
8 versions - Latest release: over 1 year ago - 1,315 stars on GitHub
Top 8.2% on proxy.golang.org
github.com/PaluMacil/ham v1.0.0
A Naive Bayes SMS spam classifier written in Go.
1 version - Latest release: over 5 years ago - 13 stars on GitHub
Top 6.7% on proxy.golang.org
github.com/yaa110/rake-rs v0.1.4
Multilingual implementation of RAKE algorithm for Rust
4 versions - Latest release: over 6 years ago - 34 stars on GitHub
Top 9.6% on proxy.golang.org
github.com/gagolews/stringi v1.8.7
Fast and portable character string processing in R (with the Unicode ICU)
36 versions - Latest release: 6 months ago - 313 stars on GitHub
Top 6.1% on proxy.golang.org
github.com/PaluMacil/flesch-index v1.0.0
A readability scoring tool written in Go.
1 version - Latest release: over 5 years ago - 0 stars on GitHub
Top 5.4% on proxy.golang.org
github.com/cpcf/weft v1.1.0
Code gen kit for Go
2 versions - Latest release: about 1 month ago - 0 stars on GitHub
Top 2.1% on proxy.golang.org
github.com/acarl005/stripansi v0.0.0-20180116102854-5a71ef0e047d
A little Go package for removing ANSI color escape codes from strings.
1 version - Latest release: over 7 years ago - 1,669 dependent packages - 2,944 dependent repositories - 93 stars on GitHub
Top 6.2% on proxy.golang.org
github.com/pemistahl/lingua-go/cmd v0.0.0-20230905150314-320c87b00cfb
The most accurate natural language detection library for Go, suitable for long and short text alike
2 versions - Latest release: about 2 years ago - 954 stars on GitHub
Top 8.2% on proxy.golang.org
github.com/uk-ipop/drug-extraction v1.3.0
A toolbox for extracting drug mentions from text fields with RxNorm integration.
15 versions - Latest release: over 1 year ago - 1 star on GitHub
Top 8.2% on proxy.golang.org
github.com/UK-IPOP/drug-extraction v1.3.0
Package main runs the CLI.
15 versions - Latest release: over 1 year ago - 1 star on GitHub
Top 6.7% on proxy.golang.org
github.com/proycon/colibri-core v2.5.9+incompatible
Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic...
30 versions - Latest release: over 2 years ago - 129 stars on GitHub
Top 6.2% on proxy.golang.org
github.com/orsinium-labs/anonymizer v1.0.1
🙈 Go package for anonymizing text. Removes all kinds of PII: names, places, phone numbers, etc.
2 versions - Latest release: 10 months ago - 2 stars on GitHub
Top 7.0% on proxy.golang.org
qiniupkg.com/text v1.10.0
Qiniu Text Processing Libraries for Go
5 versions - Latest release: over 2 years ago - 1 dependent repository - 33 stars on GitHub
Top 5.5% on proxy.golang.org
gopkg.in/01walid/goarabic.v0 v0.0.1
Package goarabic contains utility functions for working with Arabic strings.
1 version - Latest release: over 10 years ago - 105 stars on GitHub
Top 8.2% on proxy.golang.org
github.com/palumacil/ham v1.0.0
A Naive Bayes SMS spam classifier written in Go.
1 version - Latest release: over 5 years ago - 13 stars on GitHub
github.com/sfischer13/datautils v0.0.0-20210501203744-bff137231a87
Package datautils is a collection of handy text manipulation tools.
1 version - Latest release: over 4 years ago - 8 stars on GitHub
Top 6.8% on proxy.golang.org
github.com/nchern/cli-tools
This repo contains a set of handy command line tools
Latest release: about 2 months ago - 0 stars on GitHub
Top 4.4% on proxy.golang.org
github.com/pemistahl/lingua-go/serialization
The most accurate natural language detection library for Go, suitable for long and short text alike
Latest release: about 1 month ago - 2 dependent packages - 862 stars on GitHub
Top 8.2% on proxy.golang.org
github.com/aereal/gohn v0.0.0-20170803151215-81022b65c357
Hatena Notation (はてな記法) Parser written in Go
1 version - Latest release: about 8 years ago - 17 stars on GitHub
Top 6.9% on proxy.golang.org
github.com/gokultp/tql
tql is a tool that enables querying formatted text files, such as log files, with SQL-like query syntax
Latest release: 2 months ago - 2 stars on GitHub
Top 8.2% on proxy.golang.org
github.com/vladimirmarkelov/fb2text v0.0.0-20190131040324-9be8135c4d78
Convert FB2 book to text file with defined text width
1 version - Latest release: over 6 years ago - 6 stars on GitHub
Top 7.2% on proxy.golang.org
github.com/raypereda/normalize
Normalize text
Latest release: 6 months ago - 1 star on GitHub