An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

crates.io "text-processing" keyword

View the packages on the crates.io package registry that are tagged with the "text-processing" keyword.

shiva 1.4.9
Shiva library: Implementation in Rust of a parser and generator for documents of any type
37 versions - Latest release: 10 months ago - 1 dependent package - 43.7 thousand downloads total - 373 stars on GitHub - 1 maintainer
xxxxx_rust_sts 0.1.0
A collection of useful string and file utilities for Rust
1 version - Latest release: 2 months ago - 339 downloads total - 1 maintainer
Top 4.2% on crates.io
unic-char-property 0.9.0
UNIC — Unicode Character Tools — Character Property taxonomy, contracts and build macros
4 versions - Latest release: over 6 years ago - 19 dependent packages - 2,618 dependent repositories - 25.6 million downloads total - 242 stars on GitHub - 1 maintainer
Top 8.2% on crates.io
unic-ucd-case 0.9.0
UNIC — Unicode Character Database — Case Properties
4 versions - Latest release: over 6 years ago - 2 dependent packages - 27 dependent repositories - 106 thousand downloads total - 242 stars on GitHub - 1 maintainer
Top 5.3% on crates.io
unic-ucd-ident 0.9.0
UNIC — Unicode Character Database — Identifier Properties
3 versions - Latest release: over 6 years ago - 8 dependent packages - 304 dependent repositories - 6.39 million downloads total - 242 stars on GitHub - 1 maintainer
Top 6.8% on crates.io
unic-ucd 0.9.0
UNIC — Unicode Character Database
11 versions - Latest release: over 6 years ago - 9 dependent packages - 28 dependent repositories - 112 thousand downloads total - 242 stars on GitHub - 1 maintainer
Top 7.4% on crates.io
unic-ucd-name 0.9.0
UNIC — Unicode Character Database — Name
4 versions - Latest release: over 6 years ago - 4 dependent packages - 30 dependent repositories - 111 thousand downloads total - 242 stars on GitHub - 1 maintainer
Top 7.1% on crates.io
unic-ucd-hangul 0.9.0
UNIC — Unicode Character Database — Hangul Syllable Composition & Decomposition
2 versions - Latest release: over 6 years ago - 3 dependent packages - 89 dependent repositories - 346 thousand downloads total - 242 stars on GitHub - 1 maintainer
unic-cli 0.9.0
UNIC Command-Line Tools
3 versions - Latest release: over 6 years ago - 4.61 thousand downloads total - 242 stars on GitHub - 1 maintainer
Top 9.4% on crates.io
unic-idna 0.9.0
UNIC — Unicode IDNA Compatibility Processing
8 versions - Latest release: over 6 years ago - 2 dependent packages - 24 dependent repositories - 67.7 thousand downloads total - 242 stars on GitHub - 1 maintainer
Top 9.3% on crates.io
unic-ucd-name_aliases 0.9.0
UNIC — Unicode Character Database — Name Aliases
1 version - Latest release: over 6 years ago - 1 dependent package - 27 dependent repositories - 95.4 thousand downloads total - 242 stars on GitHub - 1 maintainer
Top 5.9% on crates.io
unic-common 0.9.0
UNIC — Common Utilities
3 versions - Latest release: over 6 years ago - 2 dependent packages - 2,619 dependent repositories - 25.6 million downloads total - 242 stars on GitHub - 1 maintainer
Top 4.2% on crates.io
unic-ucd-version 0.9.0
UNIC — Unicode Character Database — Version
3 versions - Latest release: over 6 years ago - 18 dependent packages - 2,619 dependent repositories - 25.6 million downloads total - 242 stars on GitHub - 1 maintainer
Top 9.9% on crates.io
unic-char 0.9.0
UNIC — Unicode Character Tools
4 versions - Latest release: over 6 years ago - 1 dependent package - 11 dependent repositories - 48.8 thousand downloads total - 242 stars on GitHub - 1 maintainer
Top 7.1% on crates.io
unic-ucd-age 0.9.0
UNIC — Unicode Character Database — Age
7 versions - Latest release: over 6 years ago - 3 dependent packages - 79 dependent repositories - 326 thousand downloads total - 242 stars on GitHub - 1 maintainer
Top 7.1% on crates.io
unic-ucd-normal 0.9.0
UNIC — Unicode Character Database — Normalization Properties
10 versions - Latest release: over 6 years ago - 3 dependent packages - 90 dependent repositories - 359 thousand downloads total - 242 stars on GitHub - 1 maintainer
Top 4.2% on crates.io
unic-char-range 0.9.0
UNIC — Unicode Character Tools — Character Range and Iteration
4 versions - Latest release: over 6 years ago - 20 dependent packages - 2,619 dependent repositories - 25.5 million downloads total - 242 stars on GitHub - 1 maintainer
Top 8.1% on crates.io
unic 0.9.0
UNIC: Unicode and Internationalization Crates
10 versions - Latest release: over 6 years ago - 4 dependent packages - 11 dependent repositories - 58.3 thousand downloads total - 242 stars on GitHub - 1 maintainer
Top 9.9% on crates.io
unic-emoji 0.9.0
UNIC — Unicode Emoji
3 versions - Latest release: over 6 years ago - 1 dependent package - 11 dependent repositories - 47.9 thousand downloads total - 242 stars on GitHub - 1 maintainer
Top 5.9% on crates.io
unic-ucd-bidi 0.9.0
UNIC — Unicode Character Database — Bidi Properties
9 versions - Latest release: over 6 years ago - 6 dependent packages - 458 dependent repositories - 766 thousand downloads total - 242 stars on GitHub - 1 maintainer
Top 8.9% on crates.io
unic-char-basics 0.9.0
UNIC — Unicode Character Tools — Basic Stable Character Properties
2 versions - Latest release: over 6 years ago - 2 dependent packages - 11 dependent repositories - 46 thousand downloads total - 242 stars on GitHub - 1 maintainer
Top 5.0% on crates.io
unic-segment 0.9.0
UNIC — Unicode Text Segmentation Algorithms
3 versions - Latest release: over 6 years ago - 8 dependent packages - 1,697 dependent repositories - 18 million downloads total - 242 stars on GitHub - 1 maintainer
Top 6.2% on crates.io
unic-ucd-segment 0.9.0
UNIC — Unicode Character Database — Segmentation Properties
3 versions - Latest release: over 6 years ago - 2 dependent packages - 1,688 dependent repositories - 18.1 million downloads total - 242 stars on GitHub - 1 maintainer
Top 8.0% on crates.io
unic-idna-punycode 0.9.0
UNIC — Implementation of Punycode (RFC 3492) algorithm
9 versions - Latest release: over 6 years ago - 3 dependent packages - 19 dependent repositories - 92.7 thousand downloads total - 242 stars on GitHub - 1 maintainer
rust-persian-tools 1.1.4
Official Rust implementation of Persian Tools
7 versions - Latest release: about 1 year ago - 1 dependent package - 8.63 thousand downloads total - 73 stars on GitHub - 1 maintainer
aho-corasick-unsafe 0.0.4 💰
Fast multiple substring searching.
4 versions - Latest release: about 1 year ago - 5.47 thousand downloads total - 1,137 stars on GitHub - 1 maintainer
Top 2.2% on crates.io
aho-corasick 1.1.3 💰
Fast multiple substring searching.
61 versions - Latest release: over 1 year ago - 144 dependent packages - 66,037 dependent repositories - 512 million downloads total - 1,137 stars on GitHub - 1 maintainer
spongebob 2.0.1
A utility to convert text to spongebob case a.k.a tHe MoCkInG sPoNgEbOb MeMe.
8 versions - Latest release: 8 months ago - 8.05 thousand downloads total - 1 stars on GitHub - 1 maintainer
in_definite 1.1.0
Get the indefinite article ('a' or 'an') to match the given word. For example: an umbrella, a user.
19 versions - Latest release: 1 day ago - 3 dependent packages - 4 dependent repositories - 109 thousand downloads total - 2 stars on GitHub - 1 maintainer
stam-tools 0.11.1
Command-line tools for working with stand-off annotations on text (STAM)
26 versions - Latest release: about 1 month ago - 24.6 thousand downloads total - 3 stars on GitHub - 1 maintainer
email-address-extractor 1.0.1
A blazingly fast command line tool written in pure safe Rust to automatically extract email addre...
2 versions - Latest release: about 1 year ago - 2.42 thousand downloads total - 1 stars on GitHub - 1 maintainer
Top 6.7% on crates.io
unic-ucd-common 0.9.0
UNIC — Unicode Character Database — Common Properties
3 versions - Latest release: over 6 years ago - 6 dependent packages - 31 dependent repositories - 294 thousand downloads total - 234 stars on GitHub - 1 maintainer
Top 6.1% on crates.io
unic-bidi 0.9.0
UNIC — Unicode Bidirectional Algorithm
8 versions - Latest release: over 6 years ago - 6 dependent packages - 379 dependent repositories - 412 thousand downloads total - 234 stars on GitHub - 1 maintainer
Top 6.3% on crates.io
unic-normal 0.9.0
UNIC — Unicode Normalization Forms
9 versions - Latest release: over 6 years ago - 8 dependent packages - 79 dependent repositories - 314 thousand downloads total - 230 stars on GitHub - 1 maintainer
Top 9.5% on crates.io
unic-idna-mapping 0.9.0
UNIC — IDNA — IDNA Mapping Table
6 versions - Latest release: over 6 years ago - 1 dependent package - 19 dependent repositories - 65.3 thousand downloads total - 234 stars on GitHub - 1 maintainer
Top 5.1% on crates.io
unic-ucd-category 0.9.0
UNIC — Unicode Character Database — General Category
5 versions - Latest release: over 6 years ago - 14 dependent packages - 384 dependent repositories - 1.21 million downloads total - 234 stars on GitHub - 1 maintainer
Top 8.2% on crates.io
unic-ucd-block 0.9.0
UNIC — Unicode Character Database — Unicode Blocks
2 versions - Latest release: over 6 years ago - 3 dependent packages - 34 dependent repositories - 98.8 thousand downloads total - 234 stars on GitHub - 1 maintainer
Top 4.7% on crates.io
unic-emoji-char 0.9.0
UNIC — Unicode Emoji — Emoji Character Properties
3 versions - Latest release: over 6 years ago - 16 dependent packages - 438 dependent repositories - 3.12 million downloads total - 234 stars on GitHub - 1 maintainer
unic-ucd-utils
UNIC - Utilities for working with Unicode Code Points
4 versions - Latest release: 2 days ago - 1 dependent package - 5.57 thousand downloads total - 243 stars on GitHub - 1 maintainer
rake 0.3.6
Rust implementation of Rapid Automatic Keyword Extraction (RAKE) algorithm
13 versions - Latest release: 7 months ago - 1 dependent repositories - 25.1 thousand downloads total - 34 stars on GitHub - 1 maintainer
tashkil 0.1.0 💰
A lightweight library for removing Arabic diacritics
1 version - Latest release: almost 3 years ago - 1.39 thousand downloads total - 19 stars on GitHub - 1 maintainer
nlpo3 1.4.0
Thai natural language processing library, with Python and Node bindings
8 versions - Latest release: 10 months ago - 1 dependent package - 1 dependent repositories - 20.1 thousand downloads total - 35 stars on GitHub - 2 maintainers
nlpo3-cli 0.2.0
Command line interface for nlpO3, a Thai natural language processing library
3 versions - Latest release: about 4 years ago - 3.79 thousand downloads total - 35 stars on GitHub - 2 maintainers
drug-extraction-core 0.1.2
A core library for extracting drugs from text records
3 versions - Latest release: about 3 years ago - 1 dependent package - 4.23 thousand downloads total - 3 stars on GitHub - 1 maintainer
drug-extraction-cli 1.3.0
A CLI for extracting drugs from text records
6 versions - Latest release: over 1 year ago - 8.89 thousand downloads total - 3 stars on GitHub - 1 maintainer
Top 8.9% on crates.io
qp-trie 0.8.2
An idiomatic and fast QP-trie implementation in pure Rust, written with an emphasis on safety.
21 versions - Latest release: almost 2 years ago - 5 dependent packages - 25 dependent repositories - 186 thousand downloads total - 101 stars on GitHub - 1 maintainer
whichlang 0.1.1
A blazingly fast and lightweight language detection library for Rust.
2 versions - Latest release: 8 months ago - 2 dependent packages - 1 dependent repositories - 103 thousand downloads total - 416 stars on GitHub - 3 maintainers
flashtext2 0.2.0
The FlashText algorithm implemented in Rust
6 versions - Latest release: about 1 year ago - 7.89 thousand downloads total - 8 stars on GitHub - 1 maintainer
vtext 0.2.0
NLP with Rust
4 versions - Latest release: about 5 years ago - 3 dependent repositories - 13.4 thousand downloads total - 153 stars on GitHub - 1 maintainer
skan 0.1.0
Skan is a Rust-native, Java Scanner-inspired library that provides type-safe, convenient methods ...
1 version - Latest release: 17 days ago - 207 downloads total - 0 stars on GitHub - 1 maintainer
rfgrep 0.2.1
Recursive file grep utility with advanced filtering - search, list, and analyze text files with r...
4 versions - Latest release: 14 days ago - 1.22 thousand downloads total - 6 stars on GitHub - 1 maintainer
sesdiff 0.3.1
Generates a shortest edit script (Myers' diff algorithm) to indicate how to get from the strings ...
8 versions - Latest release: 11 months ago - 1 dependent package - 1 dependent repositories - 10.9 thousand downloads total - 4 stars on GitHub - 1 maintainer
abbreviation_extractor 0.1.4
A library for extracting abbreviations from text.
5 versions - Latest release: 12 months ago - 5.77 thousand downloads total - 1 stars on GitHub - 1 maintainer
wildcard 0.3.0
Wildcard matching
4 versions - Latest release: 10 months ago - 1.68 million downloads total - 197 stars on GitHub - 1 maintainer
codetypo-dict 0.12.7
Source Code Spelling Correction
2 versions - Latest release: 6 months ago - 1.5 thousand downloads total - 0 stars on GitHub - 1 maintainer
srch 0.0.1 💰
Text Search For Humans
1 version - Latest release: almost 2 years ago - 1.45 thousand downloads total - 76 stars on GitHub - 1 maintainer
stam 0.17.0
STAM is a powerful library for dealing with stand-off annotations on text. This is the Rust library.
29 versions - Latest release: about 2 months ago - 2 dependent packages - 30.8 thousand downloads total - 5 stars on GitHub - 1 maintainer
hck 0.11.4
A sharp cut(1) clone.
52 versions - Latest release: 6 months ago - 63.8 thousand downloads total - 723 stars on GitHub - 1 maintainer
lexmatch 0.3.0
This is a simple lexicon matching tool that, given a lexicon of words or phrases, identifies all ...
3 versions - Latest release: about 1 year ago - 3.78 thousand downloads total - 2 stars on GitHub - 1 maintainer
vi 0.8.0
An input method library for vietnamese IME
21 versions - Latest release: 2 months ago - 25.5 thousand downloads total - 153 stars on GitHub - 1 maintainer
moguls 0.1.1
Let the words of financial moguls inspire and guide you in your quest for financial excellence ...
2 versions - Latest release: almost 2 years ago - 2.48 thousand downloads total - 1 stars on GitHub - 1 maintainer
folia 0.0.6
High-performance library for handling the FoLiA XML format (Format for Linguistic Annotation)
6 versions - Latest release: almost 5 years ago - 1 dependent package - 1 dependent repositories - 8.78 thousand downloads total - 4 stars on GitHub - 1 maintainer
s3-concat 1.1.0
Concatenate Amazon S3 files remotely using flexible patterns
2 versions - Latest release: about 6 years ago - 3.16 thousand downloads total - 38 stars on GitHub - 1 maintainer
codetypo-vars 0.9.1
Source Code Spelling Correction
1 version - Latest release: 6 months ago - 727 downloads total - 0 stars on GitHub - 1 maintainer
typope 0.4.0
Pedantic source code checker for orthotypography mistakes and other typographical errors
6 versions - Latest release: 6 months ago - 4.86 thousand downloads total - 1 stars on GitHub - 1 maintainer
mime-rs 0.3.0
A text processing framework, inspired by Emacs lisp and keyboard macros.
1 version - Latest release: over 2 years ago - 1.71 thousand downloads total - 7 stars on GitHub - 1 maintainer
ised 0.3.2
An interactive tool for find-and-replace across many files
6 versions - Latest release: 4 months ago - 2.63 thousand downloads total - 5 stars on GitHub - 1 maintainer
taggie 0.1.0
Edit audio tags in your favorite text editor
1 version - Latest release: over 4 years ago - 1.58 thousand downloads total - 24 stars on GitHub - 1 maintainer
text-sanitizer 1.6.0
convert text to plain ASCII text
11 versions - Latest release: over 2 years ago - 11.8 thousand downloads total - 2 stars on GitHub - 1 maintainer
cindex 0.5.2
CSV indexing library
15 versions - Latest release: over 1 year ago - 2 dependent packages - 2 dependent repositories - 19.5 thousand downloads total - 0 stars on GitHub - 1 maintainer
lngcnv 1.10.2 💰
linguistics: display pronunciation, translate between dialects, convert between orthographies; su...
57 versions - Latest release: 4 months ago - 63.8 thousand downloads total - 22 stars on GitHub - 1 maintainer
rjc 0.2.3
rjc converts the output of many commands, file-types, and strings to JSON, YAML, or TOML
6 versions - Latest release: about 2 years ago - 7.27 thousand downloads total - 1 stars on GitHub - 1 maintainer
bible-io 1.0.0
A Rust library for working with Bible text data structures
1 version - Latest release: 11 days ago - 0 downloads total - 1 maintainer
sliceslice 0.4.3
A fast implementation of single-pattern substring search using SIMD acceleration
9 versions - Latest release: about 1 year ago - 2 dependent repositories - 1.6 million downloads total - 97 stars on GitHub - 1 maintainer
pray 1.5.0
A tui tool for preparing a prompt to the llms.
11 versions - Latest release: 8 months ago - 7.48 thousand downloads total - 1 stars on GitHub - 1 maintainer
uniaxe 0.1.1
A Rust crate to replace Unicode letters with Ascii equivalents
2 versions - Latest release: over 4 years ago - 2.93 thousand downloads total - 6 stars on GitHub - 1 maintainer
autoruby-cli 0.5.1
CLI to easily generate furigana for various document formats
8 versions - Latest release: almost 2 years ago - 9.82 thousand downloads total - 6 stars on GitHub - 1 maintainer
jackdauer 0.1.2
Use this Rust crate to easily parse various time formats to durations
3 versions - Latest release: over 2 years ago - 1 dependent package - 1 dependent repositories - 8.04 thousand downloads total - 8 stars on GitHub - 1 maintainer
sakurs-cli 0.1.1 💰
Command-line interface for Sakurs sentence boundary detection
2 versions - Latest release: about 1 month ago - 444 downloads total - 1 stars on GitHub - 1 maintainer
sakurs-core 0.1.1 💰
High-performance sentence boundary detection using Delta-Stack Monoid algorithm
2 versions - Latest release: about 1 month ago - 456 downloads total - 1 stars on GitHub - 2 maintainers
translitrs 0.2.2
Transliteration utility for Serbian language
3 versions - Latest release: over 2 years ago - 3.63 thousand downloads total - 6 stars on GitHub - 1 maintainer
merge-whitespace 1.1.0
Procedural macros for merging whitespace in const contexts
3 versions - Latest release: 9 months ago - 3.1 thousand downloads total - 2 stars on GitHub - 1 maintainer
merge-whitespace-utils 1.1.0
Procedural macros for merging whitespace in const contexts
1 version - Latest release: 9 months ago - 894 downloads total - 2 stars on GitHub - 1 maintainer
bidi 0.1.1
Implementation of the Unicode Bidirectional Algorithm (UBA).
2 versions - Latest release: almost 2 years ago - 2.65 thousand downloads total - 0 stars on GitHub - 1 maintainer
untanglr 1.1.0
Probabilistically split concatenated words using NLP based on English Wikipedia unigram frequencies.
7 versions - Latest release: about 3 years ago - 9.11 thousand downloads total - 14 stars on GitHub - 1 maintainer
fuzzy-string-distance 1.0.0
Fuzzy string distance comparisons
1 version - Latest release: 12 months ago - 1.06 thousand downloads total - 1 stars on GitHub - 1 maintainer
booky 0.7.0
A tool to analyze English text
7 versions - Latest release: 29 days ago - 2.5 thousand downloads total - 1 stars on GitHub - 1 maintainer
lazy-transform-str 0.0.6 💰
Lazy-copying lazy-allocated scanning `str` transformations. This is good e.g. for (un)escaping te...
6 versions - Latest release: almost 5 years ago - 1 dependent package - 5 dependent repositories - 29.2 thousand downloads total - 1 stars on GitHub - 1 maintainer
Top 8.4% on crates.io
bytelines 2.5.0
Read input lines as byte slices for high efficiency
10 versions - Latest release: over 1 year ago - 13 dependent packages - 30 dependent repositories - 267 thousand downloads total - 66 stars on GitHub - 1 maintainer
iterate-text 0.0.1
Library of helper functions and structures for iterating over text files
1 version - Latest release: over 4 years ago - 1 dependent repositories - 1.75 thousand downloads total - 1 stars on GitHub - 1 maintainer
bigstr 0.1.1 💰
A command-line tool to make string BIG
2 versions - Latest release: about 1 year ago - 2.22 thousand downloads total - 1 stars on GitHub - 1 maintainer
unic-ucd-core 0.6.0
UNIC - Unicode Character Database - Version
6 versions - Latest release: almost 8 years ago - 11 dependent packages - 1 dependent repositories - 19.4 thousand downloads total - 242 stars on GitHub - 1 maintainer
kda-tools 1.3.1 💰
Tools for doing data management on a match journal, specifally for Hunt Showdown, but it'll work...
3 versions - Latest release: about 2 years ago - 4.44 thousand downloads total - 3 stars on GitHub - 1 maintainer
supply-chain-trust-example-crate-000022 1.21.2
Single assignment cells and lazy values.
3 versions - Latest release: 6 months ago - 2.15 thousand downloads total - 1,027 stars on GitHub - 1 maintainer
typed-dialogflow 0.1.0
An easy-to-use typed Google Dialogflow client
1 version - Latest release: over 3 years ago - 1.5 thousand downloads total - 0 stars on GitHub - 1 maintainer
analiticcl 0.4.8
Analiticcl is an approximate string matching or fuzzy-matching system that can be used to find va...
16 versions - Latest release: 6 months ago - 20.6 thousand downloads total - 37 stars on GitHub - 1 maintainer
Top 9.6% on crates.io
daachorse 1.0.0
Daachorse: Double-Array Aho-Corasick
11 versions - Latest release: about 3 years ago - 12 dependent packages - 4 dependent repositories - 560 thousand downloads total - 227 stars on GitHub - 2 maintainers
aneubeck-daachorse 1.1.1
Daachorse: Double-Array Aho-Corasick
2 versions - Latest release: 12 months ago - 34.5 thousand downloads total - 227 stars on GitHub - 3 maintainers
suffixsort 0.2.0
Library for suffix (inverse lexicographic) sorting
1 version - Latest release: 16 days ago - 0 downloads total - 1 maintainer
matcher_rs 0.5.8
A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matchin...
41 versions - Latest release: 15 days ago - 44.4 thousand downloads total - 15 stars on GitHub - 1 maintainer