An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

crates.io "text-processing" keyword

Top 2.2% on crates.io
aho-corasick 1.1.4 💰
Fast multiple substring searching.
62 versions - Latest release: 5 months ago - 144 dependent packages - 66,037 dependent repositories - 718 million downloads total - 1,214 stars on GitHub - 1 maintainer
aho-corasick-unsafe 💰
Fast multiple substring searching.
4 versions - Latest release: about 5 hours ago - 6.31 thousand downloads total - 1,214 stars on GitHub - 1 maintainer
waken_snowball 0.1.0
Rust implementation of Snowball stemming algorithms for 33 languages
1 version - Latest release: 8 months ago - 1.16 thousand downloads total - 841 stars on GitHub - 1 maintainer
trustformers-tokenizers 0.1.0
Tokenizers for TrustformeRS
4 versions - Latest release: about 20 hours ago - 292 downloads total - 1 maintainer
rustling 0.8.0 💰
A blazingly fast library for computational linguistics
8 versions - Latest release: 1 day ago - 5.48 thousand downloads total - 1 stars on GitHub - 1 maintainer
zahirscan 0.2.14
Token-efficient content compression for AI analysis using probabilistic template mining
10 versions - Latest release: 1 day ago - 198 downloads total - 0 stars on GitHub - 1 maintainer
Top 9.6% on crates.io
daachorse 1.0.0
Daachorse: Double-Array Aho-Corasick
11 versions - Latest release: over 3 years ago - 12 dependent packages - 4 dependent repositories - 711 thousand downloads total - 244 stars on GitHub - 2 maintainers
aneubeck-daachorse 1.1.1
Daachorse: Double-Array Aho-Corasick
2 versions - Latest release: over 1 year ago - 81.4 thousand downloads total - 244 stars on GitHub - 3 maintainers
bidi 0.1.1
Implementation of the Unicode Bidirectional Algorithm (UBA).
2 versions - Latest release: over 2 years ago - 2.93 thousand downloads total - 0 stars on GitHub - 1 maintainer
html-to-markdown-cli 2.28.6 💰
Command-line interface for html-to-markdown - high-performance HTML to Markdown converter
97 versions - Latest release: 2 days ago - 5.85 thousand downloads total - 533 stars on GitHub - 1 maintainer
lazy-transform-str 0.0.6 💰
Lazy-copying lazy-allocated scanning `str` transformations. This is good e.g. for (un)escaping te...
6 versions - Latest release: over 5 years ago - 1 dependent package - 5 dependent repositories - 30.8 thousand downloads total - 1 stars on GitHub - 1 maintainer
capitalize 0.3.4 💰
Change first character to upper case and the rest to lower case, and other common alternatives
7 versions - Latest release: almost 2 years ago - 1 dependent package - 1 dependent repositories - 116 thousand downloads total - 1 stars on GitHub - 1 maintainer
rawk-core 0.4.0
Core library for an AWK interpreter with the goal to be POSIX compatible.
14 versions - Latest release: 3 days ago - 308 downloads total - 0 stars on GitHub - 1 maintainer
unic-utils 0.6.0
UNIC - Utilities
2 versions - Latest release: over 8 years ago - 9 dependent packages - 1 dependent repositories - 8.95 thousand downloads total - 242 stars on GitHub - 1 maintainer
langdetect-rs 0.2.3
Language detection in Rust. Port of Mimino666's langdetect.
5 versions - Latest release: 4 months ago - 206 downloads total - 1 maintainer
nucleo-matcher 0.3.1
plug and play high performance fuzzy matcher
5 versions - Latest release: about 2 years ago - 4 dependent packages - 11 dependent repositories - 1.63 million downloads total - 1,218 stars on GitHub - 3 maintainers
lngcnv 1.10.2 💰
linguistics: display pronunciation, translate between dialects, convert between orthographies; su...
57 versions - Latest release: 10 months ago - 67.5 thousand downloads total - 22 stars on GitHub - 1 maintainer
text-tags 0.1.0
A lightweight, text-tag markup parser
1 version - Latest release: 8 months ago - 448 downloads total - 0 stars on GitHub - 1 maintainer
shiva 1.4.9
Shiva library: Implementation in Rust of a parser and generator for documents of any type
37 versions - Latest release: over 1 year ago - 1 dependent package - 47.3 thousand downloads total - 397 stars on GitHub - 1 maintainer
unic-ucd-core 0.6.0
UNIC - Unicode Character Database - Version
6 versions - Latest release: over 8 years ago - 11 dependent packages - 1 dependent repositories - 20.9 thousand downloads total - 242 stars on GitHub - 1 maintainer
correct_word 0.2.0
A No brainer 'did you mean' library for Rust
4 versions - Latest release: about 1 year ago - 2 dependent packages - 5.89 thousand downloads total - 0 stars on GitHub - 1 maintainer
trexter 0.1.1
Text progression tracking library
2 versions - Latest release: over 3 years ago - 1 dependent package - 1 dependent repositories - 4.04 thousand downloads total - 0 stars on GitHub - 1 maintainer
slicestring 0.3.3
slicestring is a crate for slicing Strings
10 versions - Latest release: over 2 years ago - 2 dependent packages - 1 dependent repositories - 21.2 thousand downloads total - 0 stars on GitHub - 1 maintainer
in_definite 1.1.2
Get the indefinite article ('a' or 'an') to match the given word. For example: an umbrella, a user.
21 versions - Latest release: 7 months ago - 3 dependent packages - 4 dependent repositories - 133 thousand downloads total - 2 stars on GitHub - 1 maintainer
stam 0.18.6
STAM is a powerful library for dealing with stand-off annotations on text. This is the Rust library.
36 versions - Latest release: 4 months ago - 2 dependent packages - 36.3 thousand downloads total - 5 stars on GitHub - 1 maintainer
primo 0.0.1
Sort a file, correctly handling multi-digits numbers
1 version - Latest release: almost 9 years ago - 2.11 thousand downloads total - 1 stars on GitHub - 1 maintainer
korrektor-utils 0.1.2 💰
Utils library for korrektor-rs
3 versions - Latest release: almost 3 years ago - 1 dependent package - 4.52 thousand downloads total - 5 stars on GitHub - 1 maintainer
rjc 0.2.3
rjc converts the output of many commands, file-types, and strings to JSON, YAML, or TOML
6 versions - Latest release: over 2 years ago - 7.83 thousand downloads total - 1 stars on GitHub - 1 maintainer
intspan 0.8.7
Command line tools for IntSpan related bioinformatics operations
55 versions - Latest release: 12 months ago - 1 dependent package - 1 dependent repositories - 79.9 thousand downloads total - 12 stars on GitHub - 1 maintainer
iterate-text 0.0.1
Library of helper functions and structures for iterating over text files
1 version - Latest release: almost 5 years ago - 1 dependent repositories - 2.03 thousand downloads total - 1 stars on GitHub - 1 maintainer
kweepeer 0.1.2
A generic webservice for interactive query expansion, expansion is provided via various modules
3 versions - Latest release: about 1 year ago - 1.98 thousand downloads total - 0 stars on GitHub - 1 maintainer
unic-ucd-utils
UNIC - Utilities for working with Unicode Code Points
4 versions - Latest release: 5 days ago - 1 dependent package - 5.8 thousand downloads total - 243 stars on GitHub - 1 maintainer
rawk-cli 0.1.0
The rawk cli, which is an AWK interpreter clone. The goal is to be POSIX compatible.
1 version - Latest release: 5 days ago - 0 downloads total - 0 stars on GitHub - 1 maintainer
suffixsort 0.3.0
Library for suffix (inverse lexicographic) sorting
2 versions - Latest release: 7 months ago - 964 downloads total - 0 stars on GitHub - 1 maintainer
uniaxe 0.1.1
A Rust crate to replace Unicode letters with Ascii equivalents
2 versions - Latest release: about 5 years ago - 3.27 thousand downloads total - 6 stars on GitHub - 1 maintainer
kaolinite 0.9.5
A crate to assist in the creation of TUI text editors.
23 versions - Latest release: over 1 year ago - 30.2 thousand downloads total - 18 stars on GitHub - 1 maintainer
lexmatch 0.3.0
This is a simple lexicon matching tool that, given a lexicon of words or phrases, identifies all ...
3 versions - Latest release: over 1 year ago - 4.05 thousand downloads total - 2 stars on GitHub - 1 maintainer
codetypo-vars 0.9.1
Source Code Spelling Correction
1 version - Latest release: about 1 year ago - 1.15 thousand downloads total - 0 stars on GitHub - 1 maintainer
tfidf-summarizer 2.0.0
Basic tf-idf compute for documents
1 version - Latest release: over 2 years ago - 1.97 thousand downloads total - 0 stars on GitHub - 1 maintainer
rustorch-text 0.1.2
NLP utilities and datasets for RusTorch
2 versions - Latest release: 6 days ago - 13 downloads total - 1 maintainer
hck 0.11.5
A sharp cut(1) clone.
53 versions - Latest release: 4 months ago - 68.3 thousand downloads total - 720 stars on GitHub - 1 maintainer
translitrs 0.2.2 💰
Transliteration utility for Serbian language
3 versions - Latest release: about 3 years ago - 3.96 thousand downloads total - 6 stars on GitHub - 1 maintainer
vtext 0.2.0
NLP with Rust
4 versions - Latest release: almost 6 years ago - 3 dependent repositories - 14.9 thousand downloads total - 153 stars on GitHub - 1 maintainer
analiticcl 0.4.9
Analiticcl is an approximate string matching or fuzzy-matching system that can be used to find va...
17 versions - Latest release: 3 months ago - 22.5 thousand downloads total - 37 stars on GitHub - 1 maintainer
dedup 0.2.0
A blazingly fast command-line text deduplicator.
1 version - Latest release: almost 8 years ago - 2.15 thousand downloads total - 15 stars on GitHub - 1 maintainer
spongmock
MoCkInG SpOnGeBoB SqUaRePaNtS TeXt gEnErAtOr
12 versions - Latest release: 6 days ago - 13.3 thousand downloads total - 1 maintainer
arabic_text_utils 0.1.0
A Rust library for Arabic text processing and manipulation
1 version - Latest release: about 1 year ago - 815 downloads total - 1 maintainer
merge-whitespace 1.1.0
Procedural macros for merging whitespace in const contexts
3 versions - Latest release: over 1 year ago - 3.42 thousand downloads total - 2 stars on GitHub - 1 maintainer
Top 8.4% on crates.io
bytelines 2.5.0
Read input lines as byte slices for high efficiency
10 versions - Latest release: about 2 years ago - 13 dependent packages - 30 dependent repositories - 307 thousand downloads total - 66 stars on GitHub - 1 maintainer
codetypo-dict 0.12.7
Source Code Spelling Correction
2 versions - Latest release: about 1 year ago - 2.01 thousand downloads total - 0 stars on GitHub - 1 maintainer
textcon 0.2.1
Template text files with file/directory references for AI/LLM consumption
4 versions - Latest release: 2 months ago - 446 downloads total - 1 stars on GitHub - 1 maintainer
sakurs-core 0.1.1 💰
High-performance sentence boundary detection using Delta-Stack Monoid algorithm
2 versions - Latest release: 8 months ago - 926 downloads total - 3 stars on GitHub - 2 maintainers
awk-rs 0.1.0
A 100% POSIX-compatible AWK implementation in Rust
1 version - Latest release: 3 months ago - 61 downloads total - 1 maintainer
ter
A cli to run text expressions and perform basic text operations such as filtering, ignoring and r...
2 versions - Latest release: 6 days ago - 2.76 thousand downloads total - 77 stars on GitHub - 1 maintainer
Top 9.4% on crates.io
unic-idna 0.9.0
UNIC — Unicode IDNA Compatibility Processing
8 versions - Latest release: about 7 years ago - 2 dependent packages - 24 dependent repositories - 82.4 thousand downloads total - 245 stars on GitHub - 1 maintainer
unic-cli 0.9.0
UNIC Command-Line Tools
3 versions - Latest release: about 7 years ago - 5.03 thousand downloads total - 245 stars on GitHub - 1 maintainer
Top 8.2% on crates.io
unic-ucd-case 0.9.0
UNIC — Unicode Character Database — Case Properties
4 versions - Latest release: about 7 years ago - 2 dependent packages - 27 dependent repositories - 129 thousand downloads total - 245 stars on GitHub - 1 maintainer
Top 5.9% on crates.io
unic-ucd-bidi 0.9.0
UNIC — Unicode Character Database — Bidi Properties
9 versions - Latest release: about 7 years ago - 6 dependent packages - 458 dependent repositories - 1.32 million downloads total - 245 stars on GitHub - 1 maintainer
Top 9.3% on crates.io
unic-ucd-name_aliases 0.9.0
UNIC — Unicode Character Database — Name Aliases
1 version - Latest release: about 7 years ago - 1 dependent package - 27 dependent repositories - 116 thousand downloads total - 245 stars on GitHub - 1 maintainer
Top 5.0% on crates.io
unic-segment 0.9.0
UNIC — Unicode Text Segmentation Algorithms
3 versions - Latest release: about 7 years ago - 8 dependent packages - 1,697 dependent repositories - 22.4 million downloads total - 245 stars on GitHub - 1 maintainer
Top 9.9% on crates.io
unic-char 0.9.0
UNIC — Unicode Character Tools
4 versions - Latest release: about 7 years ago - 1 dependent package - 11 dependent repositories - 58.6 thousand downloads total - 245 stars on GitHub - 1 maintainer
Top 9.9% on crates.io
unic-emoji 0.9.0
UNIC — Unicode Emoji
3 versions - Latest release: about 7 years ago - 1 dependent package - 11 dependent repositories - 61.1 thousand downloads total - 245 stars on GitHub - 1 maintainer
Top 6.8% on crates.io
unic-ucd 0.9.0
UNIC — Unicode Character Database
11 versions - Latest release: about 7 years ago - 9 dependent packages - 28 dependent repositories - 133 thousand downloads total - 245 stars on GitHub - 1 maintainer
Top 8.1% on crates.io
unic 0.9.0
UNIC: Unicode and Internationalization Crates
10 versions - Latest release: about 7 years ago - 4 dependent packages - 11 dependent repositories - 68.4 thousand downloads total - 245 stars on GitHub - 1 maintainer
Top 8.0% on crates.io
unic-idna-punycode 0.9.0
UNIC — Implementation of Punycode (RFC 3492) algorithm
9 versions - Latest release: about 7 years ago - 3 dependent packages - 19 dependent repositories - 115 thousand downloads total - 245 stars on GitHub - 1 maintainer
Top 8.9% on crates.io
unic-char-basics 0.9.0
UNIC — Unicode Character Tools — Basic Stable Character Properties
2 versions - Latest release: about 7 years ago - 2 dependent packages - 11 dependent repositories - 55.9 thousand downloads total - 245 stars on GitHub - 1 maintainer
Top 7.4% on crates.io
unic-ucd-name 0.9.0
UNIC — Unicode Character Database — Name
4 versions - Latest release: about 7 years ago - 4 dependent packages - 30 dependent repositories - 137 thousand downloads total - 245 stars on GitHub - 1 maintainer
Top 7.1% on crates.io
unic-ucd-age 0.9.0
UNIC — Unicode Character Database — Age
7 versions - Latest release: about 7 years ago - 3 dependent packages - 79 dependent repositories - 427 thousand downloads total - 245 stars on GitHub - 1 maintainer
Top 6.1% on crates.io
unic-bidi 0.9.0
UNIC — Unicode Bidirectional Algorithm
8 versions - Latest release: about 7 years ago - 6 dependent packages - 379 dependent repositories - 843 thousand downloads total - 234 stars on GitHub - 1 maintainer
Top 7.1% on crates.io
unic-ucd-hangul 0.9.0
UNIC — Unicode Character Database — Hangul Syllable Composition & Decomposition
2 versions - Latest release: about 7 years ago - 3 dependent packages - 89 dependent repositories - 461 thousand downloads total - 245 stars on GitHub - 1 maintainer
Top 4.2% on crates.io
unic-char-range 0.9.0
UNIC — Unicode Character Tools — Character Range and Iteration
4 versions - Latest release: about 7 years ago - 20 dependent packages - 2,619 dependent repositories - 38.1 million downloads total - 245 stars on GitHub - 1 maintainer
Top 7.1% on crates.io
unic-ucd-normal 0.9.0
UNIC — Unicode Character Database — Normalization Properties
10 versions - Latest release: about 7 years ago - 3 dependent packages - 90 dependent repositories - 470 thousand downloads total - 245 stars on GitHub - 1 maintainer
Top 8.2% on crates.io
unic-ucd-block 0.9.0
UNIC — Unicode Character Database — Unicode Blocks
2 versions - Latest release: about 7 years ago - 3 dependent packages - 34 dependent repositories - 120 thousand downloads total - 234 stars on GitHub - 1 maintainer
Top 5.1% on crates.io
unic-ucd-category 0.9.0
UNIC — Unicode Character Database — General Category
5 versions - Latest release: about 7 years ago - 14 dependent packages - 384 dependent repositories - 1.92 million downloads total - 234 stars on GitHub - 1 maintainer
Top 5.9% on crates.io
unic-common 0.9.0
UNIC — Common Utilities
3 versions - Latest release: about 7 years ago - 2 dependent packages - 2,619 dependent repositories - 38.2 million downloads total - 245 stars on GitHub - 1 maintainer
Top 4.2% on crates.io
unic-char-property 0.9.0
UNIC — Unicode Character Tools — Character Property taxonomy, contracts and build macros
4 versions - Latest release: about 7 years ago - 19 dependent packages - 2,618 dependent repositories - 38.2 million downloads total - 245 stars on GitHub - 1 maintainer
Top 4.2% on crates.io
unic-ucd-version 0.9.0
UNIC — Unicode Character Database — Version
3 versions - Latest release: about 7 years ago - 18 dependent packages - 2,619 dependent repositories - 38.1 million downloads total - 245 stars on GitHub - 1 maintainer
Top 9.5% on crates.io
unic-idna-mapping 0.9.0
UNIC — IDNA — IDNA Mapping Table
6 versions - Latest release: about 7 years ago - 1 dependent package - 19 dependent repositories - 80 thousand downloads total - 234 stars on GitHub - 1 maintainer
Top 5.3% on crates.io
unic-ucd-ident 0.9.0
UNIC — Unicode Character Database — Identifier Properties
3 versions - Latest release: about 7 years ago - 8 dependent packages - 304 dependent repositories - 14.4 million downloads total - 245 stars on GitHub - 1 maintainer
Top 6.2% on crates.io
unic-ucd-segment 0.9.0
UNIC — Unicode Character Database — Segmentation Properties
3 versions - Latest release: about 7 years ago - 2 dependent packages - 1,688 dependent repositories - 22.4 million downloads total - 245 stars on GitHub - 1 maintainer
Top 6.3% on crates.io
unic-normal 0.9.0
UNIC — Unicode Normalization Forms
9 versions - Latest release: about 7 years ago - 8 dependent packages - 79 dependent repositories - 416 thousand downloads total - 230 stars on GitHub - 1 maintainer
Top 4.7% on crates.io
unic-emoji-char 0.9.0
UNIC — Unicode Emoji — Emoji Character Properties
3 versions - Latest release: about 7 years ago - 16 dependent packages - 438 dependent repositories - 6.25 million downloads total - 234 stars on GitHub - 1 maintainer
Top 6.7% on crates.io
unic-ucd-common 0.9.0
UNIC — Unicode Character Database — Common Properties
3 versions - Latest release: about 7 years ago - 6 dependent packages - 31 dependent repositories - 353 thousand downloads total - 234 stars on GitHub - 1 maintainer
docker-puzzles 0.1.3
Docker Puzzles is a CLI tool for putting together Dockerfiles from pieces.
4 versions - Latest release: over 7 years ago - 6.43 thousand downloads total - 1 stars on GitHub - 1 maintainer
seams 0.1.1
High-throughput sentence extractor for Project Gutenberg texts with dialog-aware detection
2 versions - Latest release: 8 months ago - 1.07 thousand downloads total - 3 stars on GitHub - 1 maintainer
merge-whitespace-utils 1.1.0
Procedural macros for merging whitespace in const contexts
1 version - Latest release: over 1 year ago - 1.19 thousand downloads total - 2 stars on GitHub - 1 maintainer
filenamify 0.1.2 💰
Convert a string to a valid filename
3 versions - Latest release: over 1 year ago - 3 dependent packages - 4 dependent repositories - 99.9 thousand downloads total - 6 stars on GitHub - 1 maintainer
vi 0.8.0
An input method library for vietnamese IME
21 versions - Latest release: 9 months ago - 28 thousand downloads total - 154 stars on GitHub - 1 maintainer
text_analysis 0.4.9
A robust multilingual text analysis CLI with context, N-grams, named entities, and CSV/JSON export.
26 versions - Latest release: 6 months ago - 23.3 thousand downloads total - 7 stars on GitHub - 1 maintainer
supply-chain-trust-example-crate-000022
Single assignment cells and lazy values.
3 versions - Latest release: 7 days ago - 2.47 thousand downloads total - 1,027 stars on GitHub - 1 maintainer
hangul 0.1.3
Utilities to manipulate Hangul Syllables
4 versions - Latest release: over 6 years ago - 2 dependent packages - 2 dependent repositories - 28 thousand downloads total - 9 stars on GitHub - 1 maintainer
rustic-fuzz 2.0.0
A Rust crate for sorting strings based on their Levenshtein distance to a reference string.
3 versions - Latest release: 5 months ago - 2.36 thousand downloads total - 1 maintainer
fnew 1.0.1
A Unicode-aware line-oriented drop-in replacement for coreutils' fold.
2 versions - Latest release: about 6 years ago - 3.27 thousand downloads total - 3 stars on GitHub - 1 maintainer
sttx 0.1.0
Utility belt for transforming speech-to-text data
1 version - Latest release: almost 2 years ago - 1.47 thousand downloads total - 0 stars on GitHub - 1 maintainer
pretok 0.1.0
A string pre-tokenizer for C-like syntaxes.
1 version - Latest release: over 5 years ago - 1 dependent repositories - 1.69 thousand downloads total - 0 stars on GitHub - 1 maintainer
nucleo 0.5.0
plug and play high performance fuzzy matcher
9 versions - Latest release: almost 2 years ago - 7 dependent packages - 10 dependent repositories - 458 thousand downloads total - 1,294 stars on GitHub - 3 maintainers
untanglr 1.1.0
Probabilistically split concatenated words using NLP based on English Wikipedia unigram frequencies.
7 versions - Latest release: over 3 years ago - 9.9 thousand downloads total - 14 stars on GitHub - 1 maintainer
xxxxx_rust_sts 0.1.0
A collection of useful string and file utilities for Rust
1 version - Latest release: 9 months ago - 482 downloads total - 1 maintainer
int64grep 0.2.1
A tiny Rust crate that provides simple line-based search helpers and a small CLI similar to grep.
3 versions - Latest release: 5 months ago - 93 downloads total - 0 stars on GitHub - 1 maintainer
scrivener-rtf 0.1.0
Pure Rust RTF parser and generator, optimized for Scrivener workflows
1 version - Latest release: 9 days ago - 0 downloads total - 1 maintainer