An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

crates.io "text-processing" keyword

Top 2.2% on crates.io
aho-corasick 1.1.4 πŸ’°
Fast multiple substring searching.
62 versions - Latest release: 4 months ago - 144 dependent packages - 66,037 dependent repositories - 676 million downloads total - 1,204 stars on GitHub - 1 maintainer
aho-corasick-unsafe 0.0.4 πŸ’°
Fast multiple substring searching.
4 versions - Latest release: over 1 year ago - 6.18 thousand downloads total - 1,204 stars on GitHub - 1 maintainer
waken_snowball 0.1.0
Rust implementation of Snowball stemming algorithms for 33 languages
1 version - Latest release: 7 months ago - 978 downloads total - 838 stars on GitHub - 1 maintainer
spongebob 2.0.1
A utility to convert text to spongebob case a.k.a tHe MoCkInG sPoNgEbOb MeMe.
8 versions - Latest release: about 1 year ago - 8.71 thousand downloads total - 1 stars on GitHub - 1 maintainer
typope 0.4.0
Pedantic source code checker for orthotypography mistakes and other typographical errors
6 versions - Latest release: 11 months ago - 5.41 thousand downloads total - 1 stars on GitHub - 1 maintainer
smoltok-core 0.1.1
Byte-Pair Encoding tokenizer implementation in Rust
2 versions - Latest release: about 2 months ago - 28 downloads total - 3 stars on GitHub - 1 maintainer
red-sed 1.0.2
An experimental drop-in replacement for GNU sed, written in Rust
2 versions - Latest release: about 1 month ago - 32 downloads total - 1 maintainer
pray 1.5.0
A tui tool for preparing a prompt to the llms.
11 versions - Latest release: about 1 year ago - 8.25 thousand downloads total - 1 stars on GitHub - 1 maintainer
wakuchin 0.3.0 πŸ’°
A next generation wakuchin researcher software written in Rust
3 versions - Latest release: over 3 years ago - 1 dependent package - 4.43 thousand downloads total - 1 stars on GitHub - 1 maintainer
jackdauer 0.1.2
Use this Rust crate to easily parse various time formats to durations
3 versions - Latest release: almost 3 years ago - 1 dependent package - 1 dependent repositories - 8.67 thousand downloads total - 8 stars on GitHub - 1 maintainer
fastchr 0.3.0
Faster memchr using SIMD intrinsics
3 versions - Latest release: almost 8 years ago - 1 dependent package - 6.3 thousand downloads total - 15 stars on GitHub - 1 maintainer
uroman 0.6.4
A self-contained Rust reimplementation of the uroman universal romanizer.
15 versions - Latest release: 22 days ago - 4.83 thousand downloads total - 35 stars on GitHub - 1 maintainer
deepfrog 0.2.1
A deep learning NLP suite (PoS,lemmatiser,NER) with FoLiA XML support
2 versions - Latest release: almost 5 years ago - 3.03 thousand downloads total - 19 stars on GitHub - 1 maintainer
drug-extraction-cli 1.3.0 removed
A CLI for extracting drugs from text records
6 versions - Latest release: almost 2 years ago - 9.34 thousand downloads total - 3 stars on GitHub - 1 maintainer
headson 0.16.1
Budget‑constrained JSON preview renderer
70 versions - Latest release: 20 days ago - 4.71 thousand downloads total - 49 stars on GitHub - 1 maintainer
rawk-core 0.0.6
Core library for the AWK interpreter
6 versions - Latest release: 22 days ago - 144 downloads total - 0 stars on GitHub - 1 maintainer
buup 0.25.3
Core transformation library with zero dependencies
29 versions - Latest release: 5 months ago - 13.9 thousand downloads total - 7 stars on GitHub - 1 maintainer
prunist 0.16.1
Experimental library for pruning tree structures based on priority rules; API may change
7 versions - Latest release: 20 days ago - 348 downloads total - 49 stars on GitHub - 1 maintainer
moguls 0.1.1
Let the words of financial moguls inspire and guide you in your quest for financial excellence ...
2 versions - Latest release: over 2 years ago - 2.66 thousand downloads total - 1 stars on GitHub - 1 maintainer
aneubeck-daachorse 1.1.1
Daachorse: Double-Array Aho-Corasick
2 versions - Latest release: over 1 year ago - 73.3 thousand downloads total - 227 stars on GitHub - 3 maintainers
Top 9.6% on crates.io
daachorse 1.0.0
Daachorse: Double-Array Aho-Corasick
11 versions - Latest release: over 3 years ago - 12 dependent packages - 4 dependent repositories - 658 thousand downloads total - 232 stars on GitHub - 2 maintainers
Top 6.8% on crates.io
unic-ucd 0.9.0
UNIC β€” Unicode Character Database
11 versions - Latest release: almost 7 years ago - 9 dependent packages - 28 dependent repositories - 130 thousand downloads total - 245 stars on GitHub - 1 maintainer
unic-cli 0.9.0
UNIC Command-Line Tools
3 versions - Latest release: almost 7 years ago - 5.01 thousand downloads total - 245 stars on GitHub - 1 maintainer
Top 9.3% on crates.io
unic-ucd-name_aliases 0.9.0
UNIC β€” Unicode Character Database β€” Name Aliases
1 version - Latest release: almost 7 years ago - 1 dependent package - 27 dependent repositories - 113 thousand downloads total - 245 stars on GitHub - 1 maintainer
Top 8.2% on crates.io
unic-ucd-case 0.9.0
UNIC β€” Unicode Character Database β€” Case Properties
4 versions - Latest release: almost 7 years ago - 2 dependent packages - 27 dependent repositories - 125 thousand downloads total - 245 stars on GitHub - 1 maintainer
Top 5.9% on crates.io
unic-ucd-bidi 0.9.0
UNIC β€” Unicode Character Database β€” Bidi Properties
9 versions - Latest release: almost 7 years ago - 6 dependent packages - 458 dependent repositories - 1.22 million downloads total - 245 stars on GitHub - 1 maintainer
Top 6.7% on crates.io
unic-ucd-common 0.9.0
UNIC β€” Unicode Character Database β€” Common Properties
3 versions - Latest release: almost 7 years ago - 6 dependent packages - 31 dependent repositories - 344 thousand downloads total - 234 stars on GitHub - 1 maintainer
Top 5.1% on crates.io
unic-ucd-category 0.9.0
UNIC β€” Unicode Character Database β€” General Category
5 versions - Latest release: almost 7 years ago - 14 dependent packages - 384 dependent repositories - 1.74 million downloads total - 234 stars on GitHub - 1 maintainer
Top 7.1% on crates.io
unic-ucd-hangul 0.9.0
UNIC β€” Unicode Character Database β€” Hangul Syllable Composition & Decomposition
2 versions - Latest release: almost 7 years ago - 3 dependent packages - 89 dependent repositories - 438 thousand downloads total - 245 stars on GitHub - 1 maintainer
Top 4.7% on crates.io
unic-emoji-char 0.9.0
UNIC β€” Unicode Emoji β€” Emoji Character Properties
3 versions - Latest release: almost 7 years ago - 16 dependent packages - 438 dependent repositories - 5.75 million downloads total - 234 stars on GitHub - 1 maintainer
Top 8.2% on crates.io
unic-ucd-block 0.9.0
UNIC β€” Unicode Character Database β€” Unicode Blocks
2 versions - Latest release: almost 7 years ago - 3 dependent packages - 34 dependent repositories - 116 thousand downloads total - 234 stars on GitHub - 1 maintainer
Top 4.2% on crates.io
unic-ucd-version 0.9.0
UNIC β€” Unicode Character Database β€” Version
3 versions - Latest release: almost 7 years ago - 18 dependent packages - 2,619 dependent repositories - 35.8 million downloads total - 245 stars on GitHub - 1 maintainer
Top 4.2% on crates.io
unic-char-range 0.9.0
UNIC β€” Unicode Character Tools β€” Character Range and Iteration
4 versions - Latest release: almost 7 years ago - 20 dependent packages - 2,619 dependent repositories - 35.8 million downloads total - 245 stars on GitHub - 1 maintainer
Top 9.9% on crates.io
unic-char 0.9.0
UNIC β€” Unicode Character Tools
4 versions - Latest release: almost 7 years ago - 1 dependent package - 11 dependent repositories - 56.7 thousand downloads total - 245 stars on GitHub - 1 maintainer
Top 8.1% on crates.io
unic 0.9.0
UNIC: Unicode and Internationalization Crates
10 versions - Latest release: almost 7 years ago - 4 dependent packages - 11 dependent repositories - 66.5 thousand downloads total - 245 stars on GitHub - 1 maintainer
Top 8.0% on crates.io
unic-idna-punycode 0.9.0
UNIC β€” Implementation of Punycode (RFC 3492) algorithm
9 versions - Latest release: almost 7 years ago - 3 dependent packages - 19 dependent repositories - 112 thousand downloads total - 245 stars on GitHub - 1 maintainer
Top 9.9% on crates.io
unic-emoji 0.9.0
UNIC β€” Unicode Emoji
3 versions - Latest release: almost 7 years ago - 1 dependent package - 11 dependent repositories - 58.2 thousand downloads total - 245 stars on GitHub - 1 maintainer
Top 9.5% on crates.io
unic-idna-mapping 0.9.0
UNIC β€” IDNA β€” IDNA Mapping Table
6 versions - Latest release: almost 7 years ago - 1 dependent package - 19 dependent repositories - 77.9 thousand downloads total - 234 stars on GitHub - 1 maintainer
Top 7.1% on crates.io
unic-ucd-age 0.9.0
UNIC β€” Unicode Character Database β€” Age
7 versions - Latest release: almost 7 years ago - 3 dependent packages - 79 dependent repositories - 405 thousand downloads total - 245 stars on GitHub - 1 maintainer
Top 8.9% on crates.io
unic-char-basics 0.9.0
UNIC β€” Unicode Character Tools β€” Basic Stable Character Properties
2 versions - Latest release: almost 7 years ago - 2 dependent packages - 11 dependent repositories - 53.9 thousand downloads total - 245 stars on GitHub - 1 maintainer
Top 5.9% on crates.io
unic-common 0.9.0
UNIC β€” Common Utilities
3 versions - Latest release: almost 7 years ago - 2 dependent packages - 2,619 dependent repositories - 35.8 million downloads total - 245 stars on GitHub - 1 maintainer
Top 6.2% on crates.io
unic-ucd-segment 0.9.0
UNIC β€” Unicode Character Database β€” Segmentation Properties
3 versions - Latest release: almost 7 years ago - 2 dependent packages - 1,688 dependent repositories - 21.7 million downloads total - 245 stars on GitHub - 1 maintainer
Top 4.2% on crates.io
unic-char-property 0.9.0
UNIC β€” Unicode Character Tools β€” Character Property taxonomy, contracts and build macros
4 versions - Latest release: almost 7 years ago - 19 dependent packages - 2,618 dependent repositories - 35.8 million downloads total - 245 stars on GitHub - 1 maintainer
Top 9.4% on crates.io
unic-idna 0.9.0
UNIC β€” Unicode IDNA Compatibility Processing
8 versions - Latest release: almost 7 years ago - 2 dependent packages - 24 dependent repositories - 80.3 thousand downloads total - 245 stars on GitHub - 1 maintainer
Top 7.4% on crates.io
unic-ucd-name 0.9.0
UNIC β€” Unicode Character Database β€” Name
4 versions - Latest release: almost 7 years ago - 4 dependent packages - 30 dependent repositories - 133 thousand downloads total - 245 stars on GitHub - 1 maintainer
Top 6.3% on crates.io
unic-normal 0.9.0
UNIC β€” Unicode Normalization Forms
9 versions - Latest release: almost 7 years ago - 8 dependent packages - 79 dependent repositories - 394 thousand downloads total - 230 stars on GitHub - 1 maintainer
Top 5.0% on crates.io
unic-segment 0.9.0
UNIC β€” Unicode Text Segmentation Algorithms
3 versions - Latest release: almost 7 years ago - 8 dependent packages - 1,697 dependent repositories - 21.6 million downloads total - 245 stars on GitHub - 1 maintainer
Top 7.1% on crates.io
unic-ucd-normal 0.9.0
UNIC β€” Unicode Character Database β€” Normalization Properties
10 versions - Latest release: almost 7 years ago - 3 dependent packages - 90 dependent repositories - 447 thousand downloads total - 245 stars on GitHub - 1 maintainer
Top 5.3% on crates.io
unic-ucd-ident 0.9.0
UNIC β€” Unicode Character Database β€” Identifier Properties
3 versions - Latest release: almost 7 years ago - 8 dependent packages - 304 dependent repositories - 12.7 million downloads total - 245 stars on GitHub - 1 maintainer
Top 6.1% on crates.io
unic-bidi 0.9.0
UNIC β€” Unicode Bidirectional Algorithm
8 versions - Latest release: almost 7 years ago - 6 dependent packages - 379 dependent repositories - 763 thousand downloads total - 234 stars on GitHub - 1 maintainer
nlpo3 1.4.0
Thai natural language processing library, with Python and Node bindings
8 versions - Latest release: over 1 year ago - 1 dependent package - 1 dependent repositories - 28.4 thousand downloads total - 38 stars on GitHub - 2 maintainers
dcsv 0.3.3
Dyanmic csv reader,writer,editor
13 versions - Latest release: about 2 years ago - 3 dependent packages - 4 dependent repositories - 22.1 thousand downloads total - 2 stars on GitHub - 1 maintainer
nlpo3-cli 0.2.0
Command line interface for nlpO3, a Thai natural language processing library
3 versions - Latest release: over 4 years ago - 4.09 thousand downloads total - 36 stars on GitHub - 2 maintainers
r4d 3.1.0
Text oriented macro processor
70 versions - Latest release: over 3 years ago - 1 dependent package - 93.6 thousand downloads total - 16 stars on GitHub - 1 maintainer
strip-codeblocks 0.1.0
A Rust library to strip markdown code blocks from text, preserving only the inner content
1 version - Latest release: 3 months ago - 28 downloads total - 1 maintainer
dictutils 0.1.2
Dictionary utilities for Mdict and other formats
3 versions - Latest release: 3 months ago - 131 downloads total - 1 stars on GitHub - 1 maintainer
matcher_py 0.7.1
A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matchin...
31 versions - Latest release: 3 days ago - 30.2 thousand downloads total - 15 stars on GitHub - 1 maintainer
matcher_rs 0.7.1
A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matchin...
45 versions - Latest release: 3 days ago - 48.5 thousand downloads total - 15 stars on GitHub - 1 maintainer
matcher_c 0.7.1
A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matchin...
31 versions - Latest release: 3 days ago - 30.2 thousand downloads total - 15 stars on GitHub - 1 maintainer
unaccent 0.1.1
A Rust crate to remove accents from strings, inspired by PostgreSQL's unaccent extension.
2 versions - Latest release: about 1 year ago - 26.3 thousand downloads total - 5 stars on GitHub - 1 maintainer
kda-tools 1.3.1 πŸ’°
Tools for doing data management on a match journal, specifally for Hunt Showdown, but it'll work...
3 versions - Latest release: over 2 years ago - 4.79 thousand downloads total - 3 stars on GitHub - 1 maintainer
skan 0.1.0
Skan is a Rust-native, Java Scanner-inspired library that provides type-safe, convenient methods ...
1 version - Latest release: 6 months ago - 578 downloads total - 0 stars on GitHub - 1 maintainer
sakurs-cli 0.1.1 πŸ’°
Command-line interface for Sakurs sentence boundary detection
2 versions - Latest release: 7 months ago - 786 downloads total - 3 stars on GitHub - 1 maintainer
detex 0.2.1
Strip TeX/LaTeX commands from input files
3 versions - Latest release: about 2 months ago - 57 downloads total - 1 maintainer
ised 0.3.2
An interactive tool for find-and-replace across many files
6 versions - Latest release: 9 months ago - 3.19 thousand downloads total - 6 stars on GitHub - 1 maintainer
bidi 0.1.1
Implementation of the Unicode Bidirectional Algorithm (UBA).
2 versions - Latest release: over 2 years ago - 2.91 thousand downloads total - 0 stars on GitHub - 1 maintainer
whetstone 1.0.0
Parses and evaluate string representations of mathematical expressions in various syntaxes
1 version - Latest release: 4 months ago - 24 downloads total - 1 maintainer
lazy-transform-str 0.0.6 πŸ’°
Lazy-copying lazy-allocated scanning `str` transformations. This is good e.g. for (un)escaping te...
6 versions - Latest release: over 5 years ago - 1 dependent package - 5 dependent repositories - 30.3 thousand downloads total - 1 stars on GitHub - 1 maintainer
Top 9.1% on crates.io
sd 1.0.0
An intuitive find & replace CLI
23 versions - Latest release: over 2 years ago - 1 dependent package - 2 dependent repositories - 487 thousand downloads total - 6,762 stars on GitHub - 2 maintainers
extract 0.1.1
A tool for extracting text from text.
2 versions - Latest release: almost 9 years ago - 4.17 thousand downloads total - 2 stars on GitHub - 1 maintainer
rustling 0.5.0
A blazingly fast library for computational linguistics
5 versions - Latest release: 6 days ago - 4.27 thousand downloads total - 0 stars on GitHub - 1 maintainer
sesdiff 0.3.1
Generates a shortest edit script (Myers' diff algorithm) to indicate how to get from the strings ...
8 versions - Latest release: over 1 year ago - 1 dependent package - 1 dependent repositories - 11.8 thousand downloads total - 4 stars on GitHub - 1 maintainer
Top 6.5% on crates.io
edit-distance 2.2.1
Levenshtein edit distance between strings, a measure for similarity.
9 versions - Latest release: 5 months ago - 20 dependent packages - 526 dependent repositories - 2.08 million downloads total - 48 stars on GitHub - 1 maintainer
unic-ucd-core 0.6.0
UNIC - Unicode Character Database - Version
6 versions - Latest release: over 8 years ago - 11 dependent packages - 1 dependent repositories - 20.7 thousand downloads total - 242 stars on GitHub - 1 maintainer
nucleo-matcher 0.3.1
plug and play high performance fuzzy matcher
5 versions - Latest release: about 2 years ago - 4 dependent packages - 11 dependent repositories - 1.21 million downloads total - 1,218 stars on GitHub - 3 maintainers
in_definite 1.1.2
Get the indefinite article ('a' or 'an') to match the given word. For example: an umbrella, a user.
21 versions - Latest release: 6 months ago - 3 dependent packages - 4 dependent repositories - 125 thousand downloads total - 2 stars on GitHub - 1 maintainer
slicestring 0.3.3
slicestring is a crate for slicing Strings
10 versions - Latest release: about 2 years ago - 2 dependent packages - 1 dependent repositories - 20.6 thousand downloads total - 0 stars on GitHub - 1 maintainer
cindex 0.5.2
CSV indexing library
15 versions - Latest release: about 2 years ago - 2 dependent packages - 2 dependent repositories - 20.8 thousand downloads total - 0 stars on GitHub - 1 maintainer
correct_word 0.2.0
A No brainer 'did you mean' library for Rust
4 versions - Latest release: about 1 year ago - 2 dependent packages - 5.71 thousand downloads total - 0 stars on GitHub - 1 maintainer
shiva 1.4.9
Shiva library: Implementation in Rust of a parser and generator for documents of any type
37 versions - Latest release: over 1 year ago - 1 dependent package - 46.9 thousand downloads total - 397 stars on GitHub - 1 maintainer
capitalize 0.3.4 πŸ’°
Change first character to upper case and the rest to lower case, and other common alternatives
7 versions - Latest release: almost 2 years ago - 1 dependent package - 1 dependent repositories - 85.2 thousand downloads total - 1 stars on GitHub - 1 maintainer
rust-persian-tools 1.1.4
Official Rust implementation of Persian Tools
7 versions - Latest release: over 1 year ago - 1 dependent package - 9.44 thousand downloads total - 73 stars on GitHub - 1 maintainer
korrektor-utils 0.1.2 πŸ’°
Utils library for korrektor-rs
3 versions - Latest release: almost 3 years ago - 1 dependent package - 4.46 thousand downloads total - 5 stars on GitHub - 1 maintainer
trexter 0.1.1
Text progression tracking library
2 versions - Latest release: over 3 years ago - 1 dependent package - 1 dependent repositories - 3.94 thousand downloads total - 0 stars on GitHub - 1 maintainer
unic-utils 0.6.0
UNIC - Utilities
2 versions - Latest release: over 8 years ago - 9 dependent packages - 1 dependent repositories - 8.89 thousand downloads total - 242 stars on GitHub - 1 maintainer
text-tags 0.1.0
A lightweight, text-tag markup parser
1 version - Latest release: 8 months ago - 447 downloads total - 5 stars on GitHub - 1 maintainer
intspan 0.8.7
Command line tools for IntSpan related bioinformatics operations
55 versions - Latest release: 11 months ago - 1 dependent package - 1 dependent repositories - 79.3 thousand downloads total - 12 stars on GitHub - 1 maintainer
unic-ucd-utils
UNIC - Utilities for working with Unicode Code Points
4 versions - Latest release: about 1 month ago - 1 dependent package - 5.79 thousand downloads total - 243 stars on GitHub - 1 maintainer
file-chunker 0.1.1
Efficiently process a file in (approximately) equally-sized parts
2 versions - Latest release: about 4 years ago - 1 dependent package - 2 dependent repositories - 28.1 thousand downloads total - 1 stars on GitHub - 1 maintainer
langdetect-rs 0.2.3
Language detection in Rust. Port of Mimino666's langdetect.
5 versions - Latest release: 3 months ago - 161 downloads total - 1 maintainer
rjc 0.2.3
rjc converts the output of many commands, file-types, and strings to JSON, YAML, or TOML
6 versions - Latest release: over 2 years ago - 7.79 thousand downloads total - 1 stars on GitHub - 1 maintainer
kweepeer 0.1.2
A generic webservice for interactive query expansion, expansion is provided via various modules
3 versions - Latest release: 11 months ago - 1.97 thousand downloads total - 0 stars on GitHub - 1 maintainer
iterate-text 0.0.1
Library of helper functions and structures for iterating over text files
1 version - Latest release: almost 5 years ago - 1 dependent repositories - 1.98 thousand downloads total - 1 stars on GitHub - 1 maintainer
primo 0.0.1
Sort a file, correctly handling multi-digits numbers
1 version - Latest release: almost 9 years ago - 2.11 thousand downloads total - 1 stars on GitHub - 1 maintainer
lngcnv 1.10.2 πŸ’°
linguistics: display pronunciation, translate between dialects, convert between orthographies; su...
57 versions - Latest release: 9 months ago - 67.3 thousand downloads total - 22 stars on GitHub - 1 maintainer
translitrs 0.2.2 πŸ’°
Transliteration utility for Serbian language
3 versions - Latest release: about 3 years ago - 3.95 thousand downloads total - 6 stars on GitHub - 1 maintainer
vtext 0.2.0
NLP with Rust
4 versions - Latest release: over 5 years ago - 3 dependent repositories - 14.7 thousand downloads total - 153 stars on GitHub - 1 maintainer
lexmatch 0.3.0
This is a simple lexicon matching tool that, given a lexicon of words or phrases, identifies all ...
3 versions - Latest release: over 1 year ago - 4.03 thousand downloads total - 2 stars on GitHub - 1 maintainer
analiticcl 0.4.8
Analiticcl is an approximate string matching or fuzzy-matching system that can be used to find va...
16 versions - Latest release: 12 months ago - 22.3 thousand downloads total - 37 stars on GitHub - 1 maintainer
tdoc 0.9.2
Library and assorted CLI tools for working with FTML (Formatted Text Markup Language) documents
17 versions - Latest release: about 1 month ago - 723 downloads total - 0 stars on GitHub - 1 maintainer