crates.io "preprocessing" keyword
napparent-tabular 0.1.0
napparent tabular preprocessing — apparent effect features on Apache Arrow batches1 version - Latest release: about 21 hours ago - 0 downloads total - 1 maintainer
linfa-preprocessing 0.8.1 💰
A Machine Learning framework for Rust9 versions - Latest release: 5 months ago - 1 dependent package - 1 dependent repositories - 49.3 thousand downloads total - 4,446 stars on GitHub - 1 maintainer
brassfibre 0.2.0
Provides multiple-dtype columner storage, known as DataFrame in pandas/R2 versions - Latest release: over 9 years ago - 4.17 thousand downloads total - 22 stars on GitHub - 1 maintainer
frlearn_preprocess 0.1.0
Preprocessing models (for example range normalization) for FuzzyRough.1 version - Latest release: 3 months ago - 106 downloads total - 1 maintainer
langmail 0.11.2
Email preprocessing for LLMs10 versions - Latest release: 24 days ago - 187 downloads total - 2 stars on GitHub - 1 maintainer
simpleaf 0.25.0
A rust framework to make using alevin-fry and alevin-fry-ATAC even simpler.37 versions - Latest release: 3 days ago - 33.6 thousand downloads total - 66 stars on GitHub - 1 maintainer
rsomics-fastq-split 0.1.0
Split a FASTQ into N files or by line count. Rust port of fastp's split (deterministic --split_by...1 version - Latest release: 6 days ago - 12 downloads total - 1 maintainer
seq_geom_xform 0.4.0
Transform/normalize complex single-cell fragment geometries into simple geometries.6 versions - Latest release: about 3 years ago - 1 dependent package - 8.44 thousand downloads total - 4 stars on GitHub - 1 maintainer
sentence_segmentation 1.3.0
A rule-based sentence_segmenter, inspired by ruby pragmatic segmenter by diasks2 (repo: https://g...7 versions - Latest release: 5 months ago - 5.12 thousand downloads total - 0 stars on GitHub - 1 maintainer
fasterp 0.2.1
High-performance FASTQ preprocessing tool - often faster than fastp with the same interface3 versions - Latest release: 6 months ago - 92 downloads total - 6 stars on GitHub - 1 maintainer
rsomics-fastp
Pure-Rust FASTQ preprocessor — adapter trimming, quality filtering, polyG trimming, JSON QC repor...2 versions - Latest release: 5 days ago - 25 downloads total - 1 maintainer
rsomics-fastq-filter 0.1.0
FASTQ per-read quality + length filter. Rust port of fastp's quality/length filter (pass/fail who...1 version - Latest release: 7 days ago - 12 downloads total - 0 stars on GitHub - 1 maintainer
rsomics-fastq-umi 0.2.0
FASTQ inline-UMI extract + stamp. Rust port of fastp's UMI processing — full --umi_loc set (read1...2 versions - Latest release: 7 days ago - 24 downloads total - 1 maintainer
triplets-core 0.23.0-alpha
Core types, traits, and algorithms for the triplets data pipeline framework.9 versions - Latest release: 6 days ago - 179 downloads total - 1 stars on GitHub - 1 maintainer
triplets 0.23.0-alpha
A composable, deterministic text data pipeline for ML. Ingest, denoise, chunk, split, and sample ...40 versions - Latest release: 6 days ago - 503 downloads total - 0 stars on GitHub - 1 maintainer
triplets-hf-source 0.23.0-alpha
Hugging Face integration for the triplets data pipeline framework.9 versions - Latest release: 6 days ago - 106 downloads total - 1 maintainer
triplets-debug 0.23.0-alpha
Reusable debug/demo runners for the triplets data pipeline framework.9 versions - Latest release: 6 days ago - 128 downloads total - 1 maintainer
alevin-fry 0.14.0
A suite of tools for the rapid, accurate and memory-frugal processing single-cell and single-nucl...18 versions - Latest release: about 1 month ago - 19.7 thousand downloads total - 201 stars on GitHub - 3 maintainers
uniquewords-rs 0.9.1
Count the frequencies of words in text file(s) or stdin12 versions - Latest release: over 1 year ago - 13.3 thousand downloads total - 0 stars on GitHub - 1 maintainer
nuclease 0.2.0
Streaming FASTQ preprocessor with a focus on extensibility3 versions - Latest release: 13 days ago - 36 downloads total - 1 maintainer
cbundl 0.1.4
webpack but for C code.5 versions - Latest release: over 1 year ago - 4.12 thousand downloads total - 2 stars on GitHub - 1 maintainer
ppx-impl 1.0.1
C-style pre-processor library implementation. See 'ppx' for the library you should use.10 versions - Latest release: 4 months ago - 362 downloads total - 0 stars on GitHub - 1 maintainer
corpus-preproc 0.1.0
A preprocessor for text and HTML corpora1 version - Latest release: over 4 years ago - 1.69 thousand downloads total - 2 stars on GitHub - 1 maintainer
textsanity-core 0.1.0
Pure-Rust core for textsanity: unicode/whitespace/encoding cleanup.1 version - Latest release: 14 days ago - 0 downloads total - 1 maintainer
speech-prep 0.1.0
Speech-focused audio preprocessing — VAD, format detection, decoding, noise reduction, chunking1 version - Latest release: about 2 months ago - 0 downloads total - 1 maintainer
wgsl-template 1.0.0
Macro expansion for wgsl using ppx, a C-style macro library7 versions - Latest release: 4 months ago - 100 downloads total - 0 stars on GitHub - 1 maintainer
ppx-macros 1.1.0
Macros for expanding C-style pre-processor macros at compile time. See 'ppx' for the library you ...5 versions - Latest release: 4 months ago - 147 downloads total - 0 stars on GitHub - 1 maintainer
ppx 1.0.1
C-style pre-processor library12 versions - Latest release: 4 months ago - 281 downloads total - 0 stars on GitHub - 1 maintainer
katana 1.0.2
A fast and accurate rule-based sentence segmentation tool for Rust. A port from Louie Mullie's Sc...3 versions - Latest release: over 10 years ago - 11.4 thousand downloads total - 3 stars on GitHub - 1 maintainer
yosina 3.0.0
Japanese text transliteration library9 versions - Latest release: about 1 month ago - 1.03 thousand downloads total - 20 stars on GitHub - 1 maintainer
wax-cli 0.2.1
An extension of HTML written in Rust13 versions - Latest release: over 3 years ago - 15 thousand downloads total - 2 stars on GitHub - 1 maintainer
cuticula 0.2.0
Data Preprocessing library for Machine Learning6 versions - Latest release: about 10 years ago - 1 dependent repositories - 11.3 thousand downloads total - 89 stars on GitHub - 1 maintainer
libradicl 0.13.0
support library for alevin-fry20 versions - Latest release: 2 months ago - 3 dependent packages - 1 dependent repositories - 25.8 thousand downloads total - 5 stars on GitHub - 3 maintainers
ferrolearn-preprocess 0.2.2
Preprocessing transformers for the ferrolearn ML framework6 versions - Latest release: 24 days ago - 147 downloads total - 15 stars on GitHub - 2 maintainers
qs-data-preprocess 0.1.1
Historical market data storage and preprocessing CLI2 versions - Latest release: 3 months ago - 25 downloads total - 1 maintainer
robust_scaler 0.1.0
A RobustScaler for Rust, compatible with scikit-learn's RobustScaler1 version - Latest release: 9 months ago - 385 downloads total - 1 stars on GitHub - 1 maintainer
ghostflow-data 1.0.0
Data loading utilities for GhostFlow ML framework2 versions - Latest release: 5 months ago - 257 downloads total - 1 maintainer
sklears-preprocessing 0.1.1 💰
Data preprocessing for sklears: scaling, encoding, imputation, transformations6 versions - Latest release: 27 days ago - 503 downloads total - 1 stars on GitHub - 1 maintainer
exg-luna 0.0.3
LUNA seizure-detection preprocessing pipeline for EEG — built on exg1 version - Latest release: 2 months ago - 0 downloads total - 0 stars on GitHub - 1 maintainer
contractions 0.5.4
Contractions is a rust library to expand contractions in English.3 versions - Latest release: almost 5 years ago - 1 dependent package - 1 dependent repositories - 32.4 thousand downloads total - 1 stars on GitHub - 1 maintainer
presto-cli 0.1.0
Presto accelerates preprocessing with precision.1 version - Latest release: about 1 year ago - 744 downloads total - 0 stars on GitHub - 1 maintainer
radtk 0.2.0
A toolkit for working with RAD files1 version - Latest release: almost 2 years ago - 1.24 thousand downloads total - 1 stars on GitHub - 1 maintainer
gather-all-code-from-crates 0.1.0
a Rust crate designed to extract, filter, and reconstruct code elements from Rust projects. It pr...1 version - Latest release: over 1 year ago - 927 downloads total - 18 stars on GitHub - 1 maintainer
math_images_processor 0.1.1
A Rust library for preprocessing images of mathematical formulas, ideally for machine learning ap...2 versions - Latest release: over 1 year ago - 2.15 thousand downloads total - 2 stars on GitHub - 1 maintainer
greenglas 0.3.0
Data Preprocessing library for Machine Learning2 versions - Latest release: over 4 years ago - 1 dependent repositories - 3.4 thousand downloads total - 19 stars on GitHub - 1 maintainer
Related Keywords
machine-learning
8
bioinformatics
6
fastq
6
nlp
5
rna-seq
5
rust
5
single-cell
5
preprocessor
5
macro
4
ai
4
triplet-mining
4
pre-processing
4
macros
4
compile-time
4
train-test-split
4
pre-processor
4
dataset-sampling
4
single-nucleus
3
atac-seq
3
fastp
3
preproc
3
cli
3
expansion
3
encoding
3
bm25
2
parser
2
artificial-intelligence
2
science
2
text-processing
2
scaler
2
data-preprocessing
2
transformation
2
unicode
2
ml
2
algorithms
2
llm
2
normalization
2
text
2
vad
1
voice
1
webgpu
1
compiler
1
wgsl
1
scalpel
1
html
1
symbols
1
hyphens
1
japanese
1
transliteration
1
cut
1
segmentation
1
framework
1
mathematics
1
image
1
syntax
1
queries
1
crates
1
rad-file
1
tui
1
data-science
1
data-analysis
1
language
1
tcp
1
seizure
1
luna
1
eeg
1
scikitlearn-machine-learning
1
scikit-learn
1
rust-lang
1
scaling
1
dataset
1
data-loading
1
parquet
1
ohlcv
1
market-data
1
feature-engineering
1
encoder
1
speech
1
qc
1
quality-control
1
genomics
1
text-analysis
1
sentence-segmentation
1
sentence-boundary-detection
1
natural-language-processing
1
multilingual
1
split
1
mime
1
email
1
fuzzy
1
dataframe
1
columner
1
datastore
1
scientific-computing
1
linfa
1
transparency
1
tabular
1
napparent
1
arrow
1
audio
1