An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

crates.io "preprocessing" keyword

napparent-tabular 0.1.0
napparent tabular preprocessing — apparent effect features on Apache Arrow batches
1 version - Latest release: about 21 hours ago - 0 downloads total - 1 maintainer
linfa-preprocessing 0.8.1 💰
A Machine Learning framework for Rust
9 versions - Latest release: 5 months ago - 1 dependent package - 1 dependent repositories - 49.3 thousand downloads total - 4,446 stars on GitHub - 1 maintainer
brassfibre 0.2.0
Provides multiple-dtype columner storage, known as DataFrame in pandas/R
2 versions - Latest release: over 9 years ago - 4.17 thousand downloads total - 22 stars on GitHub - 1 maintainer
frlearn_preprocess 0.1.0
Preprocessing models (for example range normalization) for FuzzyRough.
1 version - Latest release: 3 months ago - 106 downloads total - 1 maintainer
langmail 0.11.2
Email preprocessing for LLMs
10 versions - Latest release: 24 days ago - 187 downloads total - 2 stars on GitHub - 1 maintainer
simpleaf 0.25.0
A rust framework to make using alevin-fry and alevin-fry-ATAC even simpler.
37 versions - Latest release: 3 days ago - 33.6 thousand downloads total - 66 stars on GitHub - 1 maintainer
rsomics-fastq-split 0.1.0
Split a FASTQ into N files or by line count. Rust port of fastp's split (deterministic --split_by...
1 version - Latest release: 6 days ago - 12 downloads total - 1 maintainer
seq_geom_xform 0.4.0
Transform/normalize complex single-cell fragment geometries into simple geometries.
6 versions - Latest release: about 3 years ago - 1 dependent package - 8.44 thousand downloads total - 4 stars on GitHub - 1 maintainer
sentence_segmentation 1.3.0
A rule-based sentence_segmenter, inspired by ruby pragmatic segmenter by diasks2 (repo: https://g...
7 versions - Latest release: 5 months ago - 5.12 thousand downloads total - 0 stars on GitHub - 1 maintainer
fasterp 0.2.1
High-performance FASTQ preprocessing tool - often faster than fastp with the same interface
3 versions - Latest release: 6 months ago - 92 downloads total - 6 stars on GitHub - 1 maintainer
rsomics-fastp
Pure-Rust FASTQ preprocessor — adapter trimming, quality filtering, polyG trimming, JSON QC repor...
2 versions - Latest release: 5 days ago - 25 downloads total - 1 maintainer
rsomics-fastq-filter 0.1.0
FASTQ per-read quality + length filter. Rust port of fastp's quality/length filter (pass/fail who...
1 version - Latest release: 7 days ago - 12 downloads total - 0 stars on GitHub - 1 maintainer
rsomics-fastq-umi 0.2.0
FASTQ inline-UMI extract + stamp. Rust port of fastp's UMI processing — full --umi_loc set (read1...
2 versions - Latest release: 7 days ago - 24 downloads total - 1 maintainer
triplets-core 0.23.0-alpha
Core types, traits, and algorithms for the triplets data pipeline framework.
9 versions - Latest release: 6 days ago - 179 downloads total - 1 stars on GitHub - 1 maintainer
triplets 0.23.0-alpha
A composable, deterministic text data pipeline for ML. Ingest, denoise, chunk, split, and sample ...
40 versions - Latest release: 6 days ago - 503 downloads total - 0 stars on GitHub - 1 maintainer
triplets-hf-source 0.23.0-alpha
Hugging Face integration for the triplets data pipeline framework.
9 versions - Latest release: 6 days ago - 106 downloads total - 1 maintainer
triplets-debug 0.23.0-alpha
Reusable debug/demo runners for the triplets data pipeline framework.
9 versions - Latest release: 6 days ago - 128 downloads total - 1 maintainer
alevin-fry 0.14.0
A suite of tools for the rapid, accurate and memory-frugal processing single-cell and single-nucl...
18 versions - Latest release: about 1 month ago - 19.7 thousand downloads total - 201 stars on GitHub - 3 maintainers
uniquewords-rs 0.9.1
Count the frequencies of words in text file(s) or stdin
12 versions - Latest release: over 1 year ago - 13.3 thousand downloads total - 0 stars on GitHub - 1 maintainer
nuclease 0.2.0
Streaming FASTQ preprocessor with a focus on extensibility
3 versions - Latest release: 13 days ago - 36 downloads total - 1 maintainer
cbundl 0.1.4
webpack but for C code.
5 versions - Latest release: over 1 year ago - 4.12 thousand downloads total - 2 stars on GitHub - 1 maintainer
ppx-impl 1.0.1
C-style pre-processor library implementation. See 'ppx' for the library you should use.
10 versions - Latest release: 4 months ago - 362 downloads total - 0 stars on GitHub - 1 maintainer
corpus-preproc 0.1.0
A preprocessor for text and HTML corpora
1 version - Latest release: over 4 years ago - 1.69 thousand downloads total - 2 stars on GitHub - 1 maintainer
textsanity-core 0.1.0
Pure-Rust core for textsanity: unicode/whitespace/encoding cleanup.
1 version - Latest release: 14 days ago - 0 downloads total - 1 maintainer
speech-prep 0.1.0
Speech-focused audio preprocessing — VAD, format detection, decoding, noise reduction, chunking
1 version - Latest release: about 2 months ago - 0 downloads total - 1 maintainer
wgsl-template 1.0.0
Macro expansion for wgsl using ppx, a C-style macro library
7 versions - Latest release: 4 months ago - 100 downloads total - 0 stars on GitHub - 1 maintainer
ppx-macros 1.1.0
Macros for expanding C-style pre-processor macros at compile time. See 'ppx' for the library you ...
5 versions - Latest release: 4 months ago - 147 downloads total - 0 stars on GitHub - 1 maintainer
ppx 1.0.1
C-style pre-processor library
12 versions - Latest release: 4 months ago - 281 downloads total - 0 stars on GitHub - 1 maintainer
katana 1.0.2
A fast and accurate rule-based sentence segmentation tool for Rust. A port from Louie Mullie's Sc...
3 versions - Latest release: over 10 years ago - 11.4 thousand downloads total - 3 stars on GitHub - 1 maintainer
yosina 3.0.0
Japanese text transliteration library
9 versions - Latest release: about 1 month ago - 1.03 thousand downloads total - 20 stars on GitHub - 1 maintainer
wax-cli 0.2.1
An extension of HTML written in Rust
13 versions - Latest release: over 3 years ago - 15 thousand downloads total - 2 stars on GitHub - 1 maintainer
cuticula 0.2.0
Data Preprocessing library for Machine Learning
6 versions - Latest release: about 10 years ago - 1 dependent repositories - 11.3 thousand downloads total - 89 stars on GitHub - 1 maintainer
libradicl 0.13.0
support library for alevin-fry
20 versions - Latest release: 2 months ago - 3 dependent packages - 1 dependent repositories - 25.8 thousand downloads total - 5 stars on GitHub - 3 maintainers
ferrolearn-preprocess 0.2.2
Preprocessing transformers for the ferrolearn ML framework
6 versions - Latest release: 24 days ago - 147 downloads total - 15 stars on GitHub - 2 maintainers
qs-data-preprocess 0.1.1
Historical market data storage and preprocessing CLI
2 versions - Latest release: 3 months ago - 25 downloads total - 1 maintainer
robust_scaler 0.1.0
A RobustScaler for Rust, compatible with scikit-learn's RobustScaler
1 version - Latest release: 9 months ago - 385 downloads total - 1 stars on GitHub - 1 maintainer
ghostflow-data 1.0.0
Data loading utilities for GhostFlow ML framework
2 versions - Latest release: 5 months ago - 257 downloads total - 1 maintainer
sklears-preprocessing 0.1.1 💰
Data preprocessing for sklears: scaling, encoding, imputation, transformations
6 versions - Latest release: 27 days ago - 503 downloads total - 1 stars on GitHub - 1 maintainer
exg-luna 0.0.3
LUNA seizure-detection preprocessing pipeline for EEG — built on exg
1 version - Latest release: 2 months ago - 0 downloads total - 0 stars on GitHub - 1 maintainer
contractions 0.5.4
Contractions is a rust library to expand contractions in English.
3 versions - Latest release: almost 5 years ago - 1 dependent package - 1 dependent repositories - 32.4 thousand downloads total - 1 stars on GitHub - 1 maintainer
presto-cli 0.1.0
Presto accelerates preprocessing with precision.
1 version - Latest release: about 1 year ago - 744 downloads total - 0 stars on GitHub - 1 maintainer
radtk 0.2.0
A toolkit for working with RAD files
1 version - Latest release: almost 2 years ago - 1.24 thousand downloads total - 1 stars on GitHub - 1 maintainer
gather-all-code-from-crates 0.1.0
a Rust crate designed to extract, filter, and reconstruct code elements from Rust projects. It pr...
1 version - Latest release: over 1 year ago - 927 downloads total - 18 stars on GitHub - 1 maintainer
math_images_processor 0.1.1
A Rust library for preprocessing images of mathematical formulas, ideally for machine learning ap...
2 versions - Latest release: over 1 year ago - 2.15 thousand downloads total - 2 stars on GitHub - 1 maintainer
greenglas 0.3.0
Data Preprocessing library for Machine Learning
2 versions - Latest release: over 4 years ago - 1 dependent repositories - 3.4 thousand downloads total - 19 stars on GitHub - 1 maintainer