Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "pre-processing" keyword

tubular 1.2.2
Package to perform pre processing steps for machine learning models
18 versions - Latest release: 3 months ago - 1 dependent repositories - 804 downloads last month - 36 stars on GitHub - 1 maintainer
nhp-prep 0.4.0
Pre-processing data tool for NHP Lab @ CMU
11 versions - Latest release: 10 months ago - 45 downloads last month - 2 maintainers
Top 4.5% on pypi.org
urduhack 1.1.1 💰
Natural Language Processing (NLP) library for Urdu language.
23 versions - Latest release: almost 4 years ago - 1 dependent package - 8 dependent repositories - 14 thousand downloads last month - 274 stars on GitHub - 1 maintainer
cleandat 0.0.3
Python functions to facilitate the pre-processing of data for ML tasks in a clinical context.
3 versions - Latest release: 4 months ago - 26 downloads last month - 1 stars on GitHub - 1 maintainer
dfm-tools 0.23.0
dfm_tools are pre- and post-processing tools for Delft3D FM
15 versions - Latest release: 6 days ago - 290 downloads last month - 59 stars on GitHub - 3 maintainers
Top 6.0% on pypi.org
mosestokenizer 1.2.1
Wrappers for several pre-processing scripts from the Moses toolkit.
5 versions - Latest release: over 2 years ago - 9 dependent packages - 74 dependent repositories - 37.8 thousand downloads last month - 18 stars on GitHub - 1 maintainer
tubelearns 2.1.0
Python script for extracting, cleaning, and tokenizing YouTube video transcripts for Pre-Processi...
8 versions - Latest release: 2 months ago - 68 downloads last month - 5 stars on GitHub - 2 maintainers
delia 1.2.7
DICOM Extraction for Large-scale Image Analysis (DELIA).
33 versions - Latest release: 5 months ago - 1 dependent package - 264 downloads last month - 12 stars on GitHub - 1 maintainer
swprepost 2.0.0
A Python Package for Surface Wave Inversion Pre- and Post-Processing
5 versions - Latest release: about 2 years ago - 1 dependent package - 1 dependent repositories - 98 downloads last month - 30 stars on GitHub - 1 maintainer
Top 4.6% on pypi.org
smogn 0.1.2
A Python implementation of Synthetic Minority Over-Sampling Technique for Regression with Gaussia...
6 versions - Latest release: about 4 years ago - 2 dependent packages - 9 dependent repositories - 1.67 thousand downloads last month - 288 stars on GitHub - 1 maintainer
simple-encoders 0.1.6
Simple encoders to pre-process categoric variables for machine learning systems
7 versions - Latest release: over 2 years ago - 1 dependent repositories - 59 downloads last month - 0 stars on GitHub - 1 maintainer
pyzohar 0.1.13
a private package on data pre-processing.
22 versions - Latest release: 9 months ago - 1 dependent repositories - 170 downloads last month - 1 maintainer
pyimbalreg 0.0.3
Pre-processing technics for imbalanced datasets in regression modelling
2 versions - Latest release: over 3 years ago - 1 dependent repositories - 22 downloads last month - 9 stars on GitHub - 1 maintainer
preprocessingtext 0.0.4
A series of methods to help you work pre processing of text in general, like stem, tokenizer and ...
4 versions - Latest release: over 5 years ago - 1 dependent repositories - 16 downloads last month - 3 stars on GitHub - 1 maintainer
Top 9.3% on pypi.org
preprocessing 0.1.13
pre-processing package for text strings
2 versions - Latest release: over 6 years ago - 89 dependent repositories - 9.49 thousand downloads last month - 7 stars on GitHub - 1 maintainer
parallelio 0.9
Basic tools for working with natural language text data
2 versions - Latest release: over 6 years ago - 1 dependent repositories - 29 downloads last month - 3 stars on GitHub - 1 maintainer
overtokenizer 0.2.0
Unicode-based language-agnostic (over-) tokenizer.
2 versions - Latest release: about 6 years ago - 1 dependent repositories - 19 downloads last month - 1 maintainer
giganticode-dataprep 1.0.0a12
A toolkit for pre-processing large source code corpora
6 versions - Latest release: over 4 years ago - 1 dependent repositories - 58 downloads last month - 45 stars on GitHub - 1 maintainer
giganticode-codeprep 1.0.0
A toolkit for pre-processing large source code corpora
1 version - Latest release: over 4 years ago - 1 dependent repositories - 13 downloads last month - 45 stars on GitHub - 1 maintainer
entity-embeddings-categorical 0.6.7
Discover relevant information about categorical data with entity embeddings using Neural Networks...
13 versions - Latest release: over 4 years ago - 1 dependent repositories - 99 downloads last month - 69 stars on GitHub - 1 maintainer
cpip 0.9.9
CPIP is a C/C++ Preprocessor implemented in Python.
2 versions - Latest release: 11 months ago - 1 dependent repositories - 1.14 thousand downloads last month - 39 stars on GitHub - 1 maintainer
codeprep 1.0.5
A toolkit for pre-processing large source code corpora
4 versions - Latest release: about 3 years ago - 1 dependent repositories - 49 downloads last month - 45 stars on GitHub - 1 maintainer
torch-assimilate 0.2.1
torch-assimilate is a data assimilation package based on PyTorch, xarray and dask
1 version - Latest release: over 3 years ago - 1 dependent repositories - 11 downloads last month - 1 stars on GitLab.com - 1 maintainer
bedrock 0.1.0.dev10
Bedrock is a high-level text pre-processing API, written in Python and can run on NLTK or Spacy a...
1 version - Latest release: about 6 years ago - 1 dependent repositories - 420 downloads last month - 3 stars on GitHub - 1 maintainer
augmax 0.3.2
Efficiently Composable Data Augmentation on the GPU with Jax
8 versions - Latest release: 3 months ago - 1 dependent repositories - 3.18 thousand downloads last month - 20 stars on GitHub - 1 maintainer
arabicpreprocessing 0.4
Arabic PreProcessing Functions that must applied before run any ML model.
3 versions - Latest release: over 4 years ago - 14 downloads last month - 0 stars on GitHub - 1 maintainer
imbalancedlearningregression 0.0.1
Python implementations of preprocesssing imbalanced data for regression
1 version - Latest release: about 2 years ago - 1 dependent repositories - 113 downloads last month - 36 stars on GitHub - 4 maintainers
nanoprep-ffm 0.0.17
A fully-equipped, fast, and memory-efficient pre-processor for ONT transcriptomic data
13 versions - Latest release: 10 months ago - 73 downloads last month - 0 stars on GitHub - 1 maintainer
advancedanalytics 1.3.0
Python support for 'The Art and Science of Data Analytics'
46 versions - Latest release: over 4 years ago - 417 downloads last month - 7 stars on GitHub - 1 maintainer
nanoprep-ccchu 0.0.0 removed
A fully-equipped, fast, and memory-efficient pre-processor for ONT transcriptomic data
1 version - Latest release: over 1 year ago - 1 stars on GitHub
nanoprep-ccc-test2 0.0.0 removed
A fully-equipped, fast, and memory-efficient pre-processor for ONT transcriptomic data
1 version - Latest release: over 1 year ago - 1 stars on GitHub
nanoprep-ccc-test3 0.0.3 removed
A fully-equipped, fast, and memory-efficient pre-processor for ONT transcriptomic data
4 versions - Latest release: over 1 year ago - 1 stars on GitHub
nanoprep-ccc 0.0.3 removed
A fully-equipped, fast, and memory-efficient pre-processor for ONT transcriptomic data
1 version - Latest release: over 1 year ago - 1 stars on GitHub
tubelearn 1.0.0 removed
Python script for extracting and cleaning YouTube video transcripts for Pre-Processing in machine...
1 version - Latest release: 8 months ago - 1 maintainer