Ecosyste.ms: Packages
An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.
pypi.org "pre-processing" keyword
tubular 1.2.2
Package to perform pre processing steps for machine learning models18 versions - Latest release: 3 months ago - 1 dependent repositories - 804 downloads last month - 36 stars on GitHub - 1 maintainer
nhp-prep 0.4.0
Pre-processing data tool for NHP Lab @ CMU11 versions - Latest release: 10 months ago - 45 downloads last month - 2 maintainers
Top 4.5% on pypi.org
23 versions - Latest release: almost 4 years ago - 1 dependent package - 8 dependent repositories - 14 thousand downloads last month - 274 stars on GitHub - 1 maintainer
urduhack 1.1.1 💰
Natural Language Processing (NLP) library for Urdu language.23 versions - Latest release: almost 4 years ago - 1 dependent package - 8 dependent repositories - 14 thousand downloads last month - 274 stars on GitHub - 1 maintainer
cleandat 0.0.3
Python functions to facilitate the pre-processing of data for ML tasks in a clinical context.3 versions - Latest release: 4 months ago - 26 downloads last month - 1 stars on GitHub - 1 maintainer
dfm-tools 0.23.0
dfm_tools are pre- and post-processing tools for Delft3D FM15 versions - Latest release: 6 days ago - 290 downloads last month - 59 stars on GitHub - 3 maintainers
Top 6.0% on pypi.org
5 versions - Latest release: over 2 years ago - 9 dependent packages - 74 dependent repositories - 37.8 thousand downloads last month - 18 stars on GitHub - 1 maintainer
mosestokenizer 1.2.1
Wrappers for several pre-processing scripts from the Moses toolkit.5 versions - Latest release: over 2 years ago - 9 dependent packages - 74 dependent repositories - 37.8 thousand downloads last month - 18 stars on GitHub - 1 maintainer
tubelearns 2.1.0
Python script for extracting, cleaning, and tokenizing YouTube video transcripts for Pre-Processi...8 versions - Latest release: 2 months ago - 68 downloads last month - 5 stars on GitHub - 2 maintainers
delia 1.2.7
DICOM Extraction for Large-scale Image Analysis (DELIA).33 versions - Latest release: 5 months ago - 1 dependent package - 264 downloads last month - 12 stars on GitHub - 1 maintainer
swprepost 2.0.0
A Python Package for Surface Wave Inversion Pre- and Post-Processing5 versions - Latest release: about 2 years ago - 1 dependent package - 1 dependent repositories - 98 downloads last month - 30 stars on GitHub - 1 maintainer
Top 4.6% on pypi.org
6 versions - Latest release: about 4 years ago - 2 dependent packages - 9 dependent repositories - 1.67 thousand downloads last month - 288 stars on GitHub - 1 maintainer
smogn 0.1.2
A Python implementation of Synthetic Minority Over-Sampling Technique for Regression with Gaussia...6 versions - Latest release: about 4 years ago - 2 dependent packages - 9 dependent repositories - 1.67 thousand downloads last month - 288 stars on GitHub - 1 maintainer
simple-encoders 0.1.6
Simple encoders to pre-process categoric variables for machine learning systems7 versions - Latest release: over 2 years ago - 1 dependent repositories - 59 downloads last month - 0 stars on GitHub - 1 maintainer
pyzohar 0.1.13
a private package on data pre-processing.22 versions - Latest release: 9 months ago - 1 dependent repositories - 170 downloads last month - 1 maintainer
pyimbalreg 0.0.3
Pre-processing technics for imbalanced datasets in regression modelling2 versions - Latest release: over 3 years ago - 1 dependent repositories - 22 downloads last month - 9 stars on GitHub - 1 maintainer
preprocessingtext 0.0.4
A series of methods to help you work pre processing of text in general, like stem, tokenizer and ...4 versions - Latest release: over 5 years ago - 1 dependent repositories - 16 downloads last month - 3 stars on GitHub - 1 maintainer
Top 9.3% on pypi.org
2 versions - Latest release: over 6 years ago - 89 dependent repositories - 9.49 thousand downloads last month - 7 stars on GitHub - 1 maintainer
preprocessing 0.1.13
pre-processing package for text strings2 versions - Latest release: over 6 years ago - 89 dependent repositories - 9.49 thousand downloads last month - 7 stars on GitHub - 1 maintainer
parallelio 0.9
Basic tools for working with natural language text data2 versions - Latest release: over 6 years ago - 1 dependent repositories - 29 downloads last month - 3 stars on GitHub - 1 maintainer
overtokenizer 0.2.0
Unicode-based language-agnostic (over-) tokenizer.2 versions - Latest release: about 6 years ago - 1 dependent repositories - 19 downloads last month - 1 maintainer
giganticode-dataprep 1.0.0a12
A toolkit for pre-processing large source code corpora6 versions - Latest release: over 4 years ago - 1 dependent repositories - 58 downloads last month - 45 stars on GitHub - 1 maintainer
giganticode-codeprep 1.0.0
A toolkit for pre-processing large source code corpora1 version - Latest release: over 4 years ago - 1 dependent repositories - 13 downloads last month - 45 stars on GitHub - 1 maintainer
entity-embeddings-categorical 0.6.7
Discover relevant information about categorical data with entity embeddings using Neural Networks...13 versions - Latest release: over 4 years ago - 1 dependent repositories - 99 downloads last month - 69 stars on GitHub - 1 maintainer
cpip 0.9.9
CPIP is a C/C++ Preprocessor implemented in Python.2 versions - Latest release: 11 months ago - 1 dependent repositories - 1.14 thousand downloads last month - 39 stars on GitHub - 1 maintainer
codeprep 1.0.5
A toolkit for pre-processing large source code corpora4 versions - Latest release: about 3 years ago - 1 dependent repositories - 49 downloads last month - 45 stars on GitHub - 1 maintainer
torch-assimilate 0.2.1
torch-assimilate is a data assimilation package based on PyTorch, xarray and dask1 version - Latest release: over 3 years ago - 1 dependent repositories - 11 downloads last month - 1 stars on GitLab.com - 1 maintainer
bedrock 0.1.0.dev10
Bedrock is a high-level text pre-processing API, written in Python and can run on NLTK or Spacy a...1 version - Latest release: about 6 years ago - 1 dependent repositories - 420 downloads last month - 3 stars on GitHub - 1 maintainer
augmax 0.3.2
Efficiently Composable Data Augmentation on the GPU with Jax8 versions - Latest release: 3 months ago - 1 dependent repositories - 3.18 thousand downloads last month - 20 stars on GitHub - 1 maintainer
arabicpreprocessing 0.4
Arabic PreProcessing Functions that must applied before run any ML model.3 versions - Latest release: over 4 years ago - 14 downloads last month - 0 stars on GitHub - 1 maintainer
imbalancedlearningregression 0.0.1
Python implementations of preprocesssing imbalanced data for regression1 version - Latest release: about 2 years ago - 1 dependent repositories - 113 downloads last month - 36 stars on GitHub - 4 maintainers
nanoprep-ffm 0.0.17
A fully-equipped, fast, and memory-efficient pre-processor for ONT transcriptomic data13 versions - Latest release: 10 months ago - 73 downloads last month - 0 stars on GitHub - 1 maintainer
advancedanalytics 1.3.0
Python support for 'The Art and Science of Data Analytics'46 versions - Latest release: over 4 years ago - 417 downloads last month - 7 stars on GitHub - 1 maintainer
nanoprep-ccchu 0.0.0 removed
A fully-equipped, fast, and memory-efficient pre-processor for ONT transcriptomic data1 version - Latest release: over 1 year ago - 1 stars on GitHub
nanoprep-ccc-test2 0.0.0 removed
A fully-equipped, fast, and memory-efficient pre-processor for ONT transcriptomic data1 version - Latest release: over 1 year ago - 1 stars on GitHub
nanoprep-ccc-test3 0.0.3 removed
A fully-equipped, fast, and memory-efficient pre-processor for ONT transcriptomic data4 versions - Latest release: over 1 year ago - 1 stars on GitHub
nanoprep-ccc 0.0.3 removed
A fully-equipped, fast, and memory-efficient pre-processor for ONT transcriptomic data1 version - Latest release: over 1 year ago - 1 stars on GitHub
tubelearn 1.0.0 removed
Python script for extracting and cleaning YouTube video transcripts for Pre-Processing in machine...1 version - Latest release: 8 months ago - 1 maintainer
Related Keywords
data
7
text
7
python
6
machine
5
learning
5
nlp
5
nanopore-sequencing
5
natural-language-processing
4
machine-learning
4
regression
4
source-code-analysis
3
mining-software-repositories
3
preprocessing
3
big
3
large
3
source
3
code
3
corpus
3
language-modeling
3
post-processing
3
over-sampling
3
synthetic data
3
word-segmentation
3
imbalanced data
3
NLP
2
pandas
2
cleaning
2
raw data
2
tokenization
2
video
2
transcript
2
data-science
2
machine learning
2
natural language processing
2
smote
2
deep-learning
2
entity-embedding
1
keras
1
neural-networks
1
utility-library
1
cpip
1
c
1
embeddings
1
categorical-data
1
text-data
1
io
1
files
1
undersampling
1
data augmentation
1
imblanaced regression
1
PyImbalReg
1
under-sampling
1
Analytics
1
data map
1
postprocessing
1
NLTK
1
Sci-Learn
1
sklearn
1
StatsModels
1
web scraping
1
word cloud
1
decision trees
1
random forest
1
neural network
1
cross validation
1
topic analysis
1
sentiment analytic
1
c-plus-plus
1
pre-processor
1
preprocessor
1
statistics
1
meteorology
1
testbed
1
forecast
1
assimilation
1
gpu
1
jax
1
Arabic
1
Gaussian noise
1
condensed nearest neighbour
1
edited nearest neighbour
1
Tomek links
1
ADASYN
1
encoding
1
machine-learning-datasets
1
modelbuilder
1
hisfiles
1
mapfiles
1
D-HYDRO
1
D-FlowFM
1
dfm_tools
1
ml
1
maschine-learning
1
etl-pipeline
1
etl
1
ehr
1
clinical-trials
1
clinical-data
1
urduhack
1
urdu-text-processsing
1