Ecosyste.ms: Packages
An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.
pypi.org "preprocessing" keyword
oc-preprocessing 0.0.5
This package is meant to preprocess OpenCitations source dumps so to make them easily usable in O...2 versions - Latest release: about 1 year ago - 18 downloads last month - 0 stars on GitHub - 1 maintainer
podium-nlp 0.1.1
Podium: a framework agnostic Python NLP library for data loading and preprocessing2 versions - Latest release: about 3 years ago - 1 dependent repositories - 24 downloads last month - 60 stars on GitHub - 1 maintainer
dcmpi 0.0.2.8
DICOM Preprocessing Interface.11 versions - Latest release: almost 4 years ago - 4 dependent repositories - 35 downloads last month - 2 stars on GitHub - 1 maintainer
fifa-preprocessing 1.1.2
A package providing methods to preprocess data, with the intent to perform Machine Learning.8 versions - Latest release: about 4 years ago - 1 dependent repositories - 27 downloads last month - 0 stars on GitHub - 1 maintainer
Top 4.7% on pypi.org
18 versions - Latest release: 8 months ago - 1 dependent package - 4 dependent repositories - 7.98 thousand downloads last month - 395 stars on GitHub - 1 maintainer
contextualspellcheck 0.4.4 💰
Contextual spell correction using BERT (bidirectional representations)18 versions - Latest release: 8 months ago - 1 dependent package - 4 dependent repositories - 7.98 thousand downloads last month - 395 stars on GitHub - 1 maintainer
nlpiper 0.3.1
NLPiper, a lightweight package integrated with a universe of frameworks to pre-process documents.5 versions - Latest release: about 2 years ago - 2 dependent repositories - 38 downloads last month - 17 stars on GitHub - 3 maintainers
bowline 0.2.2
Configurable tools to easily pre and post process your data for data-science and machine learning.5 versions - Latest release: about 2 years ago - 46 downloads last month - 2 stars on GitHub - 1 maintainer
advancedanalytics 1.3.0
Python support for 'The Art and Science of Data Analytics'46 versions - Latest release: almost 5 years ago - 351 downloads last month - 7 stars on GitHub - 1 maintainer
Top 3.0% on pypi.org
6 versions - Latest release: about 4 years ago - 11 dependent packages - 146 dependent repositories - 5.23 thousand downloads last month - 300 stars on GitHub - 1 maintainer
tweet-preprocessor 0.6.0
Elegant tweet preprocessing6 versions - Latest release: about 4 years ago - 11 dependent packages - 146 dependent repositories - 5.23 thousand downloads last month - 300 stars on GitHub - 1 maintainer
smashed 0.21.5
SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields ext...64 versions - Latest release: 8 months ago - 2 dependent packages - 1 dependent repositories - 11.4 thousand downloads last month - 30 stars on GitHub - 2 maintainers
autodatap 1.5.2
Automating Data Preprocessing33 versions - Latest release: 8 months ago - 193 downloads last month - 0 stars on GitHub - 1 maintainer
catprep 0.0.4
A preprocessing library for categorical variables4 versions - Latest release: almost 8 years ago - 1 dependent repositories - 16 downloads last month - 0 stars on GitHub - 1 maintainer
prep-ml 0.1.1
Preprocessing for ML models made easy.2 versions - Latest release: about 3 years ago - 1 dependent repositories - 15 downloads last month - 1 stars on GitHub - 1 maintainer
warfit-learn 0.2.1
A toolkit for reproducible research in warfarin dose estimation3 versions - Latest release: over 3 years ago - 1 dependent repositories - 28 downloads last month - 10 stars on GitHub - 1 maintainer
ocm2 0.2.0
This python package extracts subdatasets from OCM-2 HDF file, georeference them and exports them ...4 versions - Latest release: about 1 year ago - 26 downloads last month - 1 stars on GitHub - 1 maintainer
objectdetectiontools 1.3.4
A set of functions useful when doing object detection.21 versions - Latest release: over 1 year ago - 79 downloads last month - 0 stars on GitLab.com - 1 maintainer
cube-helper 2.2.3
Cube Helper is a package to make equalisation, concatenation, and analysis of Iris cubes easier.8 versions - Latest release: over 2 years ago - 1 dependent repositories - 41 downloads last month - 2 stars on GitHub - 2 maintainers
chunkyp 0.0.2
Ray-based preprocesisng pipeline.2 versions - Latest release: almost 4 years ago - 1 dependent repositories - 29 downloads last month - 0 stars on GitHub - 1 maintainer
featureforge 0.1.6
A library to build and test machine learning features7 versions - Latest release: almost 9 years ago - 5 dependent repositories - 36 downloads last month - 382 stars on GitHub - 2 maintainers
cleanflow 1.3.3a1
A a framework for cleaning, pre-processing and exploring data in a scalable and distributed manner.11 versions - Latest release: about 6 years ago - 26 downloads last month - 1 stars on GitHub - 1 maintainer
keras-aug 0.5.8
A library that includes pure TF/Keras preprocessing and augmentation layers9 versions - Latest release: 7 months ago - 54 downloads last month - 8 stars on GitHub - 1 maintainer
aicademecv 0.1
Image processing library with a specific focus on data preparation for skin based deep learning p...1 version - Latest release: almost 5 years ago - 19 downloads last month - 0 stars on GitHub - 1 maintainer
nltp 0.1.0
Simple automated text preprocessor1 version - Latest release: almost 4 years ago - 1 dependent repositories - 118 downloads last month - 0 stars on GitHub - 1 maintainer
preprocessingninja 0.0.1
A data preprocessing helper consists of your basic preprocessing needs1 version - Latest release: over 2 years ago - 1 dependent repositories - 10 downloads last month - 1 stars on GitHub - 1 maintainer
utp 0.2
helper functions for typical python problems1 version - Latest release: over 3 years ago - 1 dependent repositories - 17 downloads last month - 0 stars on GitHub - 1 maintainer
duplipy 0.2.0
A package for formatting and text replication, with added support for image augmentation.11 versions - Latest release: 6 months ago - 81 downloads last month - 0 stars on GitHub - 1 maintainer
tdprepview 1.4.1
Python Package that creates Data Preparation Pipeline in Teradata-SQL in Views16 versions - Latest release: about 1 month ago - 107 downloads last month - 1 maintainer
mordineznlp 0.1.0
Powerfull python tool for modern NLP processing34 versions - Latest release: about 2 years ago - 1 dependent repositories - 89 downloads last month - 2 stars on GitHub - 1 maintainer
findanywhere
1 versionsktutor 0.2.3
sktutor helps your machines learn.23 versions - Latest release: almost 2 years ago - 1 dependent repositories - 2.66 thousand downloads last month - 2 stars on GitHub - 1 maintainer
masksemi 0.0.1
Code for converting a list of labels in a scikit-like semi-supervised labels.1 version - Latest release: about 3 years ago - 1 dependent repositories - 8 downloads last month - 1 stars on GitHub - 1 maintainer
mercury-imgpprcs 0.0.1
Mercury: Image Pre-processing Open Source API for Artificial Intelligence1 version - Latest release: over 3 years ago - 1 dependent repositories - 7 downloads last month - 0 stars on GitHub - 1 maintainer
chariot 0.5.6
Deliver the ready-to-train data to your NLP model.19 versions - Latest release: over 4 years ago - 1 dependent repositories - 119 downloads last month - 122 stars on GitHub - 1 maintainer
sparklanes 0.2.4
A lightweight framework to build and execute data processing pipelines in pyspark (Apache Spark's...5 versions - Latest release: over 5 years ago - 1 dependent repositories - 27 downloads last month - 16 stars on GitHub - 1 maintainer
motion-learning-toolbox 1.0.6
Python library for preprocessing of XR motion tracking data for machine learning applications.5 versions - Latest release: about 1 month ago - 47 downloads last month - 5 stars on GitHub - 2 maintainers
csv2mne 0.0.1
Data formater2 versions - Latest release: over 1 year ago - 12 downloads last month - 1 maintainer
demv 1.0.2
Debiaser for Multiple Variables(DEMV) is a pre-processing algorithm for binary and multi-class da...3 versions - Latest release: 3 months ago - 21 downloads last month - 0 stars on GitHub - 1 maintainer
logprep 11.3.0
Logprep allows to collect, process and forward log messages from various data sources.36 versions - Latest release: 19 days ago - 408 downloads last month - 24 stars on GitHub - 3 maintainers
jionlp-py39 1.3.45
Chinese NLPreprocessing & Parsing2 versions - Latest release: over 2 years ago - 1 dependent repositories - 17 downloads last month - 3,015 stars on GitHub - 1 maintainer
Top 3.6% on pypi.org
31 versions - Latest release: about 1 month ago - 7 dependent packages - 6 dependent repositories - 2.99 thousand downloads last month - 3,015 stars on GitHub - 1 maintainer
jionlp 1.5.11
Chinese NLP Preprocessing & Parsing31 versions - Latest release: about 1 month ago - 7 dependent packages - 6 dependent repositories - 2.99 thousand downloads last month - 3,015 stars on GitHub - 1 maintainer
Top 8.3% on pypi.org
14 versions - Latest release: about 3 years ago - 4 dependent repositories - 1.61 thousand downloads last month - 74 stars on GitHub - 4 maintainers
multi-imbalance 0.0.14
Python package for tackling multiclass imbalance problems.14 versions - Latest release: about 3 years ago - 4 dependent repositories - 1.61 thousand downloads last month - 74 stars on GitHub - 4 maintainers
bio-volumentations 1.2.0
Library for 3D-5D augmentations of volumetric multi-dimensional biomedical images and their annot...7 versions - Latest release: 27 days ago - 147 downloads last month - 1,971 stars on GitHub - 2 maintainers
openav 1.0.0a11
OpenAV8 versions - Latest release: 21 days ago - 737 downloads last month - 3 stars on GitHub - 1 maintainer
take-text-preprocess 0.0.5
Text Preprocesser10 versions - Latest release: over 2 years ago - 4 dependent packages - 1 dependent repositories - 193 downloads last month - 1 maintainer
nlcodec 0.4.0
nlcodec is a collection of encoding schemes for natural language sequences. nlcodec.db is a effi...10 versions - Latest release: almost 3 years ago - 1 dependent package - 2 dependent repositories - 94 downloads last month - 5 stars on GitHub - 1 maintainer
arm-preprocessing 0.2.2
Implementation of several preprocessing techniques for Association Rule Mining (ARM)5 versions - Latest release: 2 months ago - 30 downloads last month - 2 stars on GitHub - 2 maintainers
qd 0.8.9
QD-Engineering Python Library for CAE32 versions - Latest release: over 4 years ago - 1 dependent repositories - 522 downloads last month - 1 maintainer
Top 5.4% on pypi.org
27 versions - Latest release: about 1 month ago - 8 dependent packages - 31 dependent repositories - 484 thousand downloads last month - 69 stars on GitHub - 1 maintainer
courlan 1.1.0
Clean, filter and sample URLs to optimize data collection – includes spam, content type and langu...27 versions - Latest release: about 1 month ago - 8 dependent packages - 31 dependent repositories - 484 thousand downloads last month - 69 stars on GitHub - 1 maintainer
cmip6-preprocessing 0.6.0
Analysis ready CMIP6 data the easy way11 versions - Latest release: almost 2 years ago - 1 dependent package - 118 downloads last month - 187 stars on GitHub - 1 maintainer
nlp-preprocessing-qvm9 0.0.4
Fix the import error2 versions - Latest release: over 1 year ago - 16 downloads last month - 1 maintainer
arrowtextclassifier 1.0.0
ArrowTextClassifier is a simple text classification tool written in pytorch that allows you to tr...4 versions - Latest release: about 1 month ago - 525 downloads last month - 1 maintainer
arac 0.0.1
Data Processing is used for data processing through MinIO, databases, Web APIs, etc.1 version - Latest release: 3 months ago - 19 downloads last month - 1 maintainer
designer2 2.0.7
designerV235 versions - Latest release: 2 months ago - 166 downloads last month - 11 stars on GitHub - 3 maintainers
lughaatnlp 1.0.5
A Python package for natural language processing tasks for the Urdu language, including normaliza...5 versions - Latest release: about 2 months ago - 175 downloads last month - 0 stars on GitHub - 1 maintainer
arctix 0.0.5
A library to get a text summary of nested objects12 versions - Latest release: 20 days ago - 6.38 thousand downloads last month - 0 stars on GitHub - 1 maintainer
ecg-qc 1.0b6
a package to compute if ECG signal quality is optimal or noisy6 versions - Latest release: over 2 years ago - 3 dependent repositories - 71 downloads last month - 36 stars on GitHub - 1 maintainer
Top 5.5% on pypi.org
18 versions - Latest release: almost 3 years ago - 26 dependent repositories - 898 downloads last month - 373 stars on GitHub - 1 maintainer
nonechucks 0.4.2
nonechucks is a library that provides wrappers for PyTorch's datasets, samplers, and transforms t...18 versions - Latest release: almost 3 years ago - 26 dependent repositories - 898 downloads last month - 373 stars on GitHub - 1 maintainer
sk-transformers 0.11.0
A collection of various pandas & scikit-learn compatible transformers for all kinds of preprocess...25 versions - Latest release: about 1 year ago - 1 dependent repositories - 298 downloads last month - 8 stars on GitHub - 1 maintainer
alkymi 0.3.1
alkymi - Pythonic task automation10 versions - Latest release: 17 days ago - 1 dependent repositories - 266 downloads last month - 44 stars on GitHub - 1 maintainer
pynmranalysis 1.1.3
python library for NMR preprocessing and analysis9 versions - Latest release: almost 3 years ago - 1 dependent repositories - 70 downloads last month - 3 stars on GitHub - 1 maintainer
dmriprep 0.5.0
dMRIPrep is a robust and easy-to-use pipeline for preprocessing of diverse dMRI data.8 versions - Latest release: about 3 years ago - 1 dependent repositories - 100 downloads last month - 62 stars on GitHub - 2 maintainers
unstructured-api-tools 0.10.11
A library that prepares raw documents for downstream ML tasks.33 versions - Latest release: 10 months ago - 2 dependent repositories - 256 downloads last month - 28 stars on GitHub - 1 maintainer
ab-data-processing 0.0.1
Data Processing is used for data processing through MinIO, databases, Web APIs, etc.1 version - Latest release: 4 months ago - 26 downloads last month - 43 stars on GitHub - 1 maintainer
one-data-processing 0.0.14
Data Processing is used for data processing through MinIO, databases, Web APIs, etc.14 versions - Latest release: 4 months ago - 104 downloads last month - 43 stars on GitHub - 1 maintainer
kubeagi 0.0.4
Data Processing is used for data processing through MinIO, databases, Web APIs, etc.4 versions - Latest release: 3 months ago - 45 downloads last month - 3 stars on GitHub - 1 maintainer
a-data-processing 0.0.1
A library that prepares raw documents for downstream ML tasks.1 version - Latest release: 4 months ago - 29 downloads last month - 43 stars on GitHub - 1 maintainer
vflow 0.1.4
A framework for doing stability analysis with PCS.7 versions - Latest release: 4 months ago - 1 dependent repositories - 67 downloads last month - 60 stars on GitHub - 2 maintainers
py-autocleanre 1.1.4
AutoClean - Python Package for Automated Preprocessing & Cleaning of Datasets1 version - Latest release: about 1 year ago - 22 downloads last month - 219 stars on GitHub - 1 maintainer
Top 3.1% on pypi.org
21 versions - Latest release: 4 months ago - 6 dependent packages - 137 dependent repositories - 2.55 million downloads last month - 526 stars on GitHub - 2 maintainers
seqio 0.0.19
SeqIO: Task-based datasets, preprocessing, and evaluation for sequence models.21 versions - Latest release: 4 months ago - 6 dependent packages - 137 dependent repositories - 2.55 million downloads last month - 526 stars on GitHub - 2 maintainers
pytough 1.6.2
Python scripting library for TOUGH2 simulation3 versions - Latest release: about 2 months ago - 1.44 thousand downloads last month - 90 stars on GitHub - 1 maintainer
Top 9.5% on pypi.org
21 versions - Latest release: almost 2 years ago - 1 dependent package - 1 dependent repositories - 429 downloads last month - 218 stars on GitHub - 1 maintainer
py-autoclean 1.1.3
AutoClean - Python Package for Automated Preprocessing & Cleaning of Datasets21 versions - Latest release: almost 2 years ago - 1 dependent package - 1 dependent repositories - 429 downloads last month - 218 stars on GitHub - 1 maintainer
Top 4.2% on pypi.org
35 versions - Latest release: 4 months ago - 4 dependent packages - 5 dependent repositories - 10.3 thousand downloads last month - 945 stars on GitHub - 1 maintainer
nnaudio 0.3.3
A fast GPU audio processing toolbox with 1D convolutional neural network35 versions - Latest release: 4 months ago - 4 dependent packages - 5 dependent repositories - 10.3 thousand downloads last month - 945 stars on GitHub - 1 maintainer
niaarm 0.3.9
A minimalistic framework for numerical association rule mining21 versions - Latest release: about 2 months ago - 2 dependent packages - 1 dependent repositories - 317 downloads last month - 14 stars on GitHub - 1 maintainer
Top 3.8% on pypi.org
10 versions - Latest release: 7 months ago - 4 dependent packages - 41 dependent repositories - 5.59 thousand downloads last month - 134 stars on GitHub - 3 maintainers
autoreject 0.4.3
Automated rejection and repair of epochs in M/EEG.10 versions - Latest release: 7 months ago - 4 dependent packages - 41 dependent repositories - 5.59 thousand downloads last month - 134 stars on GitHub - 3 maintainers
ctxpro 0.0.5
Simple toolkit that extracts ambiguities in documents that require context to resolve.5 versions - Latest release: 4 months ago - 21 downloads last month - 0 stars on GitHub - 1 maintainer
adcl 0.1.7
Data preprocessing and cleaning tools for data science projects8 versions - Latest release: about 1 month ago - 139 downloads last month - 0 stars on GitHub - 1 maintainer
massivetools 0.0.1 removed
Useful packages to perform computer vision tasks1 version - Latest release: 2 months ago - 1 maintainer
computer-vision-utils 0.0.1
Useful packages to perform computer vision tasks1 version - Latest release: 2 months ago - 42 downloads last month - 1 maintainer
mngdataclean 0.4.2
Text preprocessing package5 versions - Latest release: 3 months ago - 28 downloads last month - 0 stars on GitHub - 1 maintainer
sleepeegpy 0.5.1
Sleep EEG preprocessing, analysis and visualization2 versions - Latest release: 6 months ago - 25 downloads last month - 11 stars on GitHub - 1 maintainer
kditransform 0.2.0
Kernel density integral transformation2 versions - Latest release: 6 months ago - 156 downloads last month - 4 stars on GitHub - 1 maintainer
langchain-addons 0.0.2
...3 versions - Latest release: 12 months ago - 163 downloads last month - 1 maintainer
datadoctor 1.0.15
A Python package for data cleaning and preprocessing.14 versions - Latest release: 12 months ago - 50 downloads last month - 2 stars on GitHub - 1 maintainer
sleepeeg 0.4.1
Sleep EEG preprocessing and analysis13 versions - Latest release: 9 months ago - 92 downloads last month - 10 stars on GitHub - 1 maintainer
scalde-data-factory 0.0.1
Data preparations tools for data science projects1 version - Latest release: about 1 year ago - 7 downloads last month - 1 maintainer
hot-fair-utilities 1.2.3 💰
Utilities for AI - Assisted Mapping fAIr22 versions - Latest release: 7 months ago - 125 downloads last month - 11 stars on GitHub - 2 maintainers
hotlib 1.0.33
Utilities for an AI-assisted mapping tool developed for HOT.33 versions - Latest release: over 1 year ago - 1 dependent repositories - 226 downloads last month - 1 maintainer
wrainfo 0.9.5
Is a software to process FURUNO weather radar data.8 versions - Latest release: 11 months ago - 30 downloads last month - 0 stars on GitHub - 1 maintainer
drpt 0.8.2
Tool for preparing a dataset for publishing by dropping, renaming, scaling, and obfuscating colum...18 versions - Latest release: over 1 year ago - 147 downloads last month - 0 stars on GitHub - 1 maintainer
bids-derivatives 0.0.1
Python package for querying BIDS Apps` processed derivatives.2 versions - Latest release: almost 2 years ago - 25 downloads last month - 2 stars on GitHub - 1 maintainer
python-standardize-it 1.0.2
Tool for standardizing strings against a known set of standards3 versions - Latest release: almost 2 years ago - 29 downloads last month - 1 stars on GitHub - 1 maintainer
weboa 0.6.8 removed
weboa is a cli tool to create templates for websites and preprocessors18 versions - Latest release: over 2 years ago - 1 dependent repositories - 68 downloads last month - 0 stars on GitHub - 1 maintainer
templatext 0.0.2
Text preprocessing template for NLP.2 versions - Latest release: over 3 years ago - 1 dependent repositories - 8 downloads last month - 0 stars on GitHub - 1 maintainer
sparkora 0.0.1
Exploratory data analysis toolkit for Pyspark1 version - Latest release: over 2 years ago - 1 dependent repositories - 16 downloads last month - 53 stars on GitHub - 1 maintainer
smart-data-tools 0.3.1
This library contains adjusted tools for data preprocessing and working with mixed data types.2 versions - Latest release: over 3 years ago - 1 dependent repositories - 7 downloads last month - 18 stars on GitHub - 1 maintainer
silk-ml 0.1.1
Simple Intelligent Learning Kit (SILK) for Machine learning5 versions - Latest release: over 4 years ago - 1 dependent repositories - 35 downloads last month - 3 stars on GitHub - 1 maintainer
seqtools 1.4.1
A library for transparent transformation of indexable containers (lists, etc.)13 versions - Latest release: about 2 months ago - 2 dependent repositories - 702 downloads last month - 46 stars on GitHub - 1 maintainer
seq-qc 2.0.4
utilities for performing various preprocessing steps on sequencing reads10 versions - Latest release: over 6 years ago - 2 dependent repositories - 41 downloads last month - 0 stars on GitHub - 1 maintainer
Top 6.6% on pypi.org
20 versions - Latest release: 6 months ago - 1 dependent package - 5 dependent repositories - 252 downloads last month - 1 maintainer
ryd 0.9.2
Ruamel Yaml Doc preprocessor (pronounced: /rɑɪt/, like the verb "write")20 versions - Latest release: 6 months ago - 1 dependent package - 5 dependent repositories - 252 downloads last month - 1 maintainer
recipipe 0.0.5
Improved pipelines for data science projects.6 versions - Latest release: about 4 years ago - 1 dependent repositories - 47 downloads last month - 4 stars on GitHub - 1 maintainer
Related Keywords
python
71
machine-learning
43
nlp
36
data
29
data-science
24
pandas
19
text
17
natural-language-processing
16
NLP
15
pipeline
14
machine learning
14
dataset
13
sklearn
12
processing
12
data science
12
pytorch
12
scikit-learn
11
deep-learning
10
eeg
10
natural language processing
10
python3
9
classification
9
parsing
9
ml
9
text-processing
9
PDF
8
normalization
8
postprocessing
7
computer-vision
7
learning
7
statistics
6
cleaning
6
pypi
6
image
6
automl
6
data-preprocessing
6
analysis
6
WEB
5
WORD
5
science
5
pipelines
5
regression
5
data processing
5
fmri
5
keras
5
data-analysis
5
stream
5
data-cleaning
5
tensorflow
5
feature-engineering
5
dataframe
5
lemmatization
4
automated
4
feature-selection
4
neuroimaging
4
numpy
4
datascience
4
Preprocessing
4
transformer
4
augmentation
4
deep learning
4
langchain
4
llm
4
mri
4
eda
4
preprocess
4
visualization
4
machine
4
mne-python
4
tokenization
4
neuroscience
3
xgboost
3
feature engineering
3
time-series
3
pre-processing
3
stanza
3
io
3
semi-supervised-learning
3
ray
3
lightgbm
3
toolkit
3
language
3
spacy
3
Statistics
3
ComputerVision
3
ArtificialIntelligence
3
artificial-intelligence
3
XML
3
CV
3
pyspark
3
HTML
3
data-mining
3
hacktoberfest
3
data-processing
3
eeg-analysis
3
mining
3
images
3
wrapper
3
features
3
deep
3