Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "preprocessing" keyword

oc-preprocessing 0.0.5
This package is meant to preprocess OpenCitations source dumps so as to make them easily usable in O...
2 versions - Latest release: about 1 year ago - 18 downloads last month - 0 stars on GitHub - 1 maintainer
podium-nlp 0.1.1
Podium: a framework agnostic Python NLP library for data loading and preprocessing
2 versions - Latest release: about 3 years ago - 1 dependent repositories - 24 downloads last month - 60 stars on GitHub - 1 maintainer
dcmpi 0.0.2.8
DICOM Preprocessing Interface.
11 versions - Latest release: almost 4 years ago - 4 dependent repositories - 35 downloads last month - 2 stars on GitHub - 1 maintainer
fifa-preprocessing 1.1.2
A package providing methods to preprocess data, with the intent to perform Machine Learning.
8 versions - Latest release: about 4 years ago - 1 dependent repositories - 27 downloads last month - 0 stars on GitHub - 1 maintainer
Top 4.7% on pypi.org
contextualspellcheck 0.4.4 💰
Contextual spell correction using BERT (bidirectional representations)
18 versions - Latest release: 8 months ago - 1 dependent package - 4 dependent repositories - 7.98 thousand downloads last month - 395 stars on GitHub - 1 maintainer
nlpiper 0.3.1
NLPiper, a lightweight package integrated with a universe of frameworks to pre-process documents.
5 versions - Latest release: about 2 years ago - 2 dependent repositories - 38 downloads last month - 17 stars on GitHub - 3 maintainers
bowline 0.2.2
Configurable tools to easily pre- and post-process your data for data science and machine learning.
5 versions - Latest release: about 2 years ago - 46 downloads last month - 2 stars on GitHub - 1 maintainer
advancedanalytics 1.3.0
Python support for 'The Art and Science of Data Analytics'
46 versions - Latest release: almost 5 years ago - 351 downloads last month - 7 stars on GitHub - 1 maintainer
Top 3.0% on pypi.org
tweet-preprocessor 0.6.0
Elegant tweet preprocessing
6 versions - Latest release: about 4 years ago - 11 dependent packages - 146 dependent repositories - 5.23 thousand downloads last month - 300 stars on GitHub - 1 maintainer
smashed 0.21.5
SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields ext...
64 versions - Latest release: 8 months ago - 2 dependent packages - 1 dependent repositories - 11.4 thousand downloads last month - 30 stars on GitHub - 2 maintainers
autodatap 1.5.2
Automating Data Preprocessing
33 versions - Latest release: 8 months ago - 193 downloads last month - 0 stars on GitHub - 1 maintainer
catprep 0.0.4
A preprocessing library for categorical variables
4 versions - Latest release: almost 8 years ago - 1 dependent repositories - 16 downloads last month - 0 stars on GitHub - 1 maintainer
prep-ml 0.1.1
Preprocessing for ML models made easy.
2 versions - Latest release: about 3 years ago - 1 dependent repositories - 15 downloads last month - 1 stars on GitHub - 1 maintainer
warfit-learn 0.2.1
A toolkit for reproducible research in warfarin dose estimation
3 versions - Latest release: over 3 years ago - 1 dependent repositories - 28 downloads last month - 10 stars on GitHub - 1 maintainer
ocm2 0.2.0
This python package extracts subdatasets from OCM-2 HDF file, georeference them and exports them ...
4 versions - Latest release: about 1 year ago - 26 downloads last month - 1 stars on GitHub - 1 maintainer
objectdetectiontools 1.3.4
A set of functions useful when doing object detection.
21 versions - Latest release: over 1 year ago - 79 downloads last month - 0 stars on GitLab.com - 1 maintainer
cube-helper 2.2.3
Cube Helper is a package to make equalisation, concatenation, and analysis of Iris cubes easier.
8 versions - Latest release: over 2 years ago - 1 dependent repositories - 41 downloads last month - 2 stars on GitHub - 2 maintainers
chunkyp 0.0.2
Ray-based preprocessing pipeline.
2 versions - Latest release: almost 4 years ago - 1 dependent repositories - 29 downloads last month - 0 stars on GitHub - 1 maintainer
featureforge 0.1.6
A library to build and test machine learning features
7 versions - Latest release: almost 9 years ago - 5 dependent repositories - 36 downloads last month - 382 stars on GitHub - 2 maintainers
cleanflow 1.3.3a1
A framework for cleaning, pre-processing and exploring data in a scalable and distributed manner.
11 versions - Latest release: about 6 years ago - 26 downloads last month - 1 stars on GitHub - 1 maintainer
keras-aug 0.5.8
A library that includes pure TF/Keras preprocessing and augmentation layers
9 versions - Latest release: 7 months ago - 54 downloads last month - 8 stars on GitHub - 1 maintainer
aicademecv 0.1
Image processing library with a specific focus on data preparation for skin based deep learning p...
1 version - Latest release: almost 5 years ago - 19 downloads last month - 0 stars on GitHub - 1 maintainer
nltp 0.1.0
Simple automated text preprocessor
1 version - Latest release: almost 4 years ago - 1 dependent repositories - 118 downloads last month - 0 stars on GitHub - 1 maintainer
preprocessingninja 0.0.1
A data preprocessing helper that covers your basic preprocessing needs
1 version - Latest release: over 2 years ago - 1 dependent repositories - 10 downloads last month - 1 stars on GitHub - 1 maintainer
utp 0.2
helper functions for typical python problems
1 version - Latest release: over 3 years ago - 1 dependent repositories - 17 downloads last month - 0 stars on GitHub - 1 maintainer
duplipy 0.2.0
A package for formatting and text replication, with added support for image augmentation.
11 versions - Latest release: 6 months ago - 81 downloads last month - 0 stars on GitHub - 1 maintainer
tdprepview 1.4.1
Python Package that creates Data Preparation Pipeline in Teradata-SQL in Views
16 versions - Latest release: about 1 month ago - 107 downloads last month - 1 maintainer
mordineznlp 0.1.0
Powerful Python tool for modern NLP processing
34 versions - Latest release: about 2 years ago - 1 dependent repositories - 89 downloads last month - 2 stars on GitHub - 1 maintainer
findanywhere
1 version
sktutor 0.2.3
sktutor helps your machines learn.
23 versions - Latest release: almost 2 years ago - 1 dependent repositories - 2.66 thousand downloads last month - 2 stars on GitHub - 1 maintainer
masksemi 0.0.1
Code for converting a list of labels into scikit-like semi-supervised labels.
1 version - Latest release: about 3 years ago - 1 dependent repositories - 8 downloads last month - 1 stars on GitHub - 1 maintainer
mercury-imgpprcs 0.0.1
Mercury: Image Pre-processing Open Source API for Artificial Intelligence
1 version - Latest release: over 3 years ago - 1 dependent repositories - 7 downloads last month - 0 stars on GitHub - 1 maintainer
chariot 0.5.6
Deliver the ready-to-train data to your NLP model.
19 versions - Latest release: over 4 years ago - 1 dependent repositories - 119 downloads last month - 122 stars on GitHub - 1 maintainer
sparklanes 0.2.4
A lightweight framework to build and execute data processing pipelines in pyspark (Apache Spark's...
5 versions - Latest release: over 5 years ago - 1 dependent repositories - 27 downloads last month - 16 stars on GitHub - 1 maintainer
motion-learning-toolbox 1.0.6
Python library for preprocessing of XR motion tracking data for machine learning applications.
5 versions - Latest release: about 1 month ago - 47 downloads last month - 5 stars on GitHub - 2 maintainers
csv2mne 0.0.1
Data formatter
2 versions - Latest release: over 1 year ago - 12 downloads last month - 1 maintainer
demv 1.0.2
Debiaser for Multiple Variables(DEMV) is a pre-processing algorithm for binary and multi-class da...
3 versions - Latest release: 3 months ago - 21 downloads last month - 0 stars on GitHub - 1 maintainer
logprep 11.3.0
Logprep allows you to collect, process and forward log messages from various data sources.
36 versions - Latest release: 19 days ago - 408 downloads last month - 24 stars on GitHub - 3 maintainers
jionlp-py39 1.3.45
Chinese NLP Preprocessing & Parsing
2 versions - Latest release: over 2 years ago - 1 dependent repositories - 17 downloads last month - 3,015 stars on GitHub - 1 maintainer
Top 3.6% on pypi.org
jionlp 1.5.11
Chinese NLP Preprocessing & Parsing
31 versions - Latest release: about 1 month ago - 7 dependent packages - 6 dependent repositories - 2.99 thousand downloads last month - 3,015 stars on GitHub - 1 maintainer
Top 8.3% on pypi.org
multi-imbalance 0.0.14
Python package for tackling multiclass imbalance problems.
14 versions - Latest release: about 3 years ago - 4 dependent repositories - 1.61 thousand downloads last month - 74 stars on GitHub - 4 maintainers
bio-volumentations 1.2.0
Library for 3D-5D augmentations of volumetric multi-dimensional biomedical images and their annot...
7 versions - Latest release: 27 days ago - 147 downloads last month - 1,971 stars on GitHub - 2 maintainers
openav 1.0.0a11
OpenAV
8 versions - Latest release: 21 days ago - 737 downloads last month - 3 stars on GitHub - 1 maintainer
take-text-preprocess 0.0.5
Text Preprocessor
10 versions - Latest release: over 2 years ago - 4 dependent packages - 1 dependent repositories - 193 downloads last month - 1 maintainer
nlcodec 0.4.0
nlcodec is a collection of encoding schemes for natural language sequences. nlcodec.db is an effi...
10 versions - Latest release: almost 3 years ago - 1 dependent package - 2 dependent repositories - 94 downloads last month - 5 stars on GitHub - 1 maintainer
arm-preprocessing 0.2.2
Implementation of several preprocessing techniques for Association Rule Mining (ARM)
5 versions - Latest release: 2 months ago - 30 downloads last month - 2 stars on GitHub - 2 maintainers
qd 0.8.9
QD-Engineering Python Library for CAE
32 versions - Latest release: over 4 years ago - 1 dependent repositories - 522 downloads last month - 1 maintainer
Top 5.4% on pypi.org
courlan 1.1.0
Clean, filter and sample URLs to optimize data collection – includes spam, content type and langu...
27 versions - Latest release: about 1 month ago - 8 dependent packages - 31 dependent repositories - 484 thousand downloads last month - 69 stars on GitHub - 1 maintainer
cmip6-preprocessing 0.6.0
Analysis ready CMIP6 data the easy way
11 versions - Latest release: almost 2 years ago - 1 dependent package - 118 downloads last month - 187 stars on GitHub - 1 maintainer
nlp-preprocessing-qvm9 0.0.4
Fix the import error
2 versions - Latest release: over 1 year ago - 16 downloads last month - 1 maintainer
arrowtextclassifier 1.0.0
ArrowTextClassifier is a simple text classification tool written in pytorch that allows you to tr...
4 versions - Latest release: about 1 month ago - 525 downloads last month - 1 maintainer
arac 0.0.1
Data Processing is used for data processing through MinIO, databases, Web APIs, etc.
1 version - Latest release: 3 months ago - 19 downloads last month - 1 maintainer
designer2 2.0.7
designerV2
35 versions - Latest release: 2 months ago - 166 downloads last month - 11 stars on GitHub - 3 maintainers
lughaatnlp 1.0.5
A Python package for natural language processing tasks for the Urdu language, including normaliza...
5 versions - Latest release: about 2 months ago - 175 downloads last month - 0 stars on GitHub - 1 maintainer
arctix 0.0.5
A library to get a text summary of nested objects
12 versions - Latest release: 20 days ago - 6.38 thousand downloads last month - 0 stars on GitHub - 1 maintainer
ecg-qc 1.0b6
A package to determine whether ECG signal quality is optimal or noisy
6 versions - Latest release: over 2 years ago - 3 dependent repositories - 71 downloads last month - 36 stars on GitHub - 1 maintainer
Top 5.5% on pypi.org
nonechucks 0.4.2
nonechucks is a library that provides wrappers for PyTorch's datasets, samplers, and transforms t...
18 versions - Latest release: almost 3 years ago - 26 dependent repositories - 898 downloads last month - 373 stars on GitHub - 1 maintainer
sk-transformers 0.11.0
A collection of various pandas & scikit-learn compatible transformers for all kinds of preprocess...
25 versions - Latest release: about 1 year ago - 1 dependent repositories - 298 downloads last month - 8 stars on GitHub - 1 maintainer
alkymi 0.3.1
alkymi - Pythonic task automation
10 versions - Latest release: 17 days ago - 1 dependent repositories - 266 downloads last month - 44 stars on GitHub - 1 maintainer
pynmranalysis 1.1.3
python library for NMR preprocessing and analysis
9 versions - Latest release: almost 3 years ago - 1 dependent repositories - 70 downloads last month - 3 stars on GitHub - 1 maintainer
dmriprep 0.5.0
dMRIPrep is a robust and easy-to-use pipeline for preprocessing of diverse dMRI data.
8 versions - Latest release: about 3 years ago - 1 dependent repositories - 100 downloads last month - 62 stars on GitHub - 2 maintainers
unstructured-api-tools 0.10.11
A library that prepares raw documents for downstream ML tasks.
33 versions - Latest release: 10 months ago - 2 dependent repositories - 256 downloads last month - 28 stars on GitHub - 1 maintainer
ab-data-processing 0.0.1
Data Processing is used for data processing through MinIO, databases, Web APIs, etc.
1 version - Latest release: 4 months ago - 26 downloads last month - 43 stars on GitHub - 1 maintainer
one-data-processing 0.0.14
Data Processing is used for data processing through MinIO, databases, Web APIs, etc.
14 versions - Latest release: 4 months ago - 104 downloads last month - 43 stars on GitHub - 1 maintainer
kubeagi 0.0.4
Data Processing is used for data processing through MinIO, databases, Web APIs, etc.
4 versions - Latest release: 3 months ago - 45 downloads last month - 3 stars on GitHub - 1 maintainer
a-data-processing 0.0.1
A library that prepares raw documents for downstream ML tasks.
1 version - Latest release: 4 months ago - 29 downloads last month - 43 stars on GitHub - 1 maintainer
vflow 0.1.4
A framework for doing stability analysis with PCS.
7 versions - Latest release: 4 months ago - 1 dependent repositories - 67 downloads last month - 60 stars on GitHub - 2 maintainers
py-autocleanre 1.1.4
AutoClean - Python Package for Automated Preprocessing & Cleaning of Datasets
1 version - Latest release: about 1 year ago - 22 downloads last month - 219 stars on GitHub - 1 maintainer
Top 3.1% on pypi.org
seqio 0.0.19
SeqIO: Task-based datasets, preprocessing, and evaluation for sequence models.
21 versions - Latest release: 4 months ago - 6 dependent packages - 137 dependent repositories - 2.55 million downloads last month - 526 stars on GitHub - 2 maintainers
pytough 1.6.2
Python scripting library for TOUGH2 simulation
3 versions - Latest release: about 2 months ago - 1.44 thousand downloads last month - 90 stars on GitHub - 1 maintainer
Top 9.5% on pypi.org
py-autoclean 1.1.3
AutoClean - Python Package for Automated Preprocessing & Cleaning of Datasets
21 versions - Latest release: almost 2 years ago - 1 dependent package - 1 dependent repositories - 429 downloads last month - 218 stars on GitHub - 1 maintainer
Top 4.2% on pypi.org
nnaudio 0.3.3
A fast GPU audio processing toolbox with 1D convolutional neural network
35 versions - Latest release: 4 months ago - 4 dependent packages - 5 dependent repositories - 10.3 thousand downloads last month - 945 stars on GitHub - 1 maintainer
niaarm 0.3.9
A minimalistic framework for numerical association rule mining
21 versions - Latest release: about 2 months ago - 2 dependent packages - 1 dependent repositories - 317 downloads last month - 14 stars on GitHub - 1 maintainer
Top 3.8% on pypi.org
autoreject 0.4.3
Automated rejection and repair of epochs in M/EEG.
10 versions - Latest release: 7 months ago - 4 dependent packages - 41 dependent repositories - 5.59 thousand downloads last month - 134 stars on GitHub - 3 maintainers
ctxpro 0.0.5
Simple toolkit that extracts ambiguities in documents that require context to resolve.
5 versions - Latest release: 4 months ago - 21 downloads last month - 0 stars on GitHub - 1 maintainer
adcl 0.1.7
Data preprocessing and cleaning tools for data science projects
8 versions - Latest release: about 1 month ago - 139 downloads last month - 0 stars on GitHub - 1 maintainer
massivetools 0.0.1 removed
Useful packages to perform computer vision tasks
1 version - Latest release: 2 months ago - 1 maintainer
computer-vision-utils 0.0.1
Useful packages to perform computer vision tasks
1 version - Latest release: 2 months ago - 42 downloads last month - 1 maintainer
mngdataclean 0.4.2
Text preprocessing package
5 versions - Latest release: 3 months ago - 28 downloads last month - 0 stars on GitHub - 1 maintainer
sleepeegpy 0.5.1
Sleep EEG preprocessing, analysis and visualization
2 versions - Latest release: 6 months ago - 25 downloads last month - 11 stars on GitHub - 1 maintainer
kditransform 0.2.0
Kernel density integral transformation
2 versions - Latest release: 6 months ago - 156 downloads last month - 4 stars on GitHub - 1 maintainer
langchain-addons 0.0.2
...
3 versions - Latest release: 12 months ago - 163 downloads last month - 1 maintainer
datadoctor 1.0.15
A Python package for data cleaning and preprocessing.
14 versions - Latest release: 12 months ago - 50 downloads last month - 2 stars on GitHub - 1 maintainer
sleepeeg 0.4.1
Sleep EEG preprocessing and analysis
13 versions - Latest release: 9 months ago - 92 downloads last month - 10 stars on GitHub - 1 maintainer
scalde-data-factory 0.0.1
Data preparation tools for data science projects
1 version - Latest release: about 1 year ago - 7 downloads last month - 1 maintainer
hot-fair-utilities 1.2.3 💰
Utilities for AI - Assisted Mapping fAIr
22 versions - Latest release: 7 months ago - 125 downloads last month - 11 stars on GitHub - 2 maintainers
hotlib 1.0.33
Utilities for an AI-assisted mapping tool developed for HOT.
33 versions - Latest release: over 1 year ago - 1 dependent repositories - 226 downloads last month - 1 maintainer
wrainfo 0.9.5
Software to process FURUNO weather radar data.
8 versions - Latest release: 11 months ago - 30 downloads last month - 0 stars on GitHub - 1 maintainer
drpt 0.8.2
Tool for preparing a dataset for publishing by dropping, renaming, scaling, and obfuscating colum...
18 versions - Latest release: over 1 year ago - 147 downloads last month - 0 stars on GitHub - 1 maintainer
bids-derivatives 0.0.1
Python package for querying BIDS Apps' processed derivatives.
2 versions - Latest release: almost 2 years ago - 25 downloads last month - 2 stars on GitHub - 1 maintainer
python-standardize-it 1.0.2
Tool for standardizing strings against a known set of standards
3 versions - Latest release: almost 2 years ago - 29 downloads last month - 1 stars on GitHub - 1 maintainer
weboa 0.6.8 removed
weboa is a cli tool to create templates for websites and preprocessors
18 versions - Latest release: over 2 years ago - 1 dependent repositories - 68 downloads last month - 0 stars on GitHub - 1 maintainer
templatext 0.0.2
Text preprocessing template for NLP.
2 versions - Latest release: over 3 years ago - 1 dependent repositories - 8 downloads last month - 0 stars on GitHub - 1 maintainer
sparkora 0.0.1
Exploratory data analysis toolkit for Pyspark
1 version - Latest release: over 2 years ago - 1 dependent repositories - 16 downloads last month - 53 stars on GitHub - 1 maintainer
smart-data-tools 0.3.1
This library contains adjusted tools for data preprocessing and working with mixed data types.
2 versions - Latest release: over 3 years ago - 1 dependent repositories - 7 downloads last month - 18 stars on GitHub - 1 maintainer
silk-ml 0.1.1
Simple Intelligent Learning Kit (SILK) for Machine learning
5 versions - Latest release: over 4 years ago - 1 dependent repositories - 35 downloads last month - 3 stars on GitHub - 1 maintainer
seqtools 1.4.1
A library for transparent transformation of indexable containers (lists, etc.)
13 versions - Latest release: about 2 months ago - 2 dependent repositories - 702 downloads last month - 46 stars on GitHub - 1 maintainer
seq-qc 2.0.4
utilities for performing various preprocessing steps on sequencing reads
10 versions - Latest release: over 6 years ago - 2 dependent repositories - 41 downloads last month - 0 stars on GitHub - 1 maintainer
Top 6.6% on pypi.org
ryd 0.9.2
Ruamel Yaml Doc preprocessor (pronounced: /rɑɪt/, like the verb "write")
20 versions - Latest release: 6 months ago - 1 dependent package - 5 dependent repositories - 252 downloads last month - 1 maintainer
recipipe 0.0.5
Improved pipelines for data science projects.
6 versions - Latest release: about 4 years ago - 1 dependent repositories - 47 downloads last month - 4 stars on GitHub - 1 maintainer
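
The stats line under each entry above follows a regular " - "-separated pattern (version count, latest release, dependents, downloads, stars, maintainers). As a rough illustration only, the sketch below parses one such line into a dictionary. The function names and dictionary keys are hypothetical choices made for this example; they are not part of the Ecosyste.ms API, and the parser only handles the field shapes visible in this listing.

```python
import re

# Word-based magnitudes that appear in the listing ("7.98 thousand", "2.55 million").
_SCALE = {"thousand": 1_000, "million": 1_000_000}

# (label prefix as seen in the listing, hypothetical output key).
_LABELS = [
    ("version", "versions"),
    ("dependent package", "dependent_packages"),
    ("dependent repositor", "dependent_repositories"),
    ("downloads last month", "downloads_last_month"),
    ("stars on", "stars"),
    ("maintainer", "maintainers"),
]

# A count (optionally followed by "thousand"/"million"), then the label text.
_FIELD = re.compile(r"^([\d.,]+(?: (?:thousand|million))?) (.+)$")


def parse_count(text):
    """Convert '7.98 thousand', '3,015' or '18' into an integer."""
    parts = text.replace(",", "").split()
    value = float(parts[0])
    if len(parts) > 1:
        value *= _SCALE[parts[1]]
    return round(value)


def parse_stats(line):
    """Parse one stats line into a dict; unrecognized fields are skipped."""
    stats = {}
    for field in line.split(" - "):
        field = field.strip()
        if field.startswith("Latest release:"):
            # Keep the relative date as text, e.g. "8 months ago".
            stats["latest_release"] = field.split(":", 1)[1].strip()
            continue
        match = _FIELD.match(field)
        if not match:
            continue
        count, label = match.groups()
        for prefix, key in _LABELS:
            if label.startswith(prefix):
                stats[key] = parse_count(count)
                break
    return stats


if __name__ == "__main__":
    example = (
        "18 versions - Latest release: 8 months ago - 1 dependent package - "
        "4 dependent repositories - 7.98 thousand downloads last month - "
        "395 stars on GitHub - 1 maintainer"
    )
    print(parse_stats(example))
```

Rounded figures such as "7.98 thousand" only recover an approximate count, so values parsed this way should be treated as estimates rather than exact download numbers.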