Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "preprocessing" keyword

toughio 1.14.2
Pre- and post-processing Python library for TOUGH
63 versions - Latest release: 24 days ago - 1 dependent repositories - 1.06 thousand downloads last month - 52 stars on GitHub - 1 maintainer
Top 4.0% on pypi.org
autots 0.6.14
Automated Time Series Forecasting
61 versions - Latest release: 2 days ago - 1 dependent package - 12 dependent repositories - 34.4 thousand downloads last month - 1,024 stars on GitHub - 1 maintainer
Top 9.7% on pypi.org
xmip 0.7.2
Analysis ready CMIP6 data the easy way
4 versions - Latest release: 6 months ago - 1 dependent package - 1 dependent repositories - 860 downloads last month - 176 stars on GitHub - 1 maintainer
ecg-qc 1.0b6
a package to compute if ECG signal quality is optimal or noisy
6 versions - Latest release: over 2 years ago - 3 dependent repositories - 71 downloads last month - 36 stars on GitHub - 1 maintainer
Top 5.5% on pypi.org
nonechucks 0.4.2
nonechucks is a library that provides wrappers for PyTorch's datasets, samplers, and transforms t...
18 versions - Latest release: almost 3 years ago - 26 dependent repositories - 898 downloads last month - 373 stars on GitHub - 1 maintainer
Top 3.2% on pypi.org
seqio-nightly 0.0.18.dev20240518
SeqIO: Task-based datasets, preprocessing, and evaluation for sequence models.
911 versions - Latest release: 1 day ago - 3 dependent packages - 10 dependent repositories - 441 thousand downloads last month - 526 stars on GitHub - 1 maintainer
lughaatnlp 1.0.5
A Python package for natural language processing tasks for the Urdu language, including normaliza...
5 versions - Latest release: about 1 month ago - 82 downloads last month - 0 stars on GitHub - 1 maintainer
sk-transformers 0.11.0
A collection of various pandas & scikit-learn compatible transformers for all kinds of preprocess...
25 versions - Latest release: about 1 year ago - 1 dependent repositories - 298 downloads last month - 8 stars on GitHub - 1 maintainer
bio-volumentations 1.2.0
Library for 3D-5D augmentations of volumetric multi-dimensional biomedical images and their annot...
7 versions - Latest release: 13 days ago - 163 downloads last month - 1,968 stars on GitHub - 2 maintainers
Top 1.5% on pypi.org
unstructured 0.14.0
A library that prepares raw documents for downstream ML tasks.
133 versions - Latest release: 2 days ago - 113 dependent packages - 3,374 dependent repositories - 1.13 million downloads last month - 4,064 stars on GitHub - 1 maintainer
alkymi 0.3.1
alkymi - Pythonic task automation
10 versions - Latest release: 3 days ago - 1 dependent repositories - 266 downloads last month - 44 stars on GitHub - 1 maintainer
logprep 11.3.0
Logprep allows to collect, process and forward log messages from various data sources.
36 versions - Latest release: 5 days ago - 500 downloads last month - 24 stars on GitHub - 3 maintainers
pynmranalysis 1.1.3
python library for NMR preprocessing and analysis
9 versions - Latest release: over 2 years ago - 1 dependent repositories - 70 downloads last month - 3 stars on GitHub - 1 maintainer
Top 3.6% on pypi.org
jionlp 1.5.11
Chinese NLP Preprocessing & Parsing
30 versions - Latest release: 24 days ago - 7 dependent packages - 6 dependent repositories - 3.21 thousand downloads last month - 2,995 stars on GitHub - 1 maintainer
jionlp-py39 1.3.45
Chinese NLPreprocessing & Parsing
2 versions - Latest release: over 2 years ago - 1 dependent repositories - 24 downloads last month - 2,986 stars on GitHub - 1 maintainer
dmriprep 0.5.0
dMRIPrep is a robust and easy-to-use pipeline for preprocessing of diverse dMRI data.
8 versions - Latest release: about 3 years ago - 1 dependent repositories - 100 downloads last month - 62 stars on GitHub - 2 maintainers
unstructured-api-tools 0.10.11
A library that prepares raw documents for downstream ML tasks.
33 versions - Latest release: 9 months ago - 2 dependent repositories - 256 downloads last month - 28 stars on GitHub - 1 maintainer
indic-num2words 1.2.1
Package to convert numbers to words with support of multiple indian languages.
4 versions - Latest release: about 1 month ago - 1 dependent package - 1 dependent repositories - 13.6 thousand downloads last month - 31 stars on GitHub - 1 maintainer
ab-data-processing 0.0.1
Data Processing is used for data processing through MinIO, databases, Web APIs, etc.
1 version - Latest release: 4 months ago - 26 downloads last month - 43 stars on GitHub - 1 maintainer
one-data-processing 0.0.14
Data Processing is used for data processing through MinIO, databases, Web APIs, etc.
14 versions - Latest release: 4 months ago - 104 downloads last month - 43 stars on GitHub - 1 maintainer
kubeagi 0.0.4
Data Processing is used for data processing through MinIO, databases, Web APIs, etc.
4 versions - Latest release: 3 months ago - 45 downloads last month - 3 stars on GitHub - 1 maintainer
a-data-processing 0.0.1
A library that prepares raw documents for downstream ML tasks.
1 version - Latest release: 4 months ago - 29 downloads last month - 43 stars on GitHub - 1 maintainer
vflow 0.1.4
A framework for doing stability analysis with PCS.
7 versions - Latest release: 3 months ago - 1 dependent repositories - 67 downloads last month - 60 stars on GitHub - 2 maintainers
py-autocleanre 1.1.4
AutoClean - Python Package for Automated Preprocessing & Cleaning of Datasets
1 version - Latest release: about 1 year ago - 22 downloads last month - 219 stars on GitHub - 1 maintainer
Top 6.3% on pypi.org
unstructured-inference 0.7.31
A library for performing inference using trained models.
84 versions - Latest release: 12 days ago - 8 dependent packages - 16 dependent repositories - 299 thousand downloads last month - 112 stars on GitHub - 1 maintainer
tmtoolkit 0.12.0
Text Mining and Topic Modeling Toolkit
35 versions - Latest release: about 1 year ago - 2 dependent packages - 10 dependent repositories - 2.96 thousand downloads last month - 12 stars on GitHub - 2 maintainers
Top 3.1% on pypi.org
seqio 0.0.19
SeqIO: Task-based datasets, preprocessing, and evaluation for sequence models.
21 versions - Latest release: 4 months ago - 6 dependent packages - 137 dependent repositories - 2.55 million downloads last month - 526 stars on GitHub - 2 maintainers
pytough 1.6.2
Python scripting library for TOUGH2 simulation
3 versions - Latest release: about 2 months ago - 1.44 thousand downloads last month - 90 stars on GitHub - 1 maintainer
Top 9.5% on pypi.org
py-autoclean 1.1.3
AutoClean - Python Package for Automated Preprocessing & Cleaning of Datasets
21 versions - Latest release: over 1 year ago - 1 dependent package - 1 dependent repositories - 429 downloads last month - 218 stars on GitHub - 1 maintainer
Top 4.2% on pypi.org
nnaudio 0.3.3
A fast GPU audio processing toolbox with 1D convolutional neural network
35 versions - Latest release: 3 months ago - 4 dependent packages - 5 dependent repositories - 10.3 thousand downloads last month - 945 stars on GitHub - 1 maintainer
niaarm 0.3.9
A minimalistic framework for numerical association rule mining
21 versions - Latest release: about 1 month ago - 2 dependent packages - 1 dependent repositories - 317 downloads last month - 14 stars on GitHub - 1 maintainer
Top 2.6% on pypi.org
jaconv 0.3.4 💰
Pure-Python Japanese character interconverter for Hiragana, Katakana, Hankaku, Zenkaku and more
11 versions - Latest release: about 1 year ago - 25 dependent packages - 198 dependent repositories - 1.38 million downloads last month - 286 stars on GitHub - 1 maintainer
Top 5.4% on pypi.org
courlan 1.1.0
Clean, filter and sample URLs to optimize data collection – includes spam, content type and langu...
27 versions - Latest release: 19 days ago - 8 dependent packages - 31 dependent repositories - 475 thousand downloads last month - 65 stars on GitHub - 1 maintainer
cmip6-preprocessing 0.6.0
Analysis ready CMIP6 data the easy way
11 versions - Latest release: almost 2 years ago - 1 dependent package - 197 downloads last month - 183 stars on GitHub - 1 maintainer
Top 3.8% on pypi.org
autoreject 0.4.3
Automated rejection and repair of epochs in M/EEG.
10 versions - Latest release: 6 months ago - 4 dependent packages - 41 dependent repositories - 5.59 thousand downloads last month - 134 stars on GitHub - 3 maintainers
ctxpro 0.0.5
Simple toolkit that extracts ambiguities in documents that require context to resolve.
5 versions - Latest release: 3 months ago - 21 downloads last month - 0 stars on GitHub - 1 maintainer
arctix 0.0.5
A library to get a text summary of nested objects
11 versions - Latest release: 6 days ago - 4.74 thousand downloads last month - 0 stars on GitHub - 1 maintainer
adcl 0.1.7
Data preprocessing and cleaning tools for data science projects
8 versions - Latest release: 17 days ago - 139 downloads last month - 0 stars on GitHub - 1 maintainer
massivetools 0.0.1 removed
Useful packages to perform computer vision tasks
1 version - Latest release: about 2 months ago - 1 maintainer
computer-vision-utils 0.0.1
Useful packages to perform computer vision tasks
1 version - Latest release: about 2 months ago - 42 downloads last month - 1 maintainer
kdp 1.5.2
Data Preprocessing model based on Keras preprocessing layers
5 versions - Latest release: 19 days ago - 242 downloads last month - 1 stars on GitHub - 1 maintainer
mngdataclean 0.4.2
Text preprocessing package
5 versions - Latest release: 3 months ago - 28 downloads last month - 0 stars on GitHub - 1 maintainer
arac 0.0.1
Data Processing is used for data processing through MinIO, databases, Web APIs, etc.
1 version - Latest release: 3 months ago - 11 downloads last month - 1 maintainer
arm-preprocessing 0.2.2
Implementation of several preprocessing techniques for Association Rule Mining (ARM)
5 versions - Latest release: about 2 months ago - 43 downloads last month - 2 stars on GitHub - 2 maintainers
sleepeegpy 0.5.1
Sleep EEG preprocessing, analysis and visualization
2 versions - Latest release: 5 months ago - 25 downloads last month - 11 stars on GitHub - 1 maintainer
openav 1.0.0a11
OpenAV
8 versions - Latest release: 7 days ago - 784 downloads last month - 3 stars on GitHub - 1 maintainer
autodatap 1.5.2
Automating Data Preprocessing
33 versions - Latest release: 8 months ago - 204 downloads last month - 0 stars on GitHub - 1 maintainer
motion-learning-toolbox 1.0.6
Python library for preprocessing of XR motion tracking data for machine learning applications.
5 versions - Latest release: 26 days ago - 30 downloads last month - 5 stars on GitHub - 2 maintainers
skloverlay 1.2.0
SKLearn Classification Interface
5 versions - Latest release: 8 months ago - 37 downloads last month - 0 stars on GitHub - 1 maintainer
kditransform 0.2.0
Kernel density integral transformation
2 versions - Latest release: 6 months ago - 156 downloads last month - 4 stars on GitHub - 1 maintainer
demv 1.0.2
Debiaser for Multiple Variables(DEMV) is a pre-processing algorithm for binary and multi-class da...
3 versions - Latest release: 3 months ago - 19 downloads last month - 0 stars on GitHub - 1 maintainer
langchain-addons 0.0.2
...
3 versions - Latest release: 11 months ago - 163 downloads last month - 1 maintainer
wiz-craft 1.1.1
A CLI-based dataset preprocessing tool for machine learning tasks. Features include data explorat...
6 versions - Latest release: 7 months ago - 18.3 thousand downloads last month - 18 stars on GitHub - 1 maintainer
duplipy 0.2.0
A package for formatting and text replication, with added support for image augmentation.
11 versions - Latest release: 5 months ago - 55 downloads last month - 0 stars on GitHub - 1 maintainer
cv2filters 0.2.6 💰
Unleash the power of OpenCV with CV2Filters, the ultimate image processing wrapper for all skill ...
9 versions - Latest release: 7 months ago - 176 downloads last month - 3 stars on GitHub - 1 maintainer
designer2 2.0.7
designerV2
35 versions - Latest release: about 2 months ago - 442 downloads last month - 9 stars on GitHub - 3 maintainers
datadoctor 1.0.15
A Python package for data cleaning and preprocessing.
14 versions - Latest release: 12 months ago - 50 downloads last month - 2 stars on GitHub - 1 maintainer
wrapenv 0.1.4
A wrapper for callable functions with environments to register pre- and post-processing functions...
3 versions - Latest release: 12 months ago - 35 downloads last month - 0 stars on GitHub - 1 maintainer
keras-aug 0.5.8
A library that includes pure TF/Keras preprocessing and augmentation layers
9 versions - Latest release: 6 months ago - 69 downloads last month - 8 stars on GitHub - 1 maintainer
genbiox 0.3.1
A Comprehensive Bioinformatics Package for Genome Analysis
4 versions - Latest release: about 1 year ago - 20 downloads last month - 0 stars on GitHub - 1 maintainer
sleepeeg 0.4.1
Sleep EEG preprocessing and analysis
13 versions - Latest release: 8 months ago - 92 downloads last month - 10 stars on GitHub - 1 maintainer
scalde-data-factory 0.0.1
Data preparations tools for data science projects
1 version - Latest release: about 1 year ago - 7 downloads last month - 1 maintainer
oc-preprocessing 0.0.5
This package is meant to preprocess OpenCitations source dumps so to make them easily usable in O...
2 versions - Latest release: about 1 year ago - 17 downloads last month - 0 stars on GitHub - 1 maintainer
patatas 0.1.1
A powerful package for K-NN regression, data preprocessing, and analysis for Data Science
2 versions - Latest release: about 1 year ago - 20 downloads last month - 1 maintainer
ocm2 0.2.0
This python package extracts subdatasets from OCM-2 HDF file, georeference them and exports them ...
4 versions - Latest release: about 1 year ago - 23 downloads last month - 1 stars on GitHub - 1 maintainer
lidirl 0.0.1
LID toolkit to improve performance on spontaneous noisy text with data augmentation.
1 version - Latest release: about 1 year ago - 25 downloads last month - 0 stars on GitHub - 1 maintainer
tdprepview 1.4.1
Python Package that creates Data Preparation Pipeline in Teradata-SQL in Views
16 versions - Latest release: 26 days ago - 275 downloads last month - 1 maintainer
hot-fair-utilities 1.2.3 💰
Utilities for AI - Assisted Mapping fAIr
22 versions - Latest release: 7 months ago - 125 downloads last month - 11 stars on GitHub - 2 maintainers
ipa-core 0.1.3
NLP Preprocessing Pipeline Wrappers
4 versions - Latest release: about 1 year ago - 27 downloads last month - 13 stars on GitHub - 1 maintainer
csv2mne 0.0.1
Data formater
2 versions - Latest release: over 1 year ago - 12 downloads last month - 1 maintainer
hotlib 1.0.33
Utilities for an AI-assisted mapping tool developed for HOT.
33 versions - Latest release: over 1 year ago - 1 dependent repositories - 226 downloads last month - 1 maintainer
wrainfo 0.9.5
Is a software to process FURUNO weather radar data.
8 versions - Latest release: 11 months ago - 30 downloads last month - 0 stars on GitHub - 1 maintainer
drpt 0.8.2
Tool for preparing a dataset for publishing by dropping, renaming, scaling, and obfuscating colum...
18 versions - Latest release: over 1 year ago - 147 downloads last month - 0 stars on GitHub - 1 maintainer
simple-preprocessing 0.0.5
A package that allows to build simple streams of video, audio and camera data.
5 versions - Latest release: over 1 year ago - 7 downloads last month - 1 maintainer
smashed 0.21.5
SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields ext...
64 versions - Latest release: 8 months ago - 2 dependent packages - 1 dependent repositories - 11.5 thousand downloads last month - 30 stars on GitHub - 2 maintainers
bids-derivatives 0.0.1
Python package for querying BIDS Apps` processed derivatives.
2 versions - Latest release: almost 2 years ago - 25 downloads last month - 2 stars on GitHub - 1 maintainer
python-standardize-it 1.0.2
Tool for standardizing strings against a known set of standards
3 versions - Latest release: almost 2 years ago - 29 downloads last month - 1 stars on GitHub - 1 maintainer
fic 0.5.1
Fast Image Compression
11 versions - Latest release: about 2 years ago - 1 dependent repositories - 69 downloads last month - 1 stars on GitHub - 1 maintainer
paderborn-bearing 1.0.0
Preprocessed Paderborn Bearing Dataset for analysing multivariate motor current signals combined ...
4 versions - Latest release: almost 2 years ago - 1 dependent repositories - 29 downloads last month - 6 stars on GitHub - 1 maintainer
xtlearn 0.0.25
This is a package with classes to be used in sklearn pipelines with pandas dataframes
25 versions - Latest release: almost 2 years ago - 1 dependent repositories - 221 downloads last month - 0 stars on GitHub - 1 maintainer
weboa 0.6.8 removed
weboa is a cli tool to create templates for websites and preprocessors
18 versions - Latest release: over 2 years ago - 1 dependent repositories - 68 downloads last month - 0 stars on GitHub - 1 maintainer
warfit-learn 0.2.1
A toolkit for reproducible research in warfarin dose estimation
3 versions - Latest release: over 3 years ago - 1 dependent repositories - 47 downloads last month - 10 stars on GitHub - 1 maintainer
utp 0.2
helper functions for typical python problems
1 version - Latest release: over 3 years ago - 1 dependent repositories - 18 downloads last month - 0 stars on GitHub - 1 maintainer
tweet-nlp-toolkit 1.0.5
NLP toolkit for tweets
6 versions - Latest release: about 1 year ago - 1 dependent repositories - 72 downloads last month - 2 stars on GitHub - 1 maintainer
turkish-twitter-preprocess 0.0.7
a light-weight python package to pre-process turkish twitter statuses(tweets).
7 versions - Latest release: over 3 years ago - 1 dependent repositories - 68 downloads last month - 6 stars on GitHub - 1 maintainer
toxine 1.0.52
Tiny preprocessor for Russian text
48 versions - Latest release: over 2 years ago - 1 dependent repositories - 323 downloads last month - 5 stars on GitHub - 1 maintainer
textprocess 0.1
A python module for data(text) pre-processing
1 version - Latest release: over 10 years ago - 3 dependent repositories - 14 downloads last month - 12 stars on GitHub - 1 maintainer
text-ppf 1.0.0
Text pre-processing function for NLP
1 version - Latest release: over 2 years ago - 1 dependent repositories - 10 downloads last month - 0 stars on GitHub - 1 maintainer
text-normalizer 0.1.3
Yoctol Natural Language Text Normalizer
10 versions - Latest release: over 5 years ago - 3 dependent repositories - 322 downloads last month - 13 stars on GitHub - 2 maintainers
textdatasetcleaner 0.0.6
Pipeline for cleaning (preprocessing/normalizing) text datasets
4 versions - Latest release: over 3 years ago - 1 dependent repositories - 31 downloads last month - 38 stars on GitHub - 1 maintainer
templatext 0.0.2
Text preprocessing template for NLP.
2 versions - Latest release: over 3 years ago - 1 dependent repositories - 8 downloads last month - 0 stars on GitHub - 1 maintainer
take-text-preprocess 0.0.5
Text Preprocesser
10 versions - Latest release: over 2 years ago - 4 dependent packages - 1 dependent repositories - 133 downloads last month - 1 maintainer
sparkora 0.0.1
Exploratory data analysis toolkit for Pyspark
1 version - Latest release: over 2 years ago - 1 dependent repositories - 16 downloads last month - 53 stars on GitHub - 1 maintainer
smart-data-tools 0.3.1
This library contains adjusted tools for data preprocessing and working with mixed data types.
2 versions - Latest release: over 3 years ago - 1 dependent repositories - 7 downloads last month - 18 stars on GitHub - 1 maintainer
simages 23.0.7
Find similar images in a dataset
17 versions - Latest release: 11 months ago - 1 dependent repositories - 129 downloads last month - 20 stars on GitHub - 1 maintainer
silk-ml 0.1.1
Simple Intelligent Learning Kit (SILK) for Machine learning
5 versions - Latest release: over 4 years ago - 1 dependent repositories - 35 downloads last month - 3 stars on GitHub - 1 maintainer
seqtools 1.4.1
A library for transparent transformation of indexable containers (lists, etc.)
13 versions - Latest release: about 1 month ago - 2 dependent repositories - 702 downloads last month - 46 stars on GitHub - 1 maintainer
seq-qc 2.0.4
utilities for performing various preprocessing steps on sequencing reads
10 versions - Latest release: over 6 years ago - 2 dependent repositories - 41 downloads last month - 0 stars on GitHub - 1 maintainer
scalecol 0.1
Implemetation of several preprocessing steps as i wanted
1 version - Latest release: over 5 years ago - 1 dependent repositories - 4 downloads last month - 0 stars on GitHub - 1 maintainer
Top 6.6% on pypi.org
ryd 0.9.2
Ruamel Yaml Doc preprocessor (pronounced: /rɑɪt/, like the verb "write")
20 versions - Latest release: 6 months ago - 1 dependent package - 5 dependent repositories - 252 downloads last month - 1 maintainer