Ecosyste.ms: Packages
An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.
pypi.org "data-preparation" keyword
skrub 0.5.0
Prepping tables for machine learning
4 versions - Latest release: 5 months ago - 545 downloads last month - 1,018 stars on GitHub - 4 maintainers
fastai-category-encoders 0.0.4
Category encoders integrated with Fast.ai
4 versions - Latest release: over 3 years ago - 1 dependent repository - 55 downloads last month - 8 stars on GitHub - 1 maintainer
pyoptimus 0.1.0
Optimus is the missing framework for cleaning and pre-processing data in a distributed fashion.
32 versions - Latest release: over 1 year ago - 1 dependent repository - 349 downloads last month - 1,441 stars on GitHub - 2 maintainers
Top 9.4% on pypi.org
prosto 0.6.0
Data processing toolkit radically changing the way data is processed
5 versions - Latest release: over 2 years ago - 1 dependent repository - 64 downloads last month - 89 stars on GitHub - 1 maintainer
spltr 0.3.2
A simple PyTorch-based data loader and splitter
3 versions - Latest release: over 4 years ago - 1 dependent repository - 30 downloads last month - 1 star on GitHub - 1 maintainer
optimuspyspark 2.2.32
Optimus is the missing framework for cleaning and pre-processing data in a distributed fashion wi...
83 versions - Latest release: almost 4 years ago - 8 dependent repositories - 10.7 thousand downloads last month - 1,441 stars on GitHub - 2 maintainers
Top 4.8% on pypi.org
ml-express 0.1.3
A Python library for day-to-day data analysis and machine learning.
3 versions - Latest release: over 2 years ago - 1 dependent repository - 16 downloads last month - 3 stars on GitHub - 1 maintainer
machine-learning-data-pipeline 1.0.3
Pipeline module for parallel real-time data processing for machine learning models development an...
2 versions - Latest release: over 5 years ago - 1 dependent repository - 22 downloads last month - 22 stars on GitHub - 1 maintainer
dptools 0.4.2
Data Preprocessing Tools
20 versions - Latest release: about 2 years ago - 1 dependent repository - 44 downloads last month - 3 stars on GitHub - 1 maintainer
label-maker 0.9.1
Data preparation for satellite machine learning
18 versions - Latest release: over 3 years ago - 1 dependent repository - 103 downloads last month - 454 stars on GitHub - 3 maintainers
daxpy 0.2
A pre-machine-learning model package
1 version - Latest release: over 2 years ago - 1 dependent repository - 18 downloads last month - 0 stars on GitHub - 1 maintainer
Related Keywords
data-preprocessing (7)
data-science (7)
machine-learning (6)
python (4)
data-analysis (4)
data-wrangling (4)
spark (3)
data-processing (3)
feature-engineering (3)
deep-learning (3)
data-cleaning (3)
data (3)
pyspark (2)
data-transformation (2)
data-extraction (2)
data-exploration (2)
data-cleaner (2)
dask-cudf (2)
dask (2)
cudf (2)
bigdata (2)
big-data-cleaning (2)
data-profiling (2)
data-cleansing (2)
datacleaner (2)
pandas (2)
pytorch (2)
apachespark (1)
train-test-validation (1)
train-test-split (1)
train-split-pytorch (1)
splitter (1)
pytorch-dataset-split (1)
pytorch-dataloader-objects (1)
neural-networks (1)
easy-to-use (1)
easy-split (1)
easy-data-split (1)
satellite-imagery (1)
dataset (1)
data-split-pytorch (1)
data-cleaning-pipeline (1)
remote-sensing (1)
keras (1)
data-summarization (1)
eda (1)
pandas-profiling (1)
visualization (1)
algorithms (1)
computing (1)
data-pipeline (1)
computer-vision (1)
aggregation (1)
parallel (1)
natural-language-processing (1)
dirty-data (1)
encoding (1)
fastai (1)
categorical-features (1)
fastai-category-encoders (1)
fasttext-embeddings (1)
notebooks (1)
data processing (1)
analytics (1)
data science (1)
map-reduce (1)
feature engineering (1)
business intelligence (1)
business-intelligence (1)
olap (1)
workflow (1)
PyTorch (1)
Data loader (1)
Data splitter (1)
DataLoader (1)
random_split (1)
train test (1)
train test validation split (1)
data preprocessing (1)
pytorch dataset split (1)
data-loader (1)
data-loading (1)
data-preprocess (1)
data-split (1)