Ecosyste.ms: Packages
An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.
pypi.org "data wrangling" keyword
omnipy 0.15.12
Omnipy is a high level Python library for type-driven data wrangling and scalable workflow orches...57 versions - Latest release: about 1 month ago - 1 dependent package - 2 dependent repositories - 417 downloads last month - 11 stars on GitHub - 1 maintainer
badfish 0.1.2
Badfish - A missing data analysis and wrangling library in Python4 versions - Latest release: almost 8 years ago - 1 dependent repositories - 31 downloads last month - 19 stars on GitHub - 1 maintainer
adaptivebridge 1.1.0
Revolutionizing ML adaptive modelling for handling missing features and data. The model can predi...5 versions - Latest release: 4 months ago - 43 downloads last month - 1 stars on GitHub - 1 maintainer
spooq 3.4.0
Spooq is a PySpark based helper library for ETL data ingestion pipeline in Data Lakes.11 versions - Latest release: 3 months ago - 1 dependent repositories - 19.2 thousand downloads last month - 8 stars on GitHub - 1 maintainer
featurebridge 0.9.5 removed
FeatureBridge: Revolutionizing ML adaptive modelling for handling missing features and data. The ...3 versions - Latest release: 8 months ago - 191 downloads last month - 0 stars on GitHub - 1 maintainer
parallelfileconcatenator 0.1 removed
ParallelFileConcatenator is a robust tool designed to efficiently combine data files of various f...1 version - Latest release: 10 months ago
Related Keywords
data cleaning
4
machine learning
4
data analysis
4
data science
3
classification
2
predictive modeling
2
data preprocessing
2
sklearn
2
scikit-learn
2
python
2
data visualization
2
python library
2
data processing
2
data exploration
2
data manipulation
2
analytics
2
statistics
2
artificial intelligence
2
AI
2
feature engineering
2
etl
2
data pre-processing tool
2
data quality assessment
2
missing data detection
2
data missingness
2
impute missing values
2
data completeness
2
data validation
2
data cleansing
2
data integrity
2
data handling
2
missing data analysis
2
data quality
2
data engineering
2
data imputation
2
missing data
2
big data
2
regression
2
hive
1
metadata
1
data preparation
1
file processing
1
file management
1
data compression
1
data deduplication
1
data merging
1
data aggregation
1
parallel file concatenation
1
transform
1
load
1
extract
1
etl-pipeline
1
data-engineering
1
big-data
1
streaming
1
batch
1
databricks
1
data ingestion
1
hadoop
1
cloudera
1
workflows
1
research data
1
prefect
1
pydantic
1
FAIR
1
ontologies
1
JSON
1
tabular
1
type-driven
1
orchestration
1
data models
1
universal
1
data
1
data-models
1
data-wrangling
1
fair
1
json
1
research-data
1
workflow
1
missing
1
imputation
1
spooq
1
spark
1