Ecosyste.ms: Packages
An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.
pypi.org "data-transformation" keyword
Top 8.8% on pypi.org
zingg 0.4.0
Zingg Entity Resolution, Data Mastering and Deduplication
3 versions - Latest release: 5 months ago - 1 dependent repository - 1.15 thousand downloads last month - 890 stars on GitHub - 1 maintainer
arff-format-converter 1.0.3
Converts ARFF files to CSV, JSON, XML, XLSX, and ORC
10 versions - Latest release: 3 months ago - 112 downloads last month - 1 star on GitHub - 1 maintainer
Top 1.5% on pypi.org
glom 23.5.0
A declarative object transformer and formatter, for conglomerating nested data.
24 versions - Latest release: 6 months ago - 78 dependent packages - 281 dependent repositories - 1.97 million downloads last month - 1,778 stars on GitHub - 1 maintainer
Top 9.4% on pypi.org
pyoptimus 0.1.0
Optimus is the missing framework for cleaning and pre-processing data in a distributed fashion.
32 versions - Latest release: over 1 year ago - 1 dependent repository - 349 downloads last month - 1,441 stars on GitHub - 2 maintainers
meltano-target-cratedb 0.0.1
A Singer target for CrateDB, built with the Meltano SDK, and based on the Meltano PostgreSQL target.
1 version - Latest release: 5 months ago - 10 downloads last month - 0 stars on GitHub - 1 maintainer
fast-resource 0.1.1
fast-resource is a data transformation layer that sits between the database and the application's...
2 versions - Latest release: 12 months ago - 27 downloads last month - 9 stars on GitHub - 1 maintainer
df-and-order 0.2.5
Using df-and-order your interactions with dataframes become very clean and predictable.
11 versions - Latest release: almost 3 years ago - 1 dependent repository - 96 downloads last month - 3 stars on GitHub - 1 maintainer
totype 0.1.0
Data converter
1 version - Latest release: over 2 years ago - 1 dependent repository - 15 downloads last month - 0 stars on GitHub - 1 maintainer
taro 0.0.1
A package for repeatable rectangular data transformations.
1 version - Latest release: almost 3 years ago - 1 dependent repository - 18 downloads last month - 0 stars on GitHub - 1 maintainer
pycsvw 1.0.2
Generate JSON and RDF from csv files with metadata
3 versions - Latest release: over 6 years ago - 1 dependent repository - 21 downloads last month - 32 stars on GitHub - 1 maintainer
Top 4.8% on pypi.org
optimuspyspark 2.2.32
Optimus is the missing framework for cleaning and pre-processing data in a distributed fashion wi...
83 versions - Latest release: almost 4 years ago - 8 dependent repositories - 10.7 thousand downloads last month - 1,441 stars on GitHub - 2 maintainers
glide 0.4.1
Easy ETL
45 versions - Latest release: almost 2 years ago - 1 dependent repository - 270 downloads last month - 19 stars on GitHub - 1 maintainer
customer-segmentation-toolkit 0.1.1
Data transformations for the Engineering Lab2 Feature-Store-for-ML
5 versions - Latest release: almost 3 years ago - 1 dependent repository - 49 downloads last month - 1 star on GitHub - 1 maintainer
Related Keywords
python (6)
data-science (6)
machine-learning (4)
spark (3)
pandas (3)
data (3)
json (2)
csv (2)
data-processing (2)
python3 (2)
pyspark (2)
data-preparation (2)
data-extraction (2)
data-exploration (2)
data-cleaning (2)
data-cleaner (2)
data-analysis (2)
datacleaner (2)
data-wrangling (2)
data-cleansing (2)
data-profiling (2)
big-data-cleaning (2)
bigdata (2)
cudf (2)
dask (2)
dask-cudf (2)
etl (2)
dataframes (2)
data-engineering (1)
cratedb (1)
pipeline (1)
pypi-package (1)
metadata (1)
rdf (1)
csvw (1)
apachespark (1)
dag (1)
parallel-processing (1)
pipelines (1)
MLOps (1)
ML (1)
feature_store (1)
data_engineering (1)
jupyter (1)
mlops (1)
nbdev (1)
flask (1)
memcached (1)
fastapi (1)
django (1)
cache (1)
Singer (1)
redis (1)
PostgreSQL (1)
Meltano SDK (1)
data-converter (1)
Meltano (1)
io (1)
ETL (1)
ELT (1)
data-transfer (1)
data-toolkit (1)
data-transform (1)
data-loading (1)
data-convert (1)
format-conversion (1)
data-conversion (1)
arff (1)
modern-data-stack (1)
ml (1)
masterdata (1)
identity-resolution (1)
identity (1)
fuzzymatch (1)
fuzzy-matching (1)
entity-resolution (1)
dedupe (1)
dataquality (1)
datalake (1)
dataengineering (1)
data-transformations (1)
analytics-engineering (1)
analytics (1)
identity resolution (1)
data mastering (1)
record linkage (1)
deduplication (1)
Entity Resolution (1)
utilities (1)
recursion (1)
nested-structures (1)
dictionaries (1)
declarative (1)
cli (1)
apis (1)
xlxs (1)
package (1)
arff-format-converter (1)
arff-files (1)
apache-orc-format (1)