Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "data-wrangling" keyword

Top 4.7% on pypi.org
hypertools 0.8.0
A python package for visualizing and manipulating high-dimensional data
21 versions - Latest release: over 2 years ago - 1 dependent package - 72 dependent repositories - 1.02 thousand downloads last month - 1,795 stars on GitHub - 1 maintainer
desbordante 2.0.0
Science-intensive high-performance data profiler
3 versions - Latest release: 29 days ago - 1 dependent package - 138 downloads last month - 61 stars on GitHub - 1 maintainer
Top 9.4% on pypi.org
pyoptimus 0.1.0
Optimus is the missing framework for cleaning and pre-processing data in a distributed fashion.
32 versions - Latest release: over 1 year ago - 1 dependent repositories - 349 downloads last month - 1,441 stars on GitHub - 2 maintainers
prosto 0.6.0
Data processing toolkit radically changing the way data is processed
5 versions - Latest release: over 2 years ago - 1 dependent repositories - 64 downloads last month - 89 stars on GitHub - 1 maintainer
skloverlay 1.2.0
SKLearn Classification Interface
5 versions - Latest release: 8 months ago - 37 downloads last month - 0 stars on GitHub - 1 maintainer
gis-conflation-toolchain 0.7.1
gis-conflation-toolchain
2 versions - Latest release: 12 months ago - 21 downloads last month - 0 stars on GitHub - 1 maintainer
skrub 0.5.0
Prepping tables for machine learning
4 versions - Latest release: 5 months ago - 443 downloads last month - 1,012 stars on GitHub - 4 maintainers
pandance 0.3.0
Advanced relational operations for pandas DataFrames
4 versions - Latest release: 12 months ago - 1 dependent repositories - 12 downloads last month - 5 stars on GitHub - 1 maintainer
panda-grove 0.1.4
A lightweight package for easier management of multiple Pandas DataFrames
4 versions - Latest release: 4 months ago - 1 dependent repositories - 74 downloads last month - 2 stars on GitHub - 1 maintainer
xplore 0.0.1
A python package built with pandas for data scientist/analysts, AI/ML engineers for exploring fea...
1 version - Latest release: over 3 years ago - 1 dependent repositories - 30 downloads last month - 21 stars on GitHub - 3 maintainers
whyqd 1.1.3
data wrangling simplicity, complete audit transparency, and at speed
25 versions - Latest release: 2 months ago - 1 dependent repositories - 238 downloads last month - 32 stars on GitHub - 1 maintainer
sparx 0.0.2
Sparx is a simplified data munging, wrangling and preparation library
2 versions - Latest release: over 5 years ago - 1 dependent repositories - 13 downloads last month - 0 stars on GitLab.com - 3 maintainers
Top 4.8% on pypi.org
optimuspyspark 2.2.32
Optimus is the missing framework for cleaning and pre-processing data in a distributed fashion wi...
83 versions - Latest release: almost 4 years ago - 8 dependent repositories - 10.7 thousand downloads last month - 1,441 stars on GitHub - 2 maintainers
data-toolz 0.1.11
Data helper package
9 versions - Latest release: 9 months ago - 1 dependent repositories - 872 downloads last month - 7 stars on GitHub - 1 maintainer
Top 2.6% on pypi.org
datatest 0.11.1
Test driven data-wrangling and data validation.
16 versions - Latest release: over 3 years ago - 3 dependent packages - 25 dependent repositories - 21.8 thousand downloads last month - 288 stars on GitHub - 1 maintainer
datasetops 0.0.6
Fluent dataset operations, compatible with your favorite libraries
4 versions - Latest release: about 4 years ago - 4 dependent repositories - 57 downloads last month - 10 stars on GitHub - 1 maintainer
data-cleaning 1.0.1
An utility to clean the data and return you the cleaned data
2 versions - Latest release: about 3 years ago - 1 dependent repositories - 123 downloads last month - 5 stars on GitHub - 2 maintainers
omnipy 0.15.12
Omnipy is a high level Python library for type-driven data wrangling and scalable workflow orches...
57 versions - Latest release: 23 days ago - 1 dependent package - 2 dependent repositories - 730 downloads last month - 11 stars on GitHub - 1 maintainer
Top 3.6% on pypi.org
pipda 0.13.1
A framework for data piping in python
56 versions - Latest release: 7 months ago - 2 dependent packages - 18 dependent repositories - 3.11 thousand downloads last month - 35 stars on GitHub - 1 maintainer
anonymized-fraud-detection 0.1.3
A small package to parse and train an ML model for anonymized credit card transactions. Refer to ...
6 versions - Latest release: over 1 year ago - 47 downloads last month - 2 stars on GitHub - 1 maintainer
data-hopper 0.1.0
Package for data wrangling in python.
1 version - Latest release: about 2 years ago - 1 dependent repositories - 8 downloads last month - 1 stars on GitHub - 1 maintainer
pydata-wrangler 0.2.2
Wrangle messy data into pandas DataFrames, with a special focus on text data and natural language...
10 versions - Latest release: almost 2 years ago - 1 dependent repositories - 58 downloads last month - 9 stars on GitHub - 1 maintainer
monggregate 0.20.0
MongoDB aggregation pipelines made easy. Joins, grouping, counting and much more...
22 versions - Latest release: 4 months ago - 1 dependent repositories - 306 downloads last month - 18 stars on GitHub - 1 maintainer
fraud-detection-package-new 0.1.1 removed
A small package to parse and train an ML model. Will update the readme later
10 versions - Latest release: over 1 year ago - 280 downloads last month - 1 stars on GitHub