Ecosyste.ms: Packages
An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.
conda-forge.org "data" keyword
r-spocc 1.2.0
Species occurrence data toolkit for R5 versions - Latest release: over 3 years ago - 1 dependent package - 1 dependent repositories - 101 stars on GitHub
r-rgbif 3.7.3
Interface to the Global Biodiversity Information Facility API18 versions - Latest release: over 1 year ago - 2 dependent packages - 1 dependent repositories - 132 stars on GitHub
dataproperty 0.55.0 💰
A Python library for extract property from data.9 versions - Latest release: about 2 years ago - 2 dependent packages - 16 stars on GitHub
mimesis 6.1.1
Mimesis is a high-performance fake data generator for Python, which provides data for a variety o...9 versions - Latest release: over 1 year ago - 3 dependent repositories - 3,928 stars on GitHub
dvc-data 0.26.0
DVC's data management subsystem67 versions - Latest release: over 1 year ago - 1 dependent package - 13 stars on GitHub
datacompy 0.8.3
Pandas and Spark DataFrame comparison for humans9 versions - Latest release: over 1 year ago - 1 dependent repositories - 269 stars on GitHub
Top 6.5% on conda-forge.org
7 versions - Latest release: almost 3 years ago - 7 dependent packages - 83 dependent repositories - 2,606 stars on GitHub
pandas-datareader 0.10.0
Extract data from a wide range of Internet sources into a pandas DataFrame.7 versions - Latest release: almost 3 years ago - 7 dependent packages - 83 dependent repositories - 2,606 stars on GitHub
bw_processing 0.8.2
Tools to create structured arrays in a common format3 versions - Latest release: over 1 year ago - 1 dependent package - 2 stars on GitHub
r-datawizard 0.6.3 💰
Magic potions to clean and transform your data 🧙15 versions - Latest release: over 1 year ago - 6 dependent packages - 1 dependent repositories - 148 stars on GitHub
pyusid 0.0.10
Framework for storing, visualizing, and processing Universal Spectroscopic and Imaging Data (USID)10 versions - Latest release: about 3 years ago - 2 dependent packages - 23 stars on GitHub
xbpch 0.3.5
xbpch is a simple utility for reading binary-format outputs (bpch) used in versions of GEOS-Chem ...4 versions - Latest release: almost 5 years ago - 1 dependent package - 17 stars on GitHub
rockhound 0.2.0
RockHound is a Python library to download geophysical models and datasets (PREM, CRUST1.0, ETOPO1...2 versions - Latest release: about 4 years ago - 3 dependent repositories - 34 stars on GitHub
ncagg 0.8.14
ncagg is a flexible aggregation software that combines multiple netcdf files into a single file, ...3 versions - Latest release: over 1 year ago - 6 stars on GitHub
cdms2 3.1.5
The Community Data Management System is an object-oriented data management system, specialized fo...10 versions - Latest release: over 2 years ago - 7 dependent packages - 3 dependent repositories - 8 stars on GitHub
dirty_cat 0.3.0
Machine learning on dirty tabular data3 versions - Latest release: over 1 year ago - 682 stars on GitHub
ensaio 0.4.0
Practice datasets to probe your code4 versions - Latest release: almost 2 years ago - 5 dependent repositories - 9 stars on GitHub
pypika 0.48.8
PyPika is a python SQL query builder that exposes the full richness of the SQL language using a s...1 version - Latest release: over 2 years ago - 1 dependent package - 1,948 stars on GitHub
ckan 2.9.4
CKAN is an open-source DMS (data management system) for powering data hubs and data portals. CKAN...2 versions - Latest release: over 2 years ago - 3,737 stars on GitHub
r-edmdata 1.2.0
Supplementary data package for the edm package3 versions - Latest release: almost 3 years ago - 4 stars on GitHub
pooch 1.6.0
Pooch manages your remote data files. It automatically downloads and stores them in a local direc...23 versions - Latest release: over 2 years ago - 46 dependent packages - 203 dependent repositories - 394 stars on GitHub
pandas-gbq 0.17.9
Google BigQuery connector for pandas36 versions - Latest release: over 1 year ago - 3 dependent packages - 6 dependent repositories - 357 stars on GitHub
r-datacomparer 0.1.4
dataCompareR is an R package that allows users to compare two datasets and view a report on the s...2 versions - Latest release: over 2 years ago - 68 stars on GitHub
glom 22.1.0
☄️ Python's nested data operator (and CLI), for all your declarative restructuring needs. Got dat...7 versions - Latest release: over 1 year ago - 1 dependent package - 3 dependent repositories - 1,674 stars on GitHub
drawdata 0.2.0
Draw datasets from within Jupyter.2 versions - Latest release: almost 2 years ago - 528 stars on GitHub
scmdata 0.14.2
Handling of Simple Climate Model data31 versions - Latest release: over 1 year ago - 4 dependent packages - 3 dependent repositories - 5 stars on GitHub
cleverdict 1.9.2
A JSON-friendly data structure which allows both object attributes and dictionary keys and values...1 version - Latest release: over 1 year ago - 1 dependent package - 95 stars on GitHub
pybv 0.7.5
A lightweight I/O utility for the BrainVision data format, written in Python.11 versions - Latest release: over 1 year ago - 2 dependent packages - 20 dependent repositories - 16 stars on GitHub
r-mlr3data 0.6.1 💰
Data sets used in the book, gallery, or in examples of mlr3.6 versions - Latest release: over 1 year ago - 1 dependent package - 2 stars on GitHub
pyjanitor 0.23.1
Clean APIs for data cleaning. Python implementation of R package Janitor51 versions - Latest release: over 1 year ago - 1 dependent package - 14 dependent repositories - 1,124 stars on GitHub
wetterdienst 0.48.0 💰
Open weather data for humans.43 versions - Latest release: over 1 year ago - 241 stars on GitHub
ytree 3.1.2
A yt-based merger-tree code.2 versions - Latest release: about 2 years ago - 14 stars on GitHub
litedao 0.0.7
Intuitive interaction with SQLite database1 version - Latest release: almost 2 years ago - 1 dependent package - 0 stars on GitHub
r-daff 0.3.5
Diff, patch and merge for data.frames, see http://paulfitz.github.io/daff/1 version - Latest release: almost 5 years ago - 133 stars on GitHub
pyfunctional 1.4.3
Python library for creating data pipelines with chain functional programming1 version - Latest release: over 1 year ago - 2,145 stars on GitHub
retriever 3.1.0
This module analyzes jpeg/jpeg2000/png/gif image header and return image size.8 versions - Latest release: about 2 years ago - 1 dependent repositories - 279 stars on GitHub
pytest-cases 3.6.13
Did you ever think that most of your test functions were actually the same test code, but with di...34 versions - Latest release: almost 2 years ago - 7 dependent repositories - 274 stars on GitHub
Top 4.9% on conda-forge.org
133 versions - Latest release: over 1 year ago - 8 dependent packages - 41 dependent repositories - 11,520 stars on GitHub
prefect 2.6.7
Prefect is a workflow management system, designed for modern infrastructure and powered by the op...133 versions - Latest release: over 1 year ago - 8 dependent packages - 41 dependent repositories - 11,520 stars on GitHub
quilt 3.0.6
Quilt is infrastructure for data-driven teams to store, version, deploy and iterate on models and...2 versions - Latest release: over 4 years ago - 1,226 stars on GitHub
quilt3 5.0.0
Quilt is infrastructure for data-driven teams to store, version, deploy and iterate on models and...21 versions - Latest release: about 2 years ago - 4 dependent packages - 7 dependent repositories - 1,226 stars on GitHub
pdpipe 0.3.2
Ever written a preprocessing pipeline for pandas dataframes and had trouble serializing it for la...20 versions - Latest release: over 1 year ago - 703 stars on GitHub
locopy 0.5.0
locopy: Loading/Unloading to Redshift and Snowflake using Python.5 versions - Latest release: over 1 year ago - 98 stars on GitHub
shared 0.0.24
Triptych for data exchange and persistence6 versions - Latest release: over 1 year ago - 2 dependent packages - 17 stars on GitHub
mage-ai 0.7.5
🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and t...22 versions - Latest release: over 1 year ago - 3,631 stars on GitHub
uxf 2.8.1
Uniform eXchange Format (uxf) is a plain text human readable optionally typed storage format that...26 versions - Latest release: over 1 year ago - 92 stars on GitHub
osdatahub 1.2.0
A python package from Ordnance Survey, designed to make data from the OS Data Hub APIs readily ac...6 versions - Latest release: over 1 year ago - 17 stars on GitHub
flytekit 1.1.0
Flytekit Python is the Python Library for easily authoring, testing, deploying, and interacting w...4 versions - Latest release: almost 2 years ago - 8 dependent packages - 123 stars on GitHub
flytekitplugins-sqlalchemy 1.2.4
SQLAlchemy plugin for Flytekit: `flytekitplugins-sqlalchemy` PyPI: [https://pypi.org/project/fly...8 versions - Latest release: over 1 year ago - 123 stars on GitHub
flytekitplugins-modin 1.2.4
Modin plugin for Flytekit: `flytekitplugins-modin` PyPI: [https://pypi.org/project/flytekitplugi...9 versions - Latest release: over 1 year ago - 123 stars on GitHub
flytekitplugins-athena 1.2.4
Athena plugin for Flytekit: `flytekitplugins-athena` PyPI: [https://pypi.org/project/flytekitplu...8 versions - Latest release: over 1 year ago - 123 stars on GitHub
flytekitplugins-awsbatch 1.2.4
AWS Batch plugin for Flytekit: `flytekitplugins-awsbatch` PyPI: [https://pypi.org/project/flytek...9 versions - Latest release: over 1 year ago - 123 stars on GitHub
flytekitplugins-data-fsspec 1.2.4
`fsspec` powered data-plugins for Flytekit: `flytekitplugins-data-fsspec` PyPI: [https://pypi.or...8 versions - Latest release: over 1 year ago - 123 stars on GitHub
flytekitplugins-spark 1.0.5
Spark 3 plugin for Flytekit: `flytekitplugins-spark` PyPI: [https://pypi.org/project/flytekitplu...1 version - Latest release: over 1 year ago - 123 stars on GitHub
copulae 0.7.7
Copulae is a package for multivariate modeling18 versions - Latest release: about 2 years ago - 1 dependent package - 107 stars on GitHub
spyql 0.8.1
Query data on the command line with SQL-like SELECTs powered by Python expressions7 versions - Latest release: over 1 year ago - 872 stars on GitHub
signac-dashboard 0.3.1
Built on top of the signac framework, signac-dashboard allows users to rapidly visualize and anal...17 versions - Latest release: over 1 year ago - 5 dependent repositories - 17 stars on GitHub
castoredc_api 0.1.4
Python Wrapper for Castor EDC API7 versions - Latest release: almost 2 years ago - 2 stars on GitHub
pyprep 0.4.2
A Python implementation of the Preprocessing Pipeline (PREP) for EEG data4 versions - Latest release: about 2 years ago - 1 dependent repositories - 86 stars on GitHub
medaprep 0.1.1
medaprep is a data preparation and feature engineering toolkit for geospatial applications.1 version - Latest release: over 1 year ago - 1 stars on GitHub
pydaymet 0.13.7
PyDaymet is a part of Hydrodata software stack that provides access to the Daymet's climate data ...24 versions - Latest release: over 1 year ago - 1 dependent package - 2 dependent repositories - 6 stars on GitHub
r-rbison 1.0.0
:no_entry: ARCHIVED :no_entry: Interface to the 'USGS' 'BISON' API2 versions - Latest release: almost 4 years ago - 1 dependent package - 1 dependent repositories - 10 stars on GitHub
scikit-data 0.1.3
This library offers functions to manipulate, clean and visualize data in a easy way.3 versions - Latest release: over 1 year ago - 19 stars on GitHub
pandas-msgpack 0.1.4 💰
Pandas Msgpack1 version - Latest release: over 1 year ago - 22 stars on GitHub
r-rsnps 0.4.0
Wrapper to a number of SNP web APIs3 versions - Latest release: over 3 years ago - 45 stars on GitHub
r-rio 0.5.29
A Swiss-Army Knife for Data I/O5 versions - Latest release: over 2 years ago - 6 dependent packages - 6 dependent repositories - 548 stars on GitHub
gspread-pandas 2.2.3
A package to easily open an instance of a Google spreadsheet and interact with worksheets through...3 versions - Latest release: about 4 years ago - 357 stars on GitHub
r-wdpar 1.3.2
Interface to the World Database on Protected Areas3 versions - Latest release: over 2 years ago - 1 dependent repositories - 35 stars on GitHub
libhxl 4.27
This library provides support for parsing, validating, cleaning, and transforming humanitarian da...1 version - Latest release: over 1 year ago - 38 stars on GitHub
anaconda-project 0.11.1
By adding an anaconda-project.yml to a project directory, a single anaconda-project runcommand wi...13 versions - Latest release: over 1 year ago - 3 dependent packages - 156 dependent repositories - 200 stars on GitHub
flytekitplugins-pandera 1.2.4
Pandera plugin for Flytekit: `flytekitplugins-pandera` PyPI: [https://pypi.org/project/flytekitp...7 versions - Latest release: over 1 year ago - 123 stars on GitHub
r-babynames 1.0.1
An R package containing US baby names from the SSA1 version - Latest release: about 2 years ago - 130 stars on GitHub
signac 1.8.0
The signac framework helps users manage and scale file-based workflows, facilitating data reuse, ...27 versions - Latest release: over 1 year ago - 2 dependent packages - 16 dependent repositories - 114 stars on GitHub
ipychart 0.4.0
ipychart is an ipywidget which allows to create dynamic, refined and customizable charts within t...9 versions - Latest release: about 2 years ago - 71 stars on GitHub
schemaorg 0.1.1 💰
Python functions for instantiating and validating schemas.4 versions - Latest release: over 2 years ago - 29 stars on GitHub
colour-science 0.4.1 💰
Colour Science for Python11 versions - Latest release: about 2 years ago - 3 dependent repositories - 1,664 stars on GitHub
Related Keywords
python
40
data-science
22
hacktoberfest
12
r
11
spark
10
automation
9
pypi
9
workflows
8
pandas
8
sdk
8
mlops
8
flyte-tasks
8
extensible
8
flyte
8
r-package
7
data-analysis
6
data-engineering
6
rstats
6
database
5
sql
5
python3
5
api
4
dataset
4
json
4
pipeline
4
climate
4
open-data
3
datascience
3
machine-learning
3
fatiando-a-terra
3
sqlite
3
analysis
3
biodiversity
3
spocc
3
binary
2
eeg
2
pydata
2
time-series
2
dataframe
2
hydrology
2
orchestration
2
gbif
2
csv
2
cli
2
data-version-control
2
species
2
data-versioning
2
parquet
2
serialization
2
dataframes
2
jupyter
2
data-visualization
2
geoscience
2
etl
2
datasets
2
geophysics
2
earth-science
2
xarray
2
pyrustic
2
cran
2
library
2
data-management
2
reproducibility
2
storage-engine
1
jesth
1
toml
1
xml
1
yaml
1
api-wrapper
1
geospatial
1
conda
1
copula
1
copula-models
1
copulae
1
copulas
1
dependency-analysis
1
dependency-modeling
1
modeling
1
pypi-packages
1
configuration
1
collections
1
unload
1
snowflake
1
s3
1
redshift
1
psycopg2
1
pg8000
1
copy
1
aws
1
pandas-dataframe
1
workflow-engine
1
workflow
1
prefect
1
orion
1
observability
1
pretty-printer
1
parser
1
ini
1
transformation
1
reverse-etl
1