Ecosyste.ms: Packages

An open API service providing package, version, and dependency metadata for many open source software ecosystems and registries.

Top 1.5% on conda-forge.org
Top 0.5% dependent packages on conda-forge.org
Top 1.1% dependent repos on conda-forge.org
Top 2.1% forks on conda-forge.org

conda-forge.org : pyarrow

Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing

purl: pkg:conda/pyarrow
Keywords: arrow
License: Apache-2.0
Latest release: over 1 year ago
First release: over 1 year ago
Dependent packages: 160
Dependent repositories: 605
Stars: 11,354 on GitHub
Forks: 2,795 on GitHub
See more repository details: repos.ecosyste.ms
Last synced: 14 days ago
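
The `purl` field above is a package URL, a compact identifier of the general form `pkg:type/namespace/name@version`. As a minimal sketch of how such an identifier breaks down, the following parser handles only that common subset and ignores qualifiers and subpaths. The names `Purl` and `parse_purl` are hypothetical helpers introduced for illustration, not part of any library or of the Ecosyste.ms API.

```python
from typing import NamedTuple, Optional
from urllib.parse import unquote


class Purl(NamedTuple):
    """Hypothetical container for the parts of a package URL."""
    type: str
    namespace: Optional[str]
    name: str
    version: Optional[str]


def parse_purl(purl: str) -> Purl:
    """Parse the common subset of a package URL: pkg:type/namespace/name@version."""
    scheme, sep, rest = purl.partition(":")
    if not sep or scheme.lower() != "pkg":
        raise ValueError(f"not a package URL: {purl!r}")

    # Drop the optional subpath (#...) and qualifiers (?...); this sketch ignores them.
    rest = rest.split("#", 1)[0].split("?", 1)[0].strip("/")

    segments = rest.split("/")
    if len(segments) < 2 or not all(segments):
        raise ValueError(f"expected at least a type and a name: {purl!r}")

    # The first segment is the type, the last is name[@version],
    # and anything in between is the namespace.
    pkg_type = segments[0].lower()
    name, _, version = segments[-1].partition("@")
    if not name:
        raise ValueError(f"missing package name: {purl!r}")
    namespace = "/".join(unquote(s) for s in segments[1:-1]) or None

    return Purl(pkg_type, namespace, unquote(name), unquote(version) or None)


if __name__ == "__main__":
    print(parse_purl("pkg:conda/pyarrow"))
```

For the identifier listed here, the type is `conda`, the name is `pyarrow`, and both the namespace and the version are absent. A production tool would more likely rely on a maintained purl library than on a hand-written parser like this one.
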

snowflake-connector-python 3.10.0
The Snowflake Connector for Python provides an interface for developing Python applications that ...
10 versions - Latest release: 2 days ago - 3 dependent packages - 4 dependent repositories - 424 stars on GitHub
modin-ray 0.28.1
Modin is a drop-in replacement for pandas. While pandas is single-threaded, Modin lets you instan...
7 versions - Latest release: 13 days ago - 3 dependent packages - 8,463 stars on GitHub
Top 9.2% on anaconda.org
streamlit 1.32.0
Streamlit lets you turn data scripts into sharable web apps in minutes, not weeks. It's all Pytho...
7 versions - Latest release: about 2 months ago - 13 dependent packages - 54 dependent repositories - 25,632 stars on GitHub
Top 9.0% on anaconda.org
pyspark 3.4.1
Apache Spark is a fast and general engine for large-scale data processing.
16 versions - Latest release: 9 months ago - 3 dependent packages - 89 dependent repositories - 35,277 stars on GitHub
datasets 2.12.0
Datasets is a lightweight library providing two main features: - one-line dataloaders for many p...
4 versions - Latest release: 12 months ago - 4 dependent packages - 29 dependent repositories - 15,553 stars on GitHub
Top 6.9% on conda-forge.org
streamlit 1.15.0
51 versions - Latest release: over 1 year ago - 6 dependent packages - 54 dependent repositories
coiled-runtime 0.1.1
Coiled Runtime is a metapackage that pins Dask, and other packages commonly used with Dask, to a ...
4 versions - Latest release: over 1 year ago - 1 dependent repository - 13 stars on GitHub
Top 4.1% on conda-forge.org
datasets 2.7.0
Datasets is a lightweight library providing one-line dataloaders for many public datasets and one...
34 versions - Latest release: over 1 year ago - 13 dependent packages - 29 dependent repositories - 15,569 stars on GitHub
rubicon-ml 0.4.0
rubicon-ml is a machine learning solution designed to help standardize the model development life...
33 versions - Latest release: over 1 year ago - 2 dependent repositories - 99 stars on GitHub
graphysio 2022.11.15
GraPhysio is a graphical time series visualizer created for biometric data signals from ICU patie...
6 versions - Latest release: over 1 year ago - 5 stars on GitHub
Top 4.0% on conda-forge.org
mlflow 2.0.1
Open source platform for the machine learning lifecycle
43 versions - Latest release: over 1 year ago - 9 dependent packages - 40 dependent repositories - 13,891 stars on GitHub
mage-ai 0.7.5
🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and t...
22 versions - Latest release: over 1 year ago - 3,631 stars on GitHub
Top 5.1% on conda-forge.org
dvc 2.34.2
Data Version Control or DVC is an open-source tool for data science and machine learning projects.
212 versions - Latest release: over 1 year ago - 10 dependent packages - 26 dependent repositories - 11,242 stars on GitHub
python-duckdb 0.6.0
DuckDB is an embedded database designed to execute analytical SQL queries fast while embedded in ...
21 versions - Latest release: over 1 year ago - 6 dependent packages - 1 dependent repository - 9,159 stars on GitHub
sqlalchemy-dremio 3.0.3
SQLAlchemy for Dremio via the ODBC and Flight interface.
2 versions - Latest release: over 1 year ago - 19 stars on GitHub
modin-hdk 0.17.0
Modin: Scale your Pandas workflows by changing a single line of code
4 versions - Latest release: over 1 year ago - 8,468 stars on GitHub
modin-ray 0.17.0
Modin: Scale your Pandas workflows by changing a single line of code
22 versions - Latest release: over 1 year ago - 4 dependent packages - 8,468 stars on GitHub
dask-snowflake 0.2.0
Dask integration for Snowflake
3 versions - Latest release: over 1 year ago - 1 dependent repository - 22 stars on GitHub
feast 0.26.0
Feature Store for Machine Learning
1 version - Latest release: over 1 year ago - 4,088 stars on GitHub
ploomber 0.21.7
The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️
60 versions - Latest release: over 1 year ago - 2 dependent repositories - 3,017 stars on GitHub
rubin-env 5.0.0 💰
This metapackage exists to define the Rubin Observatory common software environment, including ve...
30 versions - Latest release: over 1 year ago - 4 dependent packages - 0 stars on GitHub
rubin-env-nosysroot 5.0.0 💰
This metapackage exists to define the Rubin Observatory common software environment, including ve...
16 versions - Latest release: over 1 year ago - 2 dependent packages - 0 stars on GitHub
rpy2-arrow 0.0.7
`rpy2-arrow` allows you to share the same arrow datasets between Python and R without copying the...
2 versions - Latest release: over 1 year ago - 11 stars on GitHub
ray-data 2.0.1
Ray is a fast and simple framework for building and running distributed applications. It is split...
10 versions - Latest release: over 1 year ago - 1 dependent package - 1 dependent repository - 28,849 stars on GitHub
nannyml 0.7.0
Detecting silent model failure. NannyML estimates performance for regression and classification m...
8 versions - Latest release: over 1 year ago - 1,451 stars on GitHub
coffea 0.7.20
Basic tools and wrappers for enabling not-too-alien syntax when running columnar Collider HEP ana...
28 versions - Latest release: over 1 year ago - 11 dependent repositories - 106 stars on GitHub
pygeohydro 0.13.7
A part of HyRiver software stack for accessing hydrology data through web services
19 versions - Latest release: over 1 year ago - 1 dependent package - 4 dependent repositories - 52 stars on GitHub
pynhd 0.13.7
PyNHD is a part of Hydrodata software stack that provides access to NHDPlus data through NLDI and ...
23 versions - Latest release: over 1 year ago - 2 dependent packages - 3 dependent repositories - 23 stars on GitHub
google-cloud-bigquery 3.3.6
Python idiomatic client for Google BigQuery
93 versions - Latest release: over 1 year ago - 16 dependent packages - 5 dependent repositories - 575 stars on GitHub
google-cloud-bigquery-core 3.3.6
google-cloud-bigquery-core the core client library for connecting to the BigQuery API. Supported...
61 versions - Latest release: over 1 year ago - 5 dependent packages - 3 dependent repositories - 575 stars on GitHub
pandera 0.13.4 💰
A simple, zero-configuration Python library to help you build confidence in the quality of your d...
30 versions - Latest release: over 1 year ago - 3 dependent packages - 9 dependent repositories - 2,116 stars on GitHub
pandera-base 0.13.4 💰
A simple, zero-configuration Python library to help you build confidence in the quality of your d...
9 versions - Latest release: over 1 year ago - 13 dependent packages - 2,130 stars on GitHub
pandera-core 0.13.4 💰
A simple, zero-configuration Python library to help you build confidence in the quality of your d...
20 versions - Latest release: over 1 year ago - 2,130 stars on GitHub
snowflake-connector-python 2.8.1
The Snowflake Connector for Python provides an interface for developing Python applications that ...
58 versions - Latest release: over 1 year ago - 11 dependent packages - 4 dependent repositories - 425 stars on GitHub
stglib 0.5.0
This package contains code to process data from a variety of oceanographic instrumentation, consi...
6 versions - Latest release: over 1 year ago - 6 stars on GitHub
woodwork 0.20.0
Woodwork is a Python library that provides robust methods for managing and communicating data typ...
44 versions - Latest release: over 1 year ago - 5 dependent packages - 112 stars on GitHub
datapane 0.15.4
Create and publish interactive reports and apps in Python. Datapane is an open source framework w...
39 versions - Latest release: over 1 year ago - 1 dependent package - 9 dependent repositories - 811 stars on GitHub
lsstdesc-gcr-catalogs 1.4.2
GCRCatalogs is a Python package that serves as a catalog repository for the Rubin LSST Dark Energ...
13 versions - Latest release: over 1 year ago - 21 stars on GitHub
Top 3.0% on conda-forge.org
pyspark 3.3.1
Apache Spark is a fast and general engine for large-scale data processing.
28 versions - Latest release: over 1 year ago - 28 dependent packages - 89 dependent repositories
koopa-viz 0.0.1
1 version - Latest release: over 1 year ago - 0 stars on GitHub
plateau 4.1.3
4 versions - Latest release: over 1 year ago
graphistry 0.28.5
PyGraphistry is a Python library to quickly load, shape, embed, and explore big graphs with the G...
16 versions - Latest release: over 1 year ago - 1,817 stars on GitHub
dremio-arrow 1.0.1
Dremio Arrow Flight Client
1 version - Latest release: over 1 year ago - 2 stars on GitHub
mlflow-pipelines 1.30.0
Open source platform for the machine learning lifecycle
3 versions - Latest release: over 1 year ago - 13,891 stars on GitHub
Top 9.3% on conda-forge.org
apache-beam 2.42.0
Apache Beam is an open source, unified model for defining both batch and streaming data-parallel ...
34 versions - Latest release: over 1 year ago - 8 dependent packages - 1 dependent repository - 6,627 stars on GitHub
sas7bdat-converter 1.2.0 💰
Converts proprietary sas7bdat files from SAS into formats such as csv and XML useable by other pr...
1 version - Latest release: over 1 year ago - 14 stars on GitHub
triad 0.7.0
A collection of python utility functions
16 versions - Latest release: over 1 year ago - 4 dependent packages - 8 stars on GitHub
tiled-dataframe 0.1.0a75
This is an experimental prototype.
9 versions - Latest release: over 1 year ago - 1 dependent package - 31 stars on GitHub
tiled-sparse 0.1.0a75
This is an experimental prototype.
7 versions - Latest release: over 1 year ago - 2 dependent packages - 31 stars on GitHub
delta-sharing-python 0.5.2
An open protocol for secure data sharing
6 versions - Latest release: over 1 year ago - 539 stars on GitHub
vaex-core 4.14.0
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of ...
54 versions - Latest release: over 1 year ago - 10 dependent packages - 1 dependent repository - 7,837 stars on GitHub
pyhdk 0.2.0
oneHDK
2 versions - Latest release: over 1 year ago - 1 dependent package - 16 stars on GitHub
larch 5.7.1
This is a tool for the estimation and application of logit-based discrete choice models. It is de...
30 versions - Latest release: over 1 year ago - 35 stars on GitHub
pandas-gbq 0.17.9
Google BigQuery connector for pandas
36 versions - Latest release: over 1 year ago - 3 dependent packages - 6 dependent repositories - 357 stars on GitHub
arrow_odbc 0.3.4 💰
Read Apache Arrow batches from ODBC data sources in Python
5 versions - Latest release: over 1 year ago - 1 dependent package - 20 stars on GitHub
dask-ms 0.2.14
10 versions - Latest release: over 1 year ago - 1 dependent repository
fugue 0.7.3
Fugue is a unified interface for distributed computing that lets users execute Python, pandas, a...
8 versions - Latest release: over 1 year ago - 4 dependent repositories - 1,271 stars on GitHub
xlntpyarrow 1.1.0
📊 Cross-platform user-friendly xlsx library for C++11+
1 version - Latest release: over 1 year ago - 1,239 stars on GitHub
towhee 0.8.1
For win-64 users, you need to run `conda config --add channels 'pytorch'` first due to this [issu...
10 versions - Latest release: over 1 year ago - 1,920 stars on GitHub
databricks-sql-connector 2.1.0
9 versions - Latest release: over 1 year ago - 1 dependent package
sharrow 2.4.0
numba for ActivitySim-style spec files
12 versions - Latest release: over 1 year ago - 2 dependent packages - 1 star on GitHub
mosartwmpy 0.4.4
Python translation of MOSART-WM: a water routing and management model
11 versions - Latest release: over 1 year ago - 11 stars on GitHub
awswrangler 2.17.0
An open-source Python package that extends the power of Pandas library to AWS connecting DataFram...
81 versions - Latest release: over 1 year ago - 1 dependent repository - 3,363 stars on GitHub
pado 0.11.0
PAthological Data Obsession - cloud native digital pathology datasets
2 versions - Latest release: over 1 year ago - 8 stars on GitHub
db-dtypes 1.0.4
8 versions - Latest release: over 1 year ago - 4 dependent packages - 1 dependent repository - 12 stars on GitHub
kedro 0.18.3
A Python framework for creating reproducible, maintainable and modular data science code.
20 versions - Latest release: over 1 year ago - 3 dependent packages - 3 dependent repositories - 8,186 stars on GitHub
ibis-bigquery 2.2.0
BigQuery backend for Ibis
8 versions - Latest release: over 1 year ago - 16 stars on GitHub
activitysim 1.1.3
An Open Platform for Activity-Based Travel Modeling
5 versions - Latest release: over 1 year ago - 152 stars on GitHub
pyiem 1.14.0
pyIEM contains a wide collection of codes relevant for working with US National Weather Service d...
11 versions - Latest release: over 1 year ago - 1 dependent repository - 36 stars on GitHub
ibis-pyspark 3.2.0
6 versions - Latest release: over 1 year ago
Top 10.0% on conda-forge.org
ibis-framework 3.2.0
19 versions - Latest release: over 1 year ago - 8 dependent packages - 7 dependent repositories
ibis-dask 3.2.0
6 versions - Latest release: over 1 year ago
bertopic 0.12.0
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
4 versions - Latest release: over 1 year ago - 1 dependent package - 3,956 stars on GitHub
modin-omnisci 0.15.3
Modin: Scale your Pandas workflows by changing a single line of code
18 versions - Latest release: over 1 year ago - 2 dependent packages - 8,468 stars on GitHub
scmorph 0.1.0
1 version - Latest release: over 1 year ago
pymongoarrow 0.5.1
5 versions - Latest release: over 1 year ago
descarteslabs 1.11.0
Descartes Labs Python Client Library
40 versions - Latest release: over 1 year ago - 1 dependent repository - 153 stars on GitHub
dvc-hdfs 2.19.0
Data Version Control or DVC is an open-source tool for data science and machine learning projects.
67 versions - Latest release: over 1 year ago - 1 dependent package - 1 dependent repository - 11,242 stars on GitHub
pyfitparquet 1.0
The pyfitparquet package provides support for Garmin [FIT](https://developer.garmin.com/fit/overv...
1 version - Latest release: over 1 year ago - 1 star on GitHub
lenskit 0.14.2
LensKit is an open-source toolkit for building, researching, and learning about recommender systems.
9 versions - Latest release: over 1 year ago - 2 dependent repositories - 229 stars on GitHub
mapshader 0.1.3
Simple Python GIS Web Services
5 versions - Latest release: over 1 year ago - 34 stars on GitHub
stac-geoparquet 0.1.0
1 version - Latest release: over 1 year ago - 1 dependent repository
tables-io 0.7.9
Input/output and conversion interfaces for tabular data formats.
11 versions - Latest release: over 1 year ago - 1 dependent package - 1 star on GitHub
turbodbc 4.5.5
Turbodbc is a Python module to access relational databases via the Open Database Connectivity (OD...
27 versions - Latest release: over 1 year ago - 2 dependent packages - 2 dependent repositories - 566 stars on GitHub
pyarrow-tests 9.0.0
Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
14 versions - Latest release: over 1 year ago - 11,354 stars on GitHub
spatialpandas 0.4.4
Pandas extension arrays for spatial/geometric operations
7 versions - Latest release: almost 2 years ago - 3 dependent packages - 12 dependent repositories - 262 stars on GitHub
condastats 0.2.1
5 versions - Latest release: almost 2 years ago - 1 dependent repository - 5 stars on GitHub
vegafusion 0.9.0
13 versions - Latest release: almost 2 years ago - 1 dependent package
modin-omnisci 0.15.2
Modin: Scale your Pandas workflows by changing a single line of code
3 versions - Latest release: almost 2 years ago - 8,463 stars on GitHub
superset 2.0.0
Apache Superset is a Data Visualization and Data Exploration Platform
12 versions - Latest release: almost 2 years ago - 51,076 stars on GitHub
pyomniscidbe 5.10.1
OmniSciDB / HeavyDB is an in-memory, column store, SQL relational database that was designed from...
3 versions - Latest release: almost 2 years ago - 2,774 stars on GitHub
flytekit 1.1.0
Flytekit Python is the Python Library for easily authoring, testing, deploying, and interacting w...
4 versions - Latest release: almost 2 years ago - 8 dependent packages - 123 stars on GitHub
census-parquet 0.0.9
Python tools for creating Parquet files from 2020 Census Data
2 versions - Latest release: almost 2 years ago - 13 stars on GitHub
traffic 2.8.0
A toolbox for processing and analysing air traffic data
7 versions - Latest release: almost 2 years ago - 1 dependent repository - 273 stars on GitHub
geosnap 0.11.0
geosnap is an open-source, Python package for exploring, modeling, and visualizing neighborhood d...
17 versions - Latest release: almost 2 years ago - 1 dependent package - 1 dependent repository - 181 stars on GitHub
bigearthnet-gdf-builder 0.1.8
A package to generate and extend BigEarthNet GeoDataFrames
2 versions - Latest release: almost 2 years ago - 2 stars on GitHub
featurewiz 0.1.87
Use advanced feature engineering strategies and select best features from your data set with a si...
7 versions - Latest release: almost 2 years ago - 374 stars on GitHub
xbbg 0.7.7
An intuitive Bloomberg API
4 versions - Latest release: almost 2 years ago - 167 stars on GitHub
parquet-tools 0.2.11
easy install parquet-tools
6 versions - Latest release: almost 2 years ago - 1 dependent repository - 101 stars on GitHub
apache-airflow-providers-papermill 3.0.0
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
9 versions - Latest release: almost 2 years ago - 29,539 stars on GitHub