Ecosyste.ms: Packages
An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.
Top 1.5% on conda-forge.org
Top 0.5% dependent packages on conda-forge.org
Top 1.1% dependent repos on conda-forge.org
Top 2.1% forks on conda-forge.org
Top 0.5% dependent packages on conda-forge.org
Top 1.1% dependent repos on conda-forge.org
Top 2.1% forks on conda-forge.org
conda-forge.org : pyarrow
Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
Registry
-
Source
- Homepage
- JSON
purl: pkg:conda/pyarrow
Keywords: arrow
License: Apache-2.0
Latest release: over 1 year ago
First release: over 1 year ago
Dependent packages: 160
Dependent repositories: 605
Stars: 11,354 on GitHub
Forks: 2,795 on GitHub
See more repository details: repos.ecosyste.ms
Last synced: 14 days ago
snowflake-connector-python 3.10.0
The Snowflake Connector for Python provides an interface for developing Python applications that ...10 versions - Latest release: 2 days ago - 3 dependent packages - 4 dependent repositories - 424 stars on GitHub
modin-ray 0.28.1
Modin is a drop-in replacement for pandas. While pandas is single-threaded, Modin lets you instan...7 versions - Latest release: 13 days ago - 3 dependent packages - 8,463 stars on GitHub
Top 9.2% on anaconda.org
7 versions - Latest release: about 2 months ago - 13 dependent packages - 54 dependent repositories - 25,632 stars on GitHub
streamlit 1.32.0
Streamlit lets you turn data scripts into sharable web apps in minutes, not weeks. It's all Pytho...7 versions - Latest release: about 2 months ago - 13 dependent packages - 54 dependent repositories - 25,632 stars on GitHub
Top 9.0% on anaconda.org
16 versions - Latest release: 9 months ago - 3 dependent packages - 89 dependent repositories - 35,277 stars on GitHub
pyspark 3.4.1
Apache Spark is a fast and general engine for large-scale data processing.16 versions - Latest release: 9 months ago - 3 dependent packages - 89 dependent repositories - 35,277 stars on GitHub
datasets 2.12.0
Datasets is a lightweight library providing two main features: - one-line dataloaders for many p...4 versions - Latest release: 12 months ago - 4 dependent packages - 29 dependent repositories - 15,553 stars on GitHub
Top 6.9% on conda-forge.org
streamlit 1.15.0
51 versions - Latest release: over 1 year ago - 6 dependent packages - 54 dependent repositoriescoiled-runtime 0.1.1
Coiled Runtime is a metapackage that pins Dask, and other packages commonly used with Dask, to a ...4 versions - Latest release: over 1 year ago - 1 dependent repositories - 13 stars on GitHub
Top 4.1% on conda-forge.org
34 versions - Latest release: over 1 year ago - 13 dependent packages - 29 dependent repositories - 15,569 stars on GitHub
datasets 2.7.0
Datasets is a lightweight library providing one-line dataloaders for many public datasets and one...34 versions - Latest release: over 1 year ago - 13 dependent packages - 29 dependent repositories - 15,569 stars on GitHub
rubicon-ml 0.4.0
rubicon-ml is a machine learning solution designed to help standardize the model development life...33 versions - Latest release: over 1 year ago - 2 dependent repositories - 99 stars on GitHub
graphysio 2022.11.15
GraPhysio is a graphical time series visualizer created for biometric data signals from ICU patie...6 versions - Latest release: over 1 year ago - 5 stars on GitHub
Top 4.0% on conda-forge.org
43 versions - Latest release: over 1 year ago - 9 dependent packages - 40 dependent repositories - 13,891 stars on GitHub
mlflow 2.0.1
Open source platform for the machine learning lifecycle43 versions - Latest release: over 1 year ago - 9 dependent packages - 40 dependent repositories - 13,891 stars on GitHub
mage-ai 0.7.5
🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and t...22 versions - Latest release: over 1 year ago - 3,631 stars on GitHub
Top 5.1% on conda-forge.org
212 versions - Latest release: over 1 year ago - 10 dependent packages - 26 dependent repositories - 11,242 stars on GitHub
dvc 2.34.2
Data Version Control or DVC is an open-source tool for data science and machine learning projects.212 versions - Latest release: over 1 year ago - 10 dependent packages - 26 dependent repositories - 11,242 stars on GitHub
python-duckdb 0.6.0
DuckDB is an embedded database designed to execute analytical SQL queries fast while embedded in ...21 versions - Latest release: over 1 year ago - 6 dependent packages - 1 dependent repositories - 9,159 stars on GitHub
sqlalchemy-dremio 3.0.3
SQLAlchemy for Dremio via the ODBC and Flight interface.2 versions - Latest release: over 1 year ago - 19 stars on GitHub
modin-hdk 0.17.0
Modin: Scale your Pandas workflows by changing a single line of code4 versions - Latest release: over 1 year ago - 8,468 stars on GitHub
modin-ray 0.17.0
Modin: Scale your Pandas workflows by changing a single line of code22 versions - Latest release: over 1 year ago - 4 dependent packages - 8,468 stars on GitHub
dask-snowflake 0.2.0
Dask integration for Snowflake3 versions - Latest release: over 1 year ago - 1 dependent repositories - 22 stars on GitHub
feast 0.26.0
Feature Store for Machine Learning1 version - Latest release: over 1 year ago - 4,088 stars on GitHub
ploomber 0.21.7
The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️60 versions - Latest release: over 1 year ago - 2 dependent repositories - 3,017 stars on GitHub
rubin-env 5.0.0 💰
This metapackage exists to define the Rubin Observatory common software environment, including ve...30 versions - Latest release: over 1 year ago - 4 dependent packages - 0 stars on GitHub
rubin-env-nosysroot 5.0.0 💰
This metapackage exists to define the Rubin Observatory common software environment, including ve...16 versions - Latest release: over 1 year ago - 2 dependent packages - 0 stars on GitHub
rpy2-arrow 0.0.7
`rpy2-arrow` allows you to share the same arrow datasets between Python and R without copying the...2 versions - Latest release: over 1 year ago - 11 stars on GitHub
ray-data 2.0.1
Ray is a fast and simple framework for building and running distributed applications. It is split...10 versions - Latest release: over 1 year ago - 1 dependent package - 1 dependent repositories - 28,849 stars on GitHub
nannyml 0.7.0
Detecting silent model failure. NannyML estimates performance for regression and classification m...8 versions - Latest release: over 1 year ago - 1,451 stars on GitHub
coffea 0.7.20
Basic tools and wrappers for enabling not-too-alien syntax when running columnar Collider HEP ana...28 versions - Latest release: over 1 year ago - 11 dependent repositories - 106 stars on GitHub
pygeohydro 0.13.7
A part of HyRiver software stack for accessing hydrology data through web services19 versions - Latest release: over 1 year ago - 1 dependent package - 4 dependent repositories - 52 stars on GitHub
pynhd 0.13.7
PyNHD is a part of Hydrodata software stack that provides access to NHDPlus data though NLDI and ...23 versions - Latest release: over 1 year ago - 2 dependent packages - 3 dependent repositories - 23 stars on GitHub
google-cloud-bigquery 3.3.6
Python idiomatic client for Google BigQuery93 versions - Latest release: over 1 year ago - 16 dependent packages - 5 dependent repositories - 575 stars on GitHub
google-cloud-bigquery-core 3.3.6
google-cloud-bigquery-core the core client library for connecting to the BigQuery API. Supported...61 versions - Latest release: over 1 year ago - 5 dependent packages - 3 dependent repositories - 575 stars on GitHub
pandera 0.13.4 💰
A simple, zero-configuration Python library to help you build confidence in the quality of your d...30 versions - Latest release: over 1 year ago - 3 dependent packages - 9 dependent repositories - 2,116 stars on GitHub
pandera-base 0.13.4 💰
A simple, zero-configuration Python library to help you build confidence in the quality of your d...9 versions - Latest release: over 1 year ago - 13 dependent packages - 2,130 stars on GitHub
pandera-core 0.13.4 💰
A simple, zero-configuration Python library to help you build confidence in the quality of your d...20 versions - Latest release: over 1 year ago - 2,130 stars on GitHub
snowflake-connector-python 2.8.1
The Snowflake Connector for Python provides an interface for developing Python applications that ...58 versions - Latest release: over 1 year ago - 11 dependent packages - 4 dependent repositories - 425 stars on GitHub
stglib 0.5.0
This package contains code to process data from a variety of oceanographic instrumentation, consi...6 versions - Latest release: over 1 year ago - 6 stars on GitHub
woodwork 0.20.0
Woodwork is a Python library that provides robust methods for managing and communicating data typ...44 versions - Latest release: over 1 year ago - 5 dependent packages - 112 stars on GitHub
datapane 0.15.4
Create and publish interactive reports and apps in Python. Datapane is an open source framework w...39 versions - Latest release: over 1 year ago - 1 dependent package - 9 dependent repositories - 811 stars on GitHub
lsstdesc-gcr-catalogs 1.4.2
GCRCatalogs is a Python package that serves as a catalog repository for the Rubin LSST Dark Energ...13 versions - Latest release: over 1 year ago - 21 stars on GitHub
Top 3.0% on conda-forge.org
28 versions - Latest release: over 1 year ago - 28 dependent packages - 89 dependent repositories
pyspark 3.3.1
Apache Spark is a fast and general engine for large-scale data processing.28 versions - Latest release: over 1 year ago - 28 dependent packages - 89 dependent repositories
koopa-viz 0.0.1
1 version - Latest release: over 1 year ago - 0 stars on GitHubplateau 4.1.3
4 versions - Latest release: over 1 year agographistry 0.28.5
PyGraphistry is a Python library to quickly load, shape, embed, and explore big graphs with the G...16 versions - Latest release: over 1 year ago - 1,817 stars on GitHub
dremio-arrow 1.0.1
Dremio Arrow Flight Client1 version - Latest release: over 1 year ago - 2 stars on GitHub
mlflow-pipelines 1.30.0
Open source platform for the machine learning lifecycle3 versions - Latest release: over 1 year ago - 13,891 stars on GitHub
Top 9.3% on conda-forge.org
34 versions - Latest release: over 1 year ago - 8 dependent packages - 1 dependent repositories - 6,627 stars on GitHub
apache-beam 2.42.0
Apache Beam is an open source, unified model for defining both batch and streaming data-parallel ...34 versions - Latest release: over 1 year ago - 8 dependent packages - 1 dependent repositories - 6,627 stars on GitHub
sas7bdat-converter 1.2.0 💰
Converts proprietary sas7bdat files from SAS into formats such as csv and XML useable by other pr...1 version - Latest release: over 1 year ago - 14 stars on GitHub
triad 0.7.0
A collection of python utility functions16 versions - Latest release: over 1 year ago - 4 dependent packages - 8 stars on GitHub
tiled-dataframe 0.1.0a75
This is an experimental prototype.9 versions - Latest release: over 1 year ago - 1 dependent package - 31 stars on GitHub
tiled-sparse 0.1.0a75
This is an experimental prototype.7 versions - Latest release: over 1 year ago - 2 dependent packages - 31 stars on GitHub
delta-sharing-python 0.5.2
An open protocol for secure data sharing6 versions - Latest release: over 1 year ago - 539 stars on GitHub
vaex-core 4.14.0
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of ...54 versions - Latest release: over 1 year ago - 10 dependent packages - 1 dependent repositories - 7,837 stars on GitHub
pyhdk 0.2.0
oneHDK2 versions - Latest release: over 1 year ago - 1 dependent package - 16 stars on GitHub
larch 5.7.1
This is a tool for the estimation and application of logit-based discrete choice models. It is de...30 versions - Latest release: over 1 year ago - 35 stars on GitHub
pandas-gbq 0.17.9
Google BigQuery connector for pandas36 versions - Latest release: over 1 year ago - 3 dependent packages - 6 dependent repositories - 357 stars on GitHub
arrow_odbc 0.3.4 💰
Read Apache Arrow batches from ODBC data sources in Python5 versions - Latest release: over 1 year ago - 1 dependent package - 20 stars on GitHub
dask-ms 0.2.14
10 versions - Latest release: over 1 year ago - 1 dependent repositoriesfugue 0.7.3
Fugue is a unified interface for distributed computing that lets users execute Python, pandas, a...8 versions - Latest release: over 1 year ago - 4 dependent repositories - 1,271 stars on GitHub
xlntpyarrow 1.1.0
:bar_chart: Cross-platform user-friendly xlsx library for C++11+1 version - Latest release: over 1 year ago - 1,239 stars on GitHub
towhee 0.8.1
For win-64 users, you need to run `conda config --add channels 'pytorch'` first due to this [issu...10 versions - Latest release: over 1 year ago - 1,920 stars on GitHub
databricks-sql-connector 2.1.0
9 versions - Latest release: over 1 year ago - 1 dependent packagesharrow 2.4.0
numba for ActivitySim-style spec files12 versions - Latest release: over 1 year ago - 2 dependent packages - 1 stars on GitHub
mosartwmpy 0.4.4
Python translation of MOSART-WM: a water routing and management model11 versions - Latest release: over 1 year ago - 11 stars on GitHub
awswrangler 2.17.0
An open-source Python package that extends the power of Pandas library to AWS connecting DataFram...81 versions - Latest release: over 1 year ago - 1 dependent repositories - 3,363 stars on GitHub
pado 0.11.0
PAthological Data Obsession - cloud native digital pathology datasets2 versions - Latest release: over 1 year ago - 8 stars on GitHub
db-dtypes 1.0.4
8 versions - Latest release: over 1 year ago - 4 dependent packages - 1 dependent repositories - 12 stars on GitHubkedro 0.18.3
A Python framework for creating reproducible, maintainable and modular data science code.20 versions - Latest release: over 1 year ago - 3 dependent packages - 3 dependent repositories - 8,186 stars on GitHub
ibis-bigquery 2.2.0
BigQuery backend for Ibis8 versions - Latest release: over 1 year ago - 16 stars on GitHub
activitysim 1.1.3
An Open Platform for Activity-Based Travel Modeling5 versions - Latest release: over 1 year ago - 152 stars on GitHub
pyiem 1.14.0
pyIEM contains a wide collection of codes relevant for working with US National Weather Service d...11 versions - Latest release: over 1 year ago - 1 dependent repositories - 36 stars on GitHub
ibis-pyspark 3.2.0
6 versions - Latest release: over 1 year ago
Top 10.0% on conda-forge.org
ibis-framework 3.2.0
19 versions - Latest release: over 1 year ago - 8 dependent packages - 7 dependent repositoriesibis-dask 3.2.0
6 versions - Latest release: over 1 year agobertopic 0.12.0
Leveraging BERT and c-TF-IDF to create easily interpretable topics.4 versions - Latest release: over 1 year ago - 1 dependent package - 3,956 stars on GitHub
modin-omnisci 0.15.3
Modin: Scale your Pandas workflows by changing a single line of code18 versions - Latest release: over 1 year ago - 2 dependent packages - 8,468 stars on GitHub
scmorph 0.1.0
1 version - Latest release: over 1 year agopymongoarrow 0.5.1
5 versions - Latest release: over 1 year agodescarteslabs 1.11.0
Descartes Labs Python Client Library40 versions - Latest release: over 1 year ago - 1 dependent repositories - 153 stars on GitHub
dvc-hdfs 2.19.0
Data Version Control or DVC is an open-source tool for data science and machine learning projects.67 versions - Latest release: over 1 year ago - 1 dependent package - 1 dependent repositories - 11,242 stars on GitHub
pyfitparquet 1.0
The pyfitparquet package provides support for Garmin [FIT](https://developer.garmin.com/fit/overv...1 version - Latest release: over 1 year ago - 1 stars on GitHub
lenskit 0.14.2
LensKit is an open-source toolkit for building, researching, and learning about recommender systems.9 versions - Latest release: over 1 year ago - 2 dependent repositories - 229 stars on GitHub
mapshader 0.1.3
Simple Python GIS Web Services5 versions - Latest release: over 1 year ago - 34 stars on GitHub
stac-geoparquet 0.1.0
1 version - Latest release: over 1 year ago - 1 dependent repositoriestables-io 0.7.9
Input/output and conversion interfaces for tabular data formats.11 versions - Latest release: over 1 year ago - 1 dependent package - 1 stars on GitHub
turbodbc 4.5.5
Turbodbc is a Python module to access relational databases via the Open Database Connectivity (OD...27 versions - Latest release: over 1 year ago - 2 dependent packages - 2 dependent repositories - 566 stars on GitHub
pyarrow-tests 9.0.0
Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing14 versions - Latest release: over 1 year ago - 11,354 stars on GitHub
spatialpandas 0.4.4
Pandas extension arrays for spatial/geometric operations7 versions - Latest release: almost 2 years ago - 3 dependent packages - 12 dependent repositories - 262 stars on GitHub
condastats 0.2.1
5 versions - Latest release: almost 2 years ago - 1 dependent repositories - 5 stars on GitHubvegafusion 0.9.0
13 versions - Latest release: almost 2 years ago - 1 dependent packagemodin-omnisci 0.15.2
Modin: Scale your Pandas workflows by changing a single line of code3 versions - Latest release: almost 2 years ago - 8,463 stars on GitHub
superset 2.0.0
Apache Superset is a Data Visualization and Data Exploration Platform12 versions - Latest release: almost 2 years ago - 51,076 stars on GitHub
pyomniscidbe 5.10.1
OmniSciDB / HeavyDB is an in-memory, column store, SQL relational database that was designed from...3 versions - Latest release: almost 2 years ago - 2,774 stars on GitHub
flytekit 1.1.0
Flytekit Python is the Python Library for easily authoring, testing, deploying, and interacting w...4 versions - Latest release: almost 2 years ago - 8 dependent packages - 123 stars on GitHub
census-parquet 0.0.9
Python tools for creating Parquet files from 2020 Census Data2 versions - Latest release: almost 2 years ago - 13 stars on GitHub
traffic 2.8.0
A toolbox for processing and analysing air traffic data7 versions - Latest release: almost 2 years ago - 1 dependent repositories - 273 stars on GitHub
geosnap 0.11.0
geosnap is an open-source, Python package for exploring, modeling, and visualizing neighborhood d...17 versions - Latest release: almost 2 years ago - 1 dependent package - 1 dependent repositories - 181 stars on GitHub
bigearthnet-gdf-builder 0.1.8
A package to generate and extend BigEarthNet GeoDataFrame's2 versions - Latest release: almost 2 years ago - 2 stars on GitHub
featurewiz 0.1.87
Use advanced feature engineering strategies and select best features from your data set with a si...7 versions - Latest release: almost 2 years ago - 374 stars on GitHub
xbbg 0.7.7
An intuitive Bloomberg API4 versions - Latest release: almost 2 years ago - 167 stars on GitHub
parquet-tools 0.2.11
easy install parquet-tools6 versions - Latest release: almost 2 years ago - 1 dependent repositories - 101 stars on GitHub
apache-airflow-providers-papermill 3.0.0
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows9 versions - Latest release: almost 2 years ago - 29,539 stars on GitHub