Ecosyste.ms: Packages
An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.
pypi.org "pyarrow" keyword
pandas-pyarrow 0.1.6 💰
A library for switching pandas backend to pyarrow7 versions - Latest release: 30 days ago - 1.21 thousand downloads last month - 2 stars on GitHub - 2 maintainers
Top 1.3% on pypi.org
101 versions - Latest release: 10 months ago - 15 dependent packages - 64 dependent repositories - 67 thousand downloads last month - 8,171 stars on GitHub - 1 maintainer
vaex-core 4.17.1
Core of vaex101 versions - Latest release: 10 months ago - 15 dependent packages - 64 dependent repositories - 67 thousand downloads last month - 8,171 stars on GitHub - 1 maintainer
Top 1.5% on pypi.org
23 versions - Latest release: 10 months ago - 4 dependent packages - 50 dependent repositories - 20.4 thousand downloads last month - 8,171 stars on GitHub - 1 maintainer
vaex-jupyter 0.8.2
Jupyter notebook and Jupyter lab support for vaex23 versions - Latest release: 10 months ago - 4 dependent packages - 50 dependent repositories - 20.4 thousand downloads last month - 8,171 stars on GitHub - 1 maintainer
Top 1.5% on pypi.org
21 versions - Latest release: 10 months ago - 3 dependent packages - 51 dependent repositories - 20.4 thousand downloads last month - 8,171 stars on GitHub - 2 maintainers
vaex-server 0.9.0
Webserver and client for vaex for a remote dataset21 versions - Latest release: 10 months ago - 3 dependent packages - 51 dependent repositories - 20.4 thousand downloads last month - 8,171 stars on GitHub - 2 maintainers
turntable-spoonbill 9.0.5
Productivity-centric Python Big Data Framework4 versions - Latest release: about 1 month ago - 118 downloads last month - 4,266 stars on GitHub - 2 maintainers
Top 1.4% on pypi.org
90 versions - Latest release: 11 days ago - 13 dependent packages - 130 dependent repositories - 195 thousand downloads last month - 3,333 stars on GitHub - 6 maintainers
ibis-framework 9.0.0
The portable Python dataframe library90 versions - Latest release: 11 days ago - 13 dependent packages - 130 dependent repositories - 195 thousand downloads last month - 3,333 stars on GitHub - 6 maintainers
swemaps 0.1.0
Sweden in GeoParquet for easy usage.1 version - Latest release: 4 days ago - 127 downloads last month - 0 stars on GitHub - 2 maintainers
Top 1.4% on pypi.org
58 versions - Latest release: 10 months ago - 20 dependent packages - 90 dependent repositories - 21.6 thousand downloads last month - 8,171 stars on GitHub - 1 maintainer
vaex 4.17.0
Out-of-Core DataFrames to visualize and explore big tabular datasets58 versions - Latest release: 10 months ago - 20 dependent packages - 90 dependent repositories - 21.6 thousand downloads last month - 8,171 stars on GitHub - 1 maintainer
dremio-arrow 1.0.3
Dremio SQL Lakehouse Arrow Flight Client.6 versions - Latest release: 10 months ago - 339 downloads last month - 4 stars on GitHub - 2 maintainers
parquet-csv 0.2.0
Parquet from and to CSV format converter5 versions - Latest release: almost 3 years ago - 1 dependent repositories - 59 downloads last month - 2 stars on GitHub - 2 maintainers
sqldatamodel 0.4.3
SQLDataModel is a lightweight dataframe library designed for efficient data extraction, transform...43 versions - Latest release: 4 days ago - 1.09 thousand downloads last month - 6 stars on GitHub - 1 maintainer
mongo2file 1.0.1
↻ 一个用于 mongodb 数据库转换为各类文件格式的库1 version - Latest release: about 2 years ago - 1 dependent repositories - 9 downloads last month - 4 stars on GitHub - 2 maintainers
mysql2file 1.0.3
↻ 一个 Mysql 数据库转换为表格文件的库1 version - Latest release: about 2 years ago - 1 dependent repositories - 12 downloads last month - 1 stars on GitHub - 2 maintainers
Top 2.3% on pypi.org
86 versions - Latest release: over 1 year ago - 4 dependent packages - 26 dependent repositories - 36.9 thousand downloads last month - 1,752 stars on GitHub - 2 maintainers
petastorm 0.12.1
Petastorm is a library enabling the use of Parquet storage from Tensorflow, Pytorch, and other Py...86 versions - Latest release: over 1 year ago - 4 dependent packages - 26 dependent repositories - 36.9 thousand downloads last month - 1,752 stars on GitHub - 2 maintainers
firespark 0.0.32
FireSpark data processing utility library16 versions - Latest release: almost 4 years ago - 1 dependent repositories - 140 downloads last month - 1,752 stars on GitHub - 2 maintainers
hops-petastorm 0.9.4
Petastorm is a library enabling the use of Parquet storage from Tensorflow, Pytorch, and other Py...6 versions - Latest release: over 3 years ago - 1 dependent repositories - 93 downloads last month - 1,752 stars on GitHub - 3 maintainers
schemarrow 0.1.1a0 💰
A library for switching pandas backend to pyarrow2 versions - Latest release: 2 months ago - 19 downloads last month - 2 stars on GitHub - 2 maintainers
condition 1.0.8
A user friendly way to construct conditions for pandas dataframe query and sql9 versions - Latest release: about 3 years ago - 5 dependent repositories - 171 downloads last month - 0 stars on GitLab.com - 2 maintainers
rxls 0.2.2
Reading both XLSX and XLSB files, fast and memory-safe, into PyArrow.2 versions - Latest release: 3 months ago - 72 downloads last month - 9 stars on GitHub - 2 maintainers
Top 1.5% on pypi.org
36 versions - Latest release: over 1 year ago - 4 dependent packages - 58 dependent repositories - 25.1 thousand downloads last month - 8,171 stars on GitHub - 1 maintainer
vaex-hdf5 0.14.1
hdf5 file support for vaex36 versions - Latest release: over 1 year ago - 4 dependent packages - 58 dependent repositories - 25.1 thousand downloads last month - 8,171 stars on GitHub - 1 maintainer
featherplot 0.0.6
featherplot6 versions - Latest release: 9 months ago - 4 downloads last month - 0 stars on GitHub - 1 maintainer
Top 4.3% on pypi.org
12 versions - Latest release: almost 4 years ago - 1 dependent package - 17 dependent repositories - 371 downloads last month - 8,171 stars on GitHub - 2 maintainers
vaex-arrow 0.5.1
Arrow support for vaex12 versions - Latest release: almost 4 years ago - 1 dependent package - 17 dependent repositories - 371 downloads last month - 8,171 stars on GitHub - 2 maintainers
Top 1.5% on pypi.org
24 versions - Latest release: over 1 year ago - 3 dependent packages - 54 dependent repositories - 21.7 thousand downloads last month - 8,171 stars on GitHub - 1 maintainer
vaex-viz 0.5.4
Visualization for vaex24 versions - Latest release: over 1 year ago - 3 dependent packages - 54 dependent repositories - 21.7 thousand downloads last month - 8,171 stars on GitHub - 1 maintainer
Top 1.6% on pypi.org
18 versions - Latest release: over 1 year ago - 2 dependent packages - 51 dependent repositories - 21 thousand downloads last month - 8,171 stars on GitHub - 1 maintainer
vaex-astro 0.9.3
Astronomy related transformations and FITS file support18 versions - Latest release: over 1 year ago - 2 dependent packages - 51 dependent repositories - 21 thousand downloads last month - 8,171 stars on GitHub - 1 maintainer
pyarrowfs-adlgen2 0.2.4
Use pyarrow with Azure Data Lake gen210 versions - Latest release: about 1 year ago - 2 dependent repositories - 136 thousand downloads last month - 23 stars on GitHub - 2 maintainers
Top 1.6% on pypi.org
34 versions - Latest release: 10 months ago - 2 dependent packages - 47 dependent repositories - 20.7 thousand downloads last month - 8,171 stars on GitHub - 1 maintainer
vaex-ml 0.18.3
Machine learning support for vaex34 versions - Latest release: 10 months ago - 2 dependent packages - 47 dependent repositories - 20.7 thousand downloads last month - 8,171 stars on GitHub - 1 maintainer
procmondf 0.10
provides a convenient and efficient solution for capturing and analyzing system activity logs usi...1 version - Latest release: 11 months ago - 21 downloads last month - 0 stars on GitHub - 2 maintainers
biobear 0.20.0 💰
A package for working with Bioinformatics data with SQL and Arrow49 versions - Latest release: 16 days ago - 1 dependent package - 3.42 thousand downloads last month - 122 stars on GitHub - 2 maintainers
pdf2dataset 0.5.3
Easily convert a subdirectory with big volume of PDF documents into a dataset, supports extractin...15 versions - Latest release: over 3 years ago - 1 dependent repositories - 158 downloads last month - 17 stars on GitHub - 2 maintainers
thor-mlops 0.0.19
Amazing ML ops for preprocessing, feature storage and inference19 versions - Latest release: over 1 year ago - 60 downloads last month - 0 stars on GitHub - 1 maintainer
wombat-db 0.0.18
Useful data crunching tools for pyarrow18 versions - Latest release: over 2 years ago - 1 dependent repositories - 108 downloads last month - 8 stars on GitHub - 2 maintainers
Top 6.2% on pypi.org
3 versions - Latest release: about 3 years ago - 9 dependent repositories - 171 downloads last month - 8,171 stars on GitHub - 1 maintainer
vaex-graphql 0.2.0
GraphQL support for accessing vaex DataFrame3 versions - Latest release: about 3 years ago - 9 dependent repositories - 171 downloads last month - 8,171 stars on GitHub - 1 maintainer
vaex-distributed 0.3.0
Distributed dataset for vaex3 versions - Latest release: about 5 years ago - 1 dependent repositories - 18 downloads last month - 8,171 stars on GitHub - 2 maintainers
feco3 0.5.0
A Rust-backed Python library for parsing .fec files.2 versions - Latest release: 9 months ago - 197 downloads last month - 2 stars on GitHub - 2 maintainers
Top 9.6% on pypi.org
7 versions - Latest release: over 4 years ago - 3 dependent repositories - 39 downloads last month - 8,171 stars on GitHub - 2 maintainers
vaex-ui 0.3.0
Graphical user interface for vaex based on Qt7 versions - Latest release: over 4 years ago - 3 dependent repositories - 39 downloads last month - 8,171 stars on GitHub - 2 maintainers
lakeshack 0.2.3
Query parquet files using pyarrow or S3 Select by first gathering file metadata into a database5 versions - Latest release: 6 months ago - 26 downloads last month - 1 stars on GitHub - 2 maintainers
flaco 0.6.0
(PoC) A very memory-efficient way to read data from PostgreSQL16 versions - Latest release: over 1 year ago - 1 dependent repositories - 1.18 thousand downloads last month - 14 stars on GitHub - 2 maintainers
db2ixf 0.16.1
Parsing and processing of IBM eXchange format (IXF)36 versions - Latest release: 2 months ago - 139 downloads last month - 14 stars on GitHub - 2 maintainers
arff-format-converter 1.0.3
Converts ARFF files to CSV, JSON, XML, XLSX, and ORC10 versions - Latest release: 3 months ago - 32 downloads last month - 1 stars on GitHub - 2 maintainers
pyarrow-ops 0.0.8
Useful data crunching tools for pyarrow8 versions - Latest release: about 3 years ago - 1 dependent repositories - 4.57 thousand downloads last month - 41 stars on GitHub - 2 maintainers
featherhelper 0.0.5
Feather Helper is a concise interface to cache numpy arrays and pandas dataframes.2 versions - Latest release: over 5 years ago - 1 dependent repositories - 11 downloads last month - 3 stars on GitHub - 2 maintainers
vaex-contrib 0.1.3
Community contributed modules to vaex3 versions - Latest release: over 1 year ago - 1 dependent repositories - 17 downloads last month - 8,171 stars on GitHub - 1 maintainer
Related Keywords
python
26
dataframe
19
machine-learning
18
data-science
17
visualization
13
tabular-data
13
memory-mapped-file
13
machinelearning
13
hdf5
13
bigdata
13
parquet
11
arrow
10
pandas
8
csv
6
pyspark
5
excel
4
json
4
sql
4
polars
4
postgresql
3
mysql
3
python3
3
deep-learning
3
parquet-files
3
pytorch
3
sysml
3
tensorflow
3
duckdb
3
bigquery
3
pandas-dataframe
3
clickhouse
2
data-analysis
2
data-conversion
2
processing
2
pandas-pyarrow
2
pandas-arrow
2
backend
2
db-dtypes
2
dtypes
2
parser
2
pandas-tricks-for-data-manipulation
2
format-conversion
2
rust
2
data
2
mssql
2
trino
2
dask
2
impala
2
datafusion
2
snowflake
2
database
2
sqlite
2
sqlalchemy
2
microsoft
1
geoarrow
1
pdf
1
pdf2image
1
pdftotext
1
pytesseract
1
pytesseract-ocr
1
ray
1
tesseract
1
tesseract-ocr
1
ml
1
preprocessing
1
procmon
1
DataFrame
1
logging
1
windows
1
bioinformatics
1
biology
1
biopython
1
rust-bio
1
samtools
1
filesystem
1
datalake
1
distributed-computing
1
distributed-systems
1
ocr
1
azure
1
parallel
1
cell
1
single
1
anndata
1
deltalake
1
ibm
1
ibm-cloud
1
ixf
1
jsonlines
1
parsing
1
parsing-library
1
arff
1
data-interchange
1
data-preprocessing
1
data-transformation
1
file-format-conversion
1
python-package
1
xml
1
orc
1
data-manipulation
1