Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "big-data" keyword

opteryx 0.14.1 ๐Ÿ’ฐ
Python SQL Query Engine
229 versions - Latest release: about 1 month ago - 1 dependent repositories - 13.3 thousand downloads last month - 43 stars on GitHub - 1 maintainer
Top 1.0% on pypi.org
feast 0.37.1
Python SDK for Feast
118 versions - Latest release: about 1 month ago - 13 dependent packages - 140 dependent repositories - 333 thousand downloads last month - 5,027 stars on GitHub - 5 maintainers
Top 1.8% on pypi.org
phoenixdb 1.2.1
Phoenix database adapter for Python
12 versions - Latest release: over 1 year ago - 2 dependent packages - 125 dependent repositories - 16.3 thousand downloads last month - 45 stars on GitHub - 4 maintainers
covsirphy 3.1.1 ๐Ÿ’ฐ
COVID-19 data analysis with phase-dependent SIR-derived ODE models
59 versions - Latest release: 4 months ago - 1 dependent repositories - 235 downloads last month - 101 stars on GitHub - 1 maintainer
sqlalchemy-risingwave 1.1.0
RisingWave dialect for SQLAlchemy
11 versions - Latest release: about 1 month ago - 1 dependent package - 2 dependent repositories - 15 thousand downloads last month - 6,392 stars on GitHub - 1 maintainer
Top 2.4% on pypi.org
apache-bookkeeper-client 4.16.5
Apache BookKeeper client library
31 versions - Latest release: about 1 month ago - 2 dependent packages - 5 dependent repositories - 11.1 thousand downloads last month - 1,860 stars on GitHub - 10 maintainers
Top 1.2% on pypi.org
koalas 1.8.2
Koalas: pandas API on Apache Spark
47 versions - Latest release: over 2 years ago - 11 dependent packages - 444 dependent repositories - 2.24 million downloads last month - 3,308 stars on GitHub - 7 maintainers
Top 1.7% on pypi.org
databricks-connect 14.3.2
Databricks Connect Client
211 versions - Latest release: 11 days ago - 15 dependent packages - 64 dependent repositories - 1.02 million downloads last month - 37,738 stars on GitHub - 19 maintainers
Top 6.3% on pypi.org
zvt 0.10.5 ๐Ÿ’ฐ
unified,modular quant framework for human beings
68 versions - Latest release: over 1 year ago - 7 dependent repositories - 713 downloads last month - 2,972 stars on GitHub - 1 maintainer
Top 4.2% on pypi.org
getdaft 0.2.24
Distributed Dataframes for Multimodal Data
71 versions - Latest release: 12 days ago - 5 dependent packages - 3 dependent repositories - 13.9 thousand downloads last month - 1,761 stars on GitHub - 1 maintainer
Top 0.9% on pypi.org
delta-spark 3.2.0
Python APIs for using Delta Lake with Apache Spark
19 versions - Latest release: 10 days ago - 38 dependent packages - 90 dependent repositories - 11.2 million downloads last month - 6,958 stars on GitHub - 6 maintainers
Top 8.0% on pypi.org
verticapy 1.0.3
VerticaPy simplifies data exploration, data cleaning, and machine learning in Vertica.
28 versions - Latest release: about 2 months ago - 1 dependent repositories - 5.88 thousand downloads last month - 209 stars on GitHub - 3 maintainers
fooltrader 0.0.1a1 ๐Ÿ’ฐ
Open source quantitative framework for Humans
1 version - Latest release: about 6 years ago - 1 dependent repositories - 15 downloads last month - 1,126 stars on GitHub - 1 maintainer
catboost-dev 0.26.1
Catboost Python Package
413 versions - Latest release: almost 3 years ago - 1 dependent repositories - 21.6 thousand downloads last month - 7,776 stars on GitHub - 1 maintainer
Top 0.3% on pypi.org
catboost 1.2.5
CatBoost Python Package
107 versions - Latest release: about 1 month ago - 203 dependent packages - 3,035 dependent repositories - 2.02 million downloads last month - 7,530 stars on GitHub - 5 maintainers
zvtm 0.0.11
unified,modular quant framework for mysql
10 versions - Latest release: about 2 years ago - 1 dependent repositories - 100 downloads last month - 5 stars on GitHub - 1 maintainer
Top 0.7% on pypi.org
nipype 1.8.6
Neuroimaging in Python: Pipelines and Interfaces
64 versions - Latest release: about 1 year ago - 45 dependent packages - 1,107 dependent repositories - 162 thousand downloads last month - 733 stars on GitHub - 4 maintainers
Top 0.4% on pypi.org
cython 3.0.10 ๐Ÿ’ฐ
The Cython compiler for writing C extensions in the Python language.
139 versions - Latest release: about 2 months ago - 1,019 dependent packages - 18,920 dependent repositories - 45.9 million downloads last month - 8,978 stars on GitHub - 3 maintainers
miniff 0.1.4 ๐Ÿ’ฐ
A minimal implementation of force fields
4 versions - Latest release: almost 3 years ago - 1 dependent repositories - 33 downloads last month - 8,967 stars on GitHub - 2 maintainers
Top 2.2% on pypi.org
uproot 5.3.7
ROOT I/O in pure Python and NumPy.
305 versions - Latest release: 10 days ago - 83 dependent packages - 240 dependent repositories - 137 thousand downloads last month - 220 stars on GitHub - 2 maintainers
Top 3.6% on pypi.org
hazelcast-python-client 5.3.0
Hazelcast Python Client
26 versions - Latest release: 11 months ago - 3 dependent packages - 17 dependent repositories - 36 thousand downloads last month - 112 stars on GitHub - 1 maintainer
feastmo 0.341.0
Python SDK for Feast
8 versions - Latest release: 6 months ago - 30 downloads last month - 4,988 stars on GitHub - 1 maintainer
mister 0.0.2
Approachable map/reduce jobs
2 versions - Latest release: over 5 years ago - 1 dependent repositories - 23 downloads last month - 0 stars on GitHub - 1 maintainer
Top 3.9% on pypi.org
richdem 0.3.4
High-Performance Terrain Analysis
18 versions - Latest release: almost 6 years ago - 7 dependent packages - 25 dependent repositories - 2.58 thousand downloads last month - 241 stars on GitHub - 1 maintainer
eoxserver 1.3.0
EOxServer is a server for Earth Observation (EO) data
108 versions - Latest release: 8 months ago - 1 dependent repositories - 964 downloads last month - 40 stars on GitHub - 3 maintainers
Top 7.5% on pypi.org
merlin 1.12.1
The building blocks of workflows!
35 versions - Latest release: about 1 month ago - 8 dependent repositories - 1.66 thousand downloads last month - 115 stars on GitHub - 4 maintainers
merlinwf 2.0.1
The 'merlinwf' package has been deprecated and replaced by the 'merlin' package.
17 versions - Latest release: about 3 years ago - 1 dependent repositories - 178 downloads last month - 115 stars on GitHub - 4 maintainers
jagular 0.0.2
Out-of-core pre-processing of big-ish electrophysiology data.
2 versions - Latest release: about 2 years ago - 1 dependent repositories - 23 downloads last month - 3 stars on GitHub - 1 maintainer
Top 5.2% on pypi.org
graphscope 0.27.0
๐Ÿ”จ ๐Ÿ‡ ๐Ÿ’ป ๐Ÿš€ GraphScope: A One-Stop Large-Scale Graph Computing System from Alibaba | ไธ€็ซ™ๅผๅ›พ่ฎก็ฎ—็ณป็ปŸ
574 versions - Latest release: about 2 months ago - 1 dependent package - 2 dependent repositories - 2.39 thousand downloads last month - 3,088 stars on GitHub - 3 maintainers
pramen-py 1.8.8
Pramen transformations written in python
30 versions - Latest release: 3 days ago - 451 downloads last month - 22 stars on GitHub - 3 maintainers
flowcept 0.2.10
FlowCept is a runtime data integration system that empowers any data processing system to capture...
45 versions - Latest release: 3 months ago - 1 dependent package - 406 downloads last month - 1 stars on GitHub - 1 maintainer
Top 6.5% on pypi.org
selinon 1.3.0
an advanced dynamic task flow management on top of Celery
19 versions - Latest release: over 1 year ago - 15 dependent repositories - 837 downloads last month - 294 stars on GitHub - 2 maintainers
alluxio-python-library 2.0.1
Alluxio Python library 1.0.0 provides API to interact with Alluxio servers.
3 versions - Latest release: 4 months ago - 22 downloads last month - 24 stars on GitHub - 2 maintainers
Top 6.9% on pypi.org
alluxio 1.0.0
Alluxio Python library 1.0.0 provides API to interact with Alluxio servers.
5 versions - Latest release: 3 months ago - 1 dependent package - 77 dependent repositories - 3.35 thousand downloads last month - 24 stars on GitHub - 3 maintainers
Top 3.3% on pypi.org
synapseml 1.0.4
Synapse Machine Learning
16 versions - Latest release: about 1 month ago - 2 dependent packages - 3 dependent repositories - 233 thousand downloads last month - 4,981 stars on GitHub - 1 maintainer
Top 9.2% on pypi.org
keyvi 0.6.0
Python package for keyvi
30 versions - Latest release: about 1 month ago - 1 dependent repositories - 14.1 thousand downloads last month - 235 stars on GitHub - 3 maintainers
decentralized-internet 4.3.5 ๐Ÿ’ฐ
A library for creating distributed web and grid projects
3 versions - Latest release: about 4 years ago - 1 dependent repositories - 32 downloads last month - 488 stars on GitHub - 1 maintainer
jupyterlab-pachyderm 2.10.0
A JupyterLab extension.
118 versions - Latest release: 4 days ago - 1 dependent repositories - 1.27 thousand downloads last month - 6,070 stars on GitHub - 1 maintainer
Top 4.0% on pypi.org
pachyderm-sdk 2.10.0
Python Pachyderm Client
49 versions - Latest release: 4 days ago - 1 dependent package - 3 dependent repositories - 4.34 thousand downloads last month - 6,027 stars on GitHub - 1 maintainer
Top 6.4% on pypi.org
arcticdb 4.4.2
ArcticDB DataFrame Database
52 versions - Latest release: 12 days ago - 5 dependent packages - 1 dependent repositories - 22.4 thousand downloads last month - 1,108 stars on GitHub - 6 maintainers
django-mass-migration 0.2.9
Django app for long-running data migrations
16 versions - Latest release: 4 months ago - 1.1 thousand downloads last month - 2 stars on GitHub - 1 maintainer
grizzlys 0.0.1
Python DataFrames powered by Julia
2 versions - Latest release: about 1 month ago - 93 downloads last month - 0 stars on GitHub - 1 maintainer
scannerpy 0.2.1
Efficient video analysis at scale
2 versions - Latest release: almost 6 years ago - 1 dependent repositories - 13 downloads last month - 607 stars on GitHub - 2 maintainers
Top 2.7% on pypi.org
uproot3 3.14.4
ROOT I/O in pure Python and Numpy.
5 versions - Latest release: over 3 years ago - 15 dependent packages - 30 dependent repositories - 14.5 thousand downloads last month - 314 stars on GitHub - 1 maintainer
pyspark-data-sources 0.1.2
Custom Spark data sources for reading and writing data in Apache Spark, using the Python Data Sou...
3 versions - Latest release: 3 months ago - 47 downloads last month - 38,255 stars on GitHub - 1 maintainer
Top 9.3% on pypi.org
tasklogger 1.2.0
tasklogger
13 versions - Latest release: almost 2 years ago - 6 dependent packages - 22 dependent repositories - 4.94 thousand downloads last month - 2 stars on GitHub - 1 maintainer
h2o-mlflow-flavor 0.1.0
A mlflow flavor for working with H2O-3 MOJO and POJO models
1 version - Latest release: 6 months ago - 39 downloads last month - 6,710 stars on GitHub - 1 maintainer
Top 4.0% on pypi.org
scprep 1.2.3
scprep
43 versions - Latest release: 11 months ago - 19 dependent packages - 34 dependent repositories - 16.1 thousand downloads last month - 69 stars on GitHub - 1 maintainer
Top 1.5% on pypi.org
scikit-learn-intelex 2024.4.0
Intel(R) Extension for Scikit-learn is a seamless way to speed up your Scikit-learn application.
27 versions - Latest release: 6 days ago - 18 dependent packages - 615 dependent repositories - 120 thousand downloads last month - 1,152 stars on GitHub - 2 maintainers
Top 2.7% on pypi.org
redislite 6.2.912183
Redis built into a python package
64 versions - Latest release: 6 months ago - 10 dependent packages - 106 dependent repositories - 91.2 thousand downloads last month - 561 stars on GitHub - 3 maintainers
Top 4.4% on pypi.org
nflx-genie-client 3.6.17
Genie Python Client.
102 versions - Latest release: 10 months ago - 2 dependent repositories - 80.4 thousand downloads last month - 1,679 stars on GitHub - 3 maintainers
Top 0.1% on pypi.org
pyspark 3.5.1
Apache Spark Python API
44 versions - Latest release: 3 months ago - 588 dependent packages - 6,227 dependent repositories - 29 million downloads last month - 38,255 stars on GitHub - 1 maintainer
Top 3.5% on pypi.org
magic-impute 3.0.0
MAGIC
15 versions - Latest release: about 3 years ago - 13 dependent packages - 22 dependent repositories - 2.41 thousand downloads last month - 318 stars on GitHub - 1 maintainer
dislib 0.9.0
The distributed computing library on top of PyCOMPSs
19 versions - Latest release: 6 months ago - 1 dependent repositories - 721 downloads last month - 43 stars on GitHub - 2 maintainers
Top 2.9% on pypi.org
phate 1.0.11
PHATE
36 versions - Latest release: 11 months ago - 18 dependent packages - 33 dependent repositories - 2.73 thousand downloads last month - 443 stars on GitHub - 1 maintainer
Top 8.1% on pypi.org
km3io 1.1.0
"KM3NeT I/O library without ROOT"
67 versions - Latest release: 2 months ago - 2 dependent packages - 2 dependent repositories - 789 downloads last month - 314 stars on GitHub - 2 maintainers
dbt-dataops-starrocks 1.4.3
The starrocks adapter plugin for dbt
3 versions - Latest release: about 1 year ago - 51 downloads last month - 6,779 stars on GitHub - 1 maintainer
apache-iotdb-nightly 0.11.2.20210408
Apache IoTDB client API
3 versions - Latest release: about 3 years ago - 1 dependent repositories - 62 downloads last month - 4,248 stars on GitHub - 1 maintainer
Top 5.4% on pypi.org
apache-iotdb 1.3.0
Apache IoTDB client API
32 versions - Latest release: 5 months ago - 3 dependent repositories - 1.02 thousand downloads last month - 4,248 stars on GitHub - 6 maintainers
starrocks 1.0.6
Python SQLAlchemy Dialect for StarRocks
3 versions - Latest release: 3 months ago - 13.1 thousand downloads last month - 6,779 stars on GitHub - 2 maintainers
pyspark-connectby 1.1.3
connectby hierarchy query in spark
6 versions - Latest release: 2 months ago - 656 downloads last month - 38,255 stars on GitHub - 1 maintainer
bartbroere-eland 8.13.1
[Development fork!] Python Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL ...
7 versions - Latest release: about 2 months ago - 45 downloads last month - 614 stars on GitHub - 1 maintainer
Top 3.3% on pypi.org
eland 8.13.1
Python Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch
31 versions - Latest release: 17 days ago - 2 dependent packages - 22 dependent repositories - 16.2 thousand downloads last month - 614 stars on GitHub - 3 maintainers
Top 0.7% on pypi.org
h2o 3.46.0.2
H2O, Fast Scalable Machine Learning, for python
113 versions - Latest release: 6 days ago - 14 dependent packages - 393 dependent repositories - 323 thousand downloads last month - 6,710 stars on GitHub - 2 maintainers
Top 4.9% on pypi.org
graphtools 1.5.3
graphtools
25 versions - Latest release: over 1 year ago - 14 dependent packages - 24 dependent repositories - 4.85 thousand downloads last month - 39 stars on GitHub - 1 maintainer
Top 3.2% on pypi.org
delta-sharing 1.0.5
Python Connector for Delta Sharing
35 versions - Latest release: 26 days ago - 6 dependent packages - 4 dependent repositories - 161 thousand downloads last month - 677 stars on GitHub - 3 maintainers
Top 1.9% on pypi.org
daal4py 2024.4.0
daal4py is a Convenient Python API to the Intelยฎ oneAPI Data Analytics Library (oneDAL)
29 versions - Latest release: 6 days ago - 2 dependent packages - 433 dependent repositories - 120 thousand downloads last month - 1,152 stars on GitHub - 3 maintainers
csv-schema-inference 0.0.9 ๐Ÿ’ฐ
A tool to automatically infer columns data types in .csv files
9 versions - Latest release: almost 2 years ago - 2 dependent packages - 123 downloads last month - 32 stars on GitHub - 1 maintainer
Top 4.0% on pypi.org
cloud-volume 8.33.0
A serverless client for reading and writing Neuroglancer Precomputed volumes both locally and on ...
326 versions - Latest release: 9 days ago - 19 dependent packages - 45 dependent repositories - 4.61 thousand downloads last month - 121 stars on GitHub - 1 maintainer
cc2dataset 1.5.0
Easily convert common crawl to image caption set using pyspark
3 versions - Latest release: 11 months ago - 884 downloads last month - 292 stars on GitHub - 1 maintainer
Top 9.2% on pypi.org
stream-framework-plus 1.4.0.3
Stream Framework allows you to build complex feed and caching structures using Redis.
3 versions - Latest release: over 3 years ago - 1 dependent repositories - 319 downloads last month - 4,723 stars on GitHub - 1 maintainer
Top 6.5% on pypi.org
stream_framework 1.4.0
Stream Framework allows you to build complex feed and caching structures using Redis.
15 versions - Latest release: over 7 years ago - 2 dependent repositories - 647 downloads last month - 4,723 stars on GitHub - 1 maintainer
Top 6.7% on pypi.org
feedly 0.11.3
Feedly allows you to build complex feed and caching structures using Redis.
98 versions - Latest release: over 9 years ago - 4 dependent repositories - 416 downloads last month - 4,723 stars on GitHub - 1 maintainer
pyengnet 0.0.3
pyEnGNet: optimized reconstruction of gene co-expression networks using multi-GPU
9 versions - Latest release: over 1 year ago - 33 downloads last month - 1 stars on GitHub - 1 maintainer
schemarrow 0.1.1a0 ๐Ÿ’ฐ
A library for switching pandas backend to pyarrow
2 versions - Latest release: 2 months ago - 19 downloads last month - 2 stars on GitHub - 1 maintainer
datafusion-cli 36.0.0 removed
Command Line Client for DataFusion query engine.
1 version - Latest release: 3 months ago - 228 downloads last month - 4,841 stars on GitHub - 1 maintainer
scalifiai-client 1.3.0a8
SCALIFI AI NO CODE PLATFORM
38 versions - Latest release: about 1 month ago - 342 downloads last month - 1 maintainer
spark-connectby 1.0.4 removed
connectby hierarchy query in spark
13 versions - Latest release: 3 months ago - 1.12 thousand downloads last month - 37,955 stars on GitHub - 1 maintainer
rayban 0.1.0
A scalable ray dashboard built with perspective
1 version - Latest release: 4 months ago - 14 downloads last month - 1 stars on GitHub - 1 maintainer
python-olap 0.1.1
Python for EuclidOLAP.
1 version - Latest release: 5 months ago - 17 downloads last month - 38 stars on GitHub - 1 maintainer
pyolap 0.1.6
Python Olap on EuclidOLAP.
6 versions - Latest release: 5 months ago - 63 downloads last month - 40 stars on GitHub - 1 maintainer
moraine 0.7.0
Modern Radar Interferometry Environment, A simple, stupid InSAR postprocessing tool in big data era
3 versions - Latest release: 19 days ago - 605 downloads last month - 6 stars on GitHub - 1 maintainer
lakehouse-engine 1.19.0
A Spark framework serving as the engine for several lakehouse algorithms and data flows.
8 versions - Latest release: 2 months ago - 289 downloads last month - 6 stars on GitHub - 1 maintainer
stemflow 1.0.9
A package for Adaptive Spatio-Temporal Exploratory Model (AdaSTEM) in python
45 versions - Latest release: 7 months ago - 276 downloads last month - 12 stars on GitHub - 1 maintainer
radarpipeline 2.0.1
A python feature generation and visualization package use with RADAR project data.
3 versions - Latest release: 10 months ago - 37 downloads last month - 7 stars on GitHub - 1 maintainer
retake 0.1.14 ๐Ÿ’ฐ
Open Source Infrastructure for Vector Data Streams
14 versions - Latest release: 10 months ago - 80 downloads last month - 3,727 stars on GitHub - 1 maintainer
zarque-profiling 0.5.10
Data profiling tools for Big Data
6 versions - Latest release: 10 months ago - 419 downloads last month - 2 stars on GitHub - 1 maintainer
tphate 0.0.5
Temporal PHATE (TPHATE) is a python package for learning robust manifold representations of times...
4 versions - Latest release: about 1 year ago - 1 dependent repositories - 21 downloads last month - 18 stars on GitHub - 1 maintainer
ytsaurus-spyt 66.0.3
YTsaurus SPYT high-level client
106 versions - Latest release: 3 months ago - 577 downloads last month - 1,764 stars on GitHub - 2 maintainers
Top 9.0% on pypi.org
ytsaurus-pyspark 66.0.3
Apache Spark Python API, YTsaurus fork
14 versions - Latest release: 3 months ago - 1 dependent package - 1 dependent repositories - 335 downloads last month - 1,669 stars on GitHub - 2 maintainers
dbt-starrocks 1.6.2
The Starrocks adapter plugin for dbt
12 versions - Latest release: about 1 month ago - 715 downloads last month - 6,779 stars on GitHub - 1 maintainer
clip-jax 0.0.2
Training of CLIP in JAX
3 versions - Latest release: 10 months ago - 1 dependent repositories - 207 downloads last month - 3,256 stars on GitHub - 1 maintainer
cellograph 0.0.6
cellograph
6 versions - Latest release: over 1 year ago - 60 downloads last month - 3 stars on GitHub - 1 maintainer
pmoss 2.1
Python package to model the p-value as an n-dependent function using Monte Carlo cross-validation.
3 versions - Latest release: 2 months ago - 12 downloads last month - 16 stars on GitHub - 1 maintainer
openseize 1.2.0
Digital Signal Processing for Big EEG Datasets
4 versions - Latest release: about 1 year ago - 28 downloads last month - 9 stars on GitHub - 1 maintainer
video2dataset 1.3.0
Easily create large video dataset from video urls
4 versions - Latest release: 3 months ago - 1.04 thousand downloads last month - 449 stars on GitHub - 1 maintainer
csv-shuffler 0.0.4 ๐Ÿ’ฐ
A tool to automatically Shuffle lines in a csv file
4 versions - Latest release: almost 2 years ago - 56 downloads last month - 4 stars on GitHub - 1 maintainer
livyc 0.0.14 ๐Ÿ’ฐ
Apache Livy Client
11 versions - Latest release: almost 2 years ago - 120 downloads last month - 3 stars on GitHub - 1 maintainer
faux-data 0.0.18
Generate fake data from yaml templates
12 versions - Latest release: almost 2 years ago - 1 dependent repositories - 7 downloads last month - 0 stars on GitHub - 1 maintainer
gs-gaia 0.1.1
GPU for GraphScope
1 version - Latest release: about 2 years ago - 1 dependent repositories - 7 downloads last month - 3,088 stars on GitHub - 1 maintainer