Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "big-data" keyword

bigdata 0.0.3
IPython magic for running Apache tools for Big Data
4 versions - Latest release: about 5 years ago - 506 downloads last month - 1 maintainer
lidbox 0.7.1
End-to-end spoken language identification (LID) on TensorFlow
6 versions - Latest release: over 3 years ago - 1 dependent repositories - 12 downloads last month - 48 stars on GitHub - 1 maintainer
owck 1.5.8
Optimal Weighted Kriging / Gaussian Process
48 versions - Latest release: over 7 years ago - 1 dependent repositories - 155 downloads last month - 10 stars on GitHub - 1 maintainer
Top 9.3% on pypi.org
tasklogger 1.2.0
tasklogger
13 versions - Latest release: almost 2 years ago - 6 dependent packages - 22 dependent repositories - 4.94 thousand downloads last month - 2 stars on GitHub - 1 maintainer
autodl-gpu 0.1.1
Automatic Deep Learning, towards fully automated multi-label classification for image, video, tex...
2 versions - Latest release: about 4 years ago - 1 dependent repositories - 14 downloads last month - 1,106 stars on GitHub - 1 maintainer
decorrelation 0.5.1
An InSAR postprocessing tool
13 versions - Latest release: 5 months ago - 31 downloads last month - 6 stars on GitHub - 1 maintainer
Top 4.0% on pypi.org
scprep 1.2.3
scprep
43 versions - Latest release: 12 months ago - 19 dependent packages - 34 dependent repositories - 13.5 thousand downloads last month - 71 stars on GitHub - 1 maintainer
jpl.pipedreams 1.0.5
Pipe Dreams: API for publication of scientific data
6 versions - Latest release: 3 months ago - 1 dependent repositories - 30 downloads last month - 0 stars on GitHub - 2 maintainers
edrn.labcas.ui 1.0.17
User interface for LabCAS, for EDRN
22 versions - Latest release: over 5 years ago - 2 dependent repositories - 3 downloads last month - 1 stars on GitHub - 1 maintainer
metastore 1.0.0.dev21
Metastore Python SDK. Feature store and data catalog for machine learning.
21 versions - Latest release: over 2 years ago - 1 dependent repositories - 82 downloads last month - 0 stars on GitHub - 1 maintainer
sageworks 0.6.2 💰
SageWorks: A Python WorkBench for creating and deploying AWS SageMaker Models
133 versions - Latest release: about 1 month ago - 1.19 thousand downloads last month - 37 stars on GitHub - 1 maintainer
pygama 2.0.0
Python package for data processing and analysis
18 versions - Latest release: about 1 month ago - 1 dependent package - 286 downloads last month - 16 stars on GitHub - 1 maintainer
Top 7.6% on pypi.org
geopyspark 0.4.3
Python bindings for GeoTrellis
15 versions - Latest release: over 5 years ago - 3 dependent repositories - 2.46 thousand downloads last month - 177 stars on GitHub - 2 maintainers
talariaclient 0.0.5
Talaria Client to ingest events to TalariaDB
1 version - Latest release: about 4 years ago - 1 dependent repositories - 15 downloads last month - 197 stars on GitHub - 1 maintainer
latentcor 0.2.5
Fast Computation of Latent Correlations for Mixed Data
6 versions - Latest release: 7 months ago - 1 dependent repositories - 30 downloads last month - 7 stars on GitHub - 1 maintainer
csv-schema-inference 0.0.9 💰
A tool to automatically infer columns data types in .csv files
9 versions - Latest release: almost 2 years ago - 2 dependent packages - 123 downloads last month - 32 stars on GitHub - 1 maintainer
Top 4.9% on pypi.org
graphtools 1.5.3
graphtools
25 versions - Latest release: over 1 year ago - 14 dependent packages - 24 dependent repositories - 5.19 thousand downloads last month - 39 stars on GitHub - 1 maintainer
bygfiles 0.1.1
Big file collection manager
1 version - Latest release: over 3 years ago - 2 dependent repositories - 33 downloads last month - 0 stars on GitHub - 1 maintainer
Top 4.0% on pypi.org
lithops 3.3.0
Lithops lets you transparently run your Python applications in the Cloud
50 versions - Latest release: about 2 months ago - 2 dependent packages - 8 dependent repositories - 2.67 thousand downloads last month - 289 stars on GitHub - 2 maintainers
Top 3.0% on pypi.org
img2dataset 1.45.0
Easily turn a set of image urls to an image dataset
87 versions - Latest release: 5 months ago - 2 dependent packages - 10 dependent repositories - 27.2 thousand downloads last month - 3,256 stars on GitHub - 1 maintainer
stemflow 1.0.9
A package for Adaptive Spatio-Temporal Exploratory Model (AdaSTEM) in python
45 versions - Latest release: 7 months ago - 189 downloads last month - 12 stars on GitHub - 1 maintainer
vertica-ml-python 1.0b0
Vertica-ML-Python simplifies data exploration, data cleaning and machine learning in Vertica.
1 version - Latest release: about 4 years ago - 1 dependent repositories - 12 downloads last month - 217 stars on GitHub - 1 maintainer
spotrix 1.2.0
A modern, enterprise-ready business intelligence web application
4 versions - Latest release: over 1 year ago - 12 downloads last month - 27 stars on GitHub - 1 maintainer
clip-jax 0.0.2
Training of CLIP in JAX
3 versions - Latest release: 11 months ago - 1 dependent repositories - 107 downloads last month - 3,256 stars on GitHub - 1 maintainer
jqdatapy 0.1.8 💰
unified,modular quantitative system for human beings
11 versions - Latest release: about 2 years ago - 1 dependent package - 5 dependent repositories - 367 downloads last month - 12 stars on GitHub - 1 maintainer
video2dataset 1.3.0
Easily create large video dataset from video urls
4 versions - Latest release: 4 months ago - 726 downloads last month - 454 stars on GitHub - 1 maintainer
miss-lightgbm-mmlspark
Microsoft ML for Spark
1 version - 4,986 stars on GitHub
youml 0.6.0
A Machine Learning Toolkit
1 version - Latest release: over 2 years ago - 1 dependent repositories - 5 downloads last month - 13 stars on GitHub - 1 maintainer
hyperengine 0.1.1
Python library for Bayesian hyper-parameters optimization
1 version - Latest release: over 6 years ago - 6 dependent repositories - 18 downloads last month - 85 stars on GitHub - 1 maintainer
spark-df-profiling-optimus 0.1.1
Create HTML profiling reports from Apache Spark DataFrames
6 versions - Latest release: almost 7 years ago - 3 dependent repositories - 347 downloads last month - 2 stars on GitHub - 1 maintainer
mmtfpyspark 0.3.6
Methods for parallel and distributed analysis and mining of the Protein Data Bank using MMTF and ...
10 versions - Latest release: over 5 years ago - 1 dependent repositories - 41 downloads last month - 67 stars on GitHub - 2 maintainers
trjtrypy 0.0.0
Distance between trajectories
1 version - Latest release: almost 3 years ago - 1 dependent repositories - 9 downloads last month - 3 stars on GitHub - 4 maintainers
ukv 0.12.1
Python bindings for Unum's UStore.
23 versions - Latest release: about 1 year ago - 99 downloads last month - 488 stars on GitHub - 1 maintainer
Top 2.7% on pypi.org
uproot3 3.14.4
ROOT I/O in pure Python and Numpy.
5 versions - Latest release: over 3 years ago - 15 dependent packages - 30 dependent repositories - 44.7 thousand downloads last month - 314 stars on GitHub - 1 maintainer
pyolap 0.1.6
Python Olap on EuclidOLAP.
6 versions - Latest release: 6 months ago - 74 downloads last month - 43 stars on GitHub - 1 maintainer
ustore 1.7.5
Python bindings for Unum's UStore.
21 versions - Latest release: almost 2 years ago - 1 dependent repositories - 38 downloads last month - 488 stars on GitHub - 1 maintainer
python-olap 0.1.1
Python for EuclidOLAP.
1 version - Latest release: 6 months ago - 10 downloads last month - 43 stars on GitHub - 1 maintainer
euclidolap 0.1.1
Python for EuclidOLAP.
1 version - Latest release: 6 months ago - 19 downloads last month - 43 stars on GitHub - 1 maintainer
sr3 0.0.1.1
SR3 fusion clustering
1 version - Latest release: over 4 years ago - 1 dependent repositories - 24 downloads last month - 0 stars on GitHub - 1 maintainer
pybda 0.1.0
Analysis of big biological data sets for distributed HPC clusters.
6 versions - Latest release: almost 5 years ago - 1 dependent repositories - 33 downloads last month - 9 stars on GitHub - 1 maintainer
algops 0.0.1
1 version - Latest release: almost 4 years ago - 1 dependent repositories - 23 downloads last month - 1 maintainer
Top 4.0% on pypi.org
pmaw 3.0.0
A multithread Pushshift.io API Wrapper for reddit.com comment and submission searches.
18 versions - Latest release: over 1 year ago - 5 dependent packages - 47 dependent repositories - 2.23 thousand downloads last month - 212 stars on GitHub - 1 maintainer
Top 2.9% on pypi.org
phate 1.0.11
PHATE
36 versions - Latest release: 12 months ago - 18 dependent packages - 33 dependent repositories - 2.86 thousand downloads last month - 448 stars on GitHub - 1 maintainer
ibmpairs 3.1.1
open source Python modules for the IBM PAIRS Geoscope platform
34 versions - Latest release: 2 months ago - 1 dependent repositories - 1.86 thousand downloads last month - 34 stars on GitHub - 1 maintainer
ytsaurus-spyt 66.0.3
YTsaurus SPYT high-level client
106 versions - Latest release: 4 months ago - 681 downloads last month - 1,764 stars on GitHub - 2 maintainers
Top 9.0% on pypi.org
ytsaurus-pyspark 66.0.3
Apache Spark Python API, YTsaurus fork
14 versions - Latest release: 4 months ago - 1 dependent package - 1 dependent repositories - 484 downloads last month - 1,669 stars on GitHub - 2 maintainers
clusterdock 2.3.0
clusterdock is a framework for creating Docker-based container clusters
24 versions - Latest release: almost 4 years ago - 1 dependent repositories - 844 downloads last month - 28 stars on GitHub - 3 maintainers
Top 2.4% on pypi.org
apache-bookkeeper-client 4.16.5
Apache BookKeeper client library
31 versions - Latest release: 2 months ago - 2 dependent packages - 5 dependent repositories - 12.9 thousand downloads last month - 1,861 stars on GitHub - 10 maintainers
scarf 0.28.9
Scarf
45 versions - Latest release: 5 months ago - 1 dependent repositories - 432 downloads last month - 89 stars on GitHub - 1 maintainer
scarf-toolkit 0.8.5 removed
Scarf
16 versions - Latest release: almost 3 years ago - 54 stars on GitHub
catboost-dev 0.26.1
Catboost Python Package
413 versions - Latest release: almost 3 years ago - 1 dependent repositories - 13.7 thousand downloads last month - 7,793 stars on GitHub - 1 maintainer
sqlalchemy-risingwave 1.1.0
RisingWave dialect for SQLAlchemy
11 versions - Latest release: 2 months ago - 1 dependent package - 2 dependent repositories - 15.8 thousand downloads last month - 6,401 stars on GitHub - 1 maintainer
j2v 1.6.0
A tool to generate Looker views and explores from JSONs
26 versions - Latest release: about 3 years ago - 1 dependent repositories - 232 downloads last month - 12 stars on GitHub - 3 maintainers
alfrd 0.0.2 💰
Automated Logical FRamework for script execution Dynamically(ALFRD)
2 versions - Latest release: about 1 month ago - 276 downloads last month - 1 stars on GitHub - 1 maintainer
Top 8.0% on pypi.org
verticapy 1.0.3
VerticaPy simplifies data exploration, data cleaning, and machine learning in Vertica.
28 versions - Latest release: 2 months ago - 1 dependent repositories - 5.7 thousand downloads last month - 209 stars on GitHub - 3 maintainers
miniff 0.1.4 💰
A minimal implementation of force fields
4 versions - Latest release: about 3 years ago - 1 dependent repositories - 20 downloads last month - 8,989 stars on GitHub - 2 maintainers
nearist 1.0.2
Tools for Nearist hardware
3 versions - Latest release: about 6 years ago - 1 dependent repositories - 20 downloads last month - 6 stars on GitHub - 1 maintainer
streamsql 2.0.1
Python SDK for the StreamSQL feature store
14 versions - Latest release: almost 4 years ago - 1 dependent repositories - 95 downloads last month - 4 stars on GitHub - 1 maintainer
nozberkman-mmlspark 1.0.0
Microsoft ML for Spark
1 version - Latest release: over 2 years ago - 1 dependent repositories - 18 downloads last month - 4,472 stars on GitHub - 1 maintainer
yahoo-panoptes 1.3.2
Network Telemetry And Monitoring
89 versions - Latest release: over 4 years ago - 1 dependent repositories - 452 downloads last month - 98 stars on GitHub - 1 maintainer
m-phate 0.1.6
m-phate
4 versions - Latest release: over 4 years ago - 1 dependent repositories - 44 downloads last month - 58 stars on GitHub - 1 maintainer
zarque-profiling 0.5.10
Data profiling tools for Big Data
6 versions - Latest release: 11 months ago - 101 downloads last month - 2 stars on GitHub - 1 maintainer
Top 4.8% on pypi.org
uproot4 4.0.0
ROOT I/O in pure Python and NumPy.
30 versions - Latest release: over 3 years ago - 10 dependent packages - 11 dependent repositories - 44.8 thousand downloads last month - 199 stars on GitHub - 2 maintainers
apache-iotdb-nightly 0.11.2.20210408
Apache IoTDB client API
3 versions - Latest release: about 3 years ago - 1 dependent repositories - 35 downloads last month - 4,281 stars on GitHub - 1 maintainer
Top 5.4% on pypi.org
apache-iotdb 1.3.0
Apache IoTDB client API
32 versions - Latest release: 5 months ago - 3 dependent repositories - 890 downloads last month - 4,281 stars on GitHub - 6 maintainers
pycarbon-sdk 0.1.0
Pycarbon is a library that optimizes data access for AI based on CarbonData files, and it is bas...
1 version - Latest release: about 4 years ago - 1 dependent repositories - 24 downloads last month - 1,420 stars on GitHub - 1 maintainer
Top 6.2% on pypi.org
gs-apps 0.27.0
🔨 🍇 💻 🚀 GraphScope: A One-Stop Large-Scale Graph Computing System from Alibaba | 一站式图计算系统
510 versions - Latest release: 2 months ago - 1 dependent package - 1 dependent repositories - 1.51 thousand downloads last month - 3,101 stars on GitHub - 3 maintainers
gs-gaia 0.1.1
GPU for GraphScope
1 version - Latest release: about 2 years ago - 1 dependent repositories - 13 downloads last month - 3,101 stars on GitHub - 1 maintainer
gs-gaiax 0.1.1
GPU for GraphScope
1 version - Latest release: about 2 years ago - 1 dependent repositories - 17 downloads last month - 3,101 stars on GitHub - 1 maintainer
gs-jython 0.13.0
🔨 🍇 💻 🚀 GraphScope: A One-Stop Large-Scale Graph Computing System from Alibaba | 一站式图计算系统
8 versions - Latest release: about 2 years ago - 1 dependent package - 1 dependent repositories - 76 downloads last month - 3,101 stars on GitHub - 1 maintainer
graphscope-gpu 0.1.1
GPU for GraphScope
1 version - Latest release: about 2 years ago - 1 dependent repositories - 14 downloads last month - 3,101 stars on GitHub - 1 maintainer
graphscope-gaiax 0.1.1
GPU for GraphScope
1 version - Latest release: about 2 years ago - 1 dependent repositories - 10 downloads last month - 3,101 stars on GitHub - 1 maintainer
graphscope-gaia 0.1.1
GPU for GraphScope
1 version - Latest release: about 2 years ago - 1 dependent repositories - 10 downloads last month - 3,101 stars on GitHub - 1 maintainer
gs-java 0.1.1
GPU for GraphScope
1 version - Latest release: about 2 years ago - 1 dependent repositories - 19 downloads last month - 3,101 stars on GitHub - 1 maintainer
graphscope-jupyter 0.4.1
Python implementation of the graph visualization tool Graphin.
1 version - Latest release: about 3 years ago - 1 dependent repositories - 17 downloads last month - 3,101 stars on GitHub - 1 maintainer
Top 2.9% on pypi.org
graphscope-client 0.27.0
GraphScope: A One-Stop Large-Scale Graph Computing System from Alibaba
532 versions - Latest release: 2 months ago - 1 dependent package - 13 dependent repositories - 3.19 thousand downloads last month - 3,101 stars on GitHub - 3 maintainers
pygaiax 0.1.1
GPU for GraphScope
1 version - Latest release: about 2 years ago - 1 dependent repositories - 12 downloads last month - 3,088 stars on GitHub - 1 maintainer
graphscope-java 0.1.1
GPU for GraphScope
1 version - Latest release: about 2 years ago - 1 dependent repositories - 11 downloads last month - 3,088 stars on GitHub - 1 maintainer
Top 5.2% on pypi.org
graphscope 0.27.0
🔨 🍇 💻 🚀 GraphScope: A One-Stop Large-Scale Graph Computing System from Alibaba | 一站式图计算系统
577 versions - Latest release: 2 months ago - 1 dependent package - 2 dependent repositories - 2.38 thousand downloads last month - 3,101 stars on GitHub - 3 maintainers
gs-gpu 0.1.1
GPU for GraphScope
1 version - Latest release: about 2 years ago - 1 dependent repositories - 11 downloads last month - 3,101 stars on GitHub - 1 maintainer
Top 6.2% on pypi.org
gs-engine 0.27.0
🔨 🍇 💻 🚀 GraphScope: A One-Stop Large-Scale Graph Computing System from Alibaba | 一站式图计算系统
513 versions - Latest release: 2 months ago - 1 dependent package - 1 dependent repositories - 1.18 thousand downloads last month - 3,101 stars on GitHub - 3 maintainers
gs-lib 0.13.0
🔨 🍇 💻 🚀 GraphScope: A One-Stop Large-Scale Graph Computing System from Alibaba | 一站式图计算系统
8 versions - Latest release: about 2 years ago - 1 dependent package - 1 dependent repositories - 68 downloads last month - 3,101 stars on GitHub - 1 maintainer
Top 6.2% on pypi.org
gs-include 0.27.0
🔨 🍇 💻 🚀 GraphScope: A One-Stop Large-Scale Graph Computing System from Alibaba | 一站式图计算系统
513 versions - Latest release: 2 months ago - 1 dependent package - 1 dependent repositories - 1.81 thousand downloads last month - 3,088 stars on GitHub - 3 maintainers
dvu 0.0.2
Functions for data visualization in matplotlib.
3 versions - Latest release: 5 months ago - 1 dependent package - 1 dependent repositories - 47 downloads last month - 18 stars on GitHub - 1 maintainer
Top 3.3% on pypi.org
synapseml 1.0.4
Synapse Machine Learning
16 versions - Latest release: 2 months ago - 2 dependent packages - 3 dependent repositories - 230 thousand downloads last month - 4,985 stars on GitHub - 1 maintainer
lakehouse-engine 1.19.0
A Spark framework serving as the engine for several lakehouse algorithms and data flows.
9 versions - Latest release: 3 months ago - 254 downloads last month - 6 stars on GitHub - 1 maintainer
Top 3.2% on pypi.org
awkward0 0.15.5
Manipulate arrays of complex data structures as easily as Numpy.
6 versions - Latest release: over 3 years ago - 5 dependent packages - 15 dependent repositories - 45.2 thousand downloads last month - 217 stars on GitHub - 1 maintainer
awkward-numba 0.14.0
Allows awkward arrays to be used in Numba-compiled code and optimizes awkward methods with JIT co...
68 versions - Latest release: over 3 years ago - 1 dependent repositories - 614 downloads last month - 216 stars on GitHub - 1 maintainer
zvtm 0.0.11
unified,modular quant framework for mysql
10 versions - Latest release: about 2 years ago - 1 dependent repositories - 100 downloads last month - 5 stars on GitHub - 1 maintainer
mister 0.0.2
Approachable map/reduce jobs
2 versions - Latest release: over 5 years ago - 1 dependent repositories - 23 downloads last month - 0 stars on GitHub - 1 maintainer
jagular 0.0.2
Out-of-core pre-processing of big-ish electrophysiology data.
2 versions - Latest release: about 2 years ago - 1 dependent repositories - 23 downloads last month - 3 stars on GitHub - 1 maintainer
Top 6.5% on pypi.org
selinon 1.3.0
an advanced dynamic task flow management on top of Celery
19 versions - Latest release: over 1 year ago - 15 dependent repositories - 837 downloads last month - 294 stars on GitHub - 2 maintainers
alluxio-python-library 2.0.1
Alluxio Python library 1.0.0 provides API to interact with Alluxio servers.
3 versions - Latest release: 5 months ago - 22 downloads last month - 24 stars on GitHub - 2 maintainers
Top 6.9% on pypi.org
alluxio 1.0.0
Alluxio Python library 1.0.0 provides API to interact with Alluxio servers.
5 versions - Latest release: 3 months ago - 1 dependent package - 77 dependent repositories - 3.35 thousand downloads last month - 24 stars on GitHub - 3 maintainers
decentralized-internet 4.3.5 💰
A library for creating distributed web and grid projects
3 versions - Latest release: over 4 years ago - 1 dependent repositories - 32 downloads last month - 488 stars on GitHub - 1 maintainer
grizzlys 0.0.1
Python DataFrames powered by Julia
2 versions - Latest release: about 2 months ago - 93 downloads last month - 0 stars on GitHub - 1 maintainer
scannerpy 0.2.1
Efficient video analysis at scale
2 versions - Latest release: about 6 years ago - 1 dependent repositories - 13 downloads last month - 607 stars on GitHub - 2 maintainers
h2o-mlflow-flavor 0.1.0
A mlflow flavor for working with H2O-3 MOJO and POJO models
1 version - Latest release: 7 months ago - 39 downloads last month - 6,710 stars on GitHub - 1 maintainer
Top 4.4% on pypi.org
nflx-genie-client 3.6.17
Genie Python Client.
102 versions - Latest release: 11 months ago - 2 dependent repositories - 80.4 thousand downloads last month - 1,679 stars on GitHub - 3 maintainers
dislib 0.9.0
The distributed computing library on top of PyCOMPSs
19 versions - Latest release: 7 months ago - 1 dependent repositories - 721 downloads last month - 43 stars on GitHub - 2 maintainers