Ecosyste.ms: Packages
An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.
repo1.maven.org "big-data" keyword
org.milyn:smooks-parent 1.7.1
Smooks is an extensible framework for building applications for processing XML and non XML data (...3 versions - Latest release: over 5 years ago - 377 stars on GitHub
org.smooks:smooks-scribe-ibatis 2.0.0-M3
Smooks is an extensible Java framework for building XML and non-XML data (CSV, EDI, Java, etc...)...3 versions - Latest release: about 3 years ago - 4 dependent packages - 377 stars on GitHub
org.apache.apex:apex-app-archetype 3.7.0
The Apache Software Foundation provides support for the Apache community of open-source software ...7 versions - Latest release: about 6 years ago - 350 stars on GitHub
Top 6.4% on repo1.maven.org
51 versions - Latest release: over 3 years ago - 3 dependent packages - 18 dependent repositories - 7,473 stars on GitHub
io.prestosql:presto-parquet 350
Presto51 versions - Latest release: over 3 years ago - 3 dependent packages - 18 dependent repositories - 7,473 stars on GitHub
uk.gov.gchq.gaffer:road-traffic-model 2.2.1
A scalable Graph database framework100 versions - Latest release: 6 days ago - 3 dependent packages - 4 dependent repositories - 1,694 stars on GitHub
org.apache.predictionio:predictionio-tools_2.10 0.10.0-incubating-rc1
predictionio-tools1 version - Latest release: over 7 years ago - 12,552 stars on GitHub
org.apache.flink:flink-table-store-spark2 0.3.0
The Apache Software Foundation provides support for the Apache community of open-source software ...3 versions - Latest release: over 1 year ago - 1,553 stars on GitHub
com.nfsdb:nfsdb-thrift 2.1.0
NFSdb Apache Thrift support library7 versions - Latest release: over 9 years ago - 10,280 stars on GitHub
uk.gov.gchq.gaffer:flink-library 2.2.1
A scalable Graph database framework88 versions - Latest release: 6 days ago - 1 dependent package - 1 dependent repositories - 1,694 stars on GitHub
Top 9.5% on repo1.maven.org
51 versions - Latest release: over 3 years ago - 1 dependent package - 17 dependent repositories - 7,473 stars on GitHub
io.prestosql:presto-mysql 350
Presto - MySQL Connector51 versions - Latest release: over 3 years ago - 1 dependent package - 17 dependent repositories - 7,473 stars on GitHub
org.bdgenomics.adam:adam-distribution_2.11 0.23.0
A fast, scalable genome analysis system10 versions - Latest release: over 6 years ago - 948 stars on GitHub
Top 9.3% on repo1.maven.org
8 versions - Latest release: 4 months ago - 11 dependent packages - 8 dependent repositories - 948 stars on GitHub
org.bdgenomics.adam:adam-core-spark3_2.12 1.0.1
A fast, scalable genome analysis system8 versions - Latest release: 4 months ago - 11 dependent packages - 8 dependent repositories - 948 stars on GitHub
Top 9.6% on repo1.maven.org
14 versions - Latest release: about 8 years ago - 2 dependent packages - 6 dependent repositories - 12,552 stars on GitHub
io.prediction:data_2.10 0.9.6
data14 versions - Latest release: about 8 years ago - 2 dependent packages - 6 dependent repositories - 12,552 stars on GitHub
Top 8.1% on repo1.maven.org
187 versions - Latest release: 5 months ago - 1 dependent package - 93 dependent repositories - 14,560 stars on GitHub
com.facebook.presto:presto-atop 0.285.1
Presto - Atop Connector187 versions - Latest release: 5 months ago - 1 dependent package - 93 dependent repositories - 14,560 stars on GitHub
com.facebook.presto:presto-benchto-queries 0.214
Presto2 versions - Latest release: over 5 years ago - 4 dependent packages - 14,558 stars on GitHub
com.vesoft:nebula-spark-connector 3.8.0
Nebula Spark Connector18 versions - Latest release: 20 days ago - 1 dependent package - 5 dependent repositories - 8,876 stars on GitHub
org.opencypher:okapi-licensecheck-config 0.2.3
Okapi is a compiler pipeline for Cypher queries, including a consumer API, which translates Cyphe...18 versions - Latest release: over 5 years ago - 329 stars on GitHub
com.twitter:parquet-hive-binding-factory 1.6.0
Parquet is a columnar storage format that supports nested data. This provides the java implementa...18 versions - Latest release: about 9 years ago - 2 dependent packages - 10 dependent repositories - 1 stars on GitHub
org.opencypher:okapi-ir 0.4.2
Okapi is a compiler pipeline for Cypher queries, including a consumer API, which translates Cyphe...33 versions - Latest release: almost 5 years ago - 7 dependent packages - 2 dependent repositories - 329 stars on GitHub
io.eels:eel-orc_2.12 1.2.4
eel-orc7 versions - Latest release: almost 7 years ago - 1 dependent package - 147 stars on GitHub
com.hurence.logisland:logisland-outlier-detection-plugin_2.11 0.14.0 💰
LogIsland is an event mining platform based on Kafka to handle a huge amount of data in realtime.3 versions - Latest release: almost 6 years ago - 108 stars on GitHub
com.hotels:circus-train-distcp-copier 16.4.1
circus-train replicates data and hive metadata between various clusters32 versions - Latest release: about 2 years ago - 2 dependent packages - 1 dependent repositories - 84 stars on GitHub
com.github.harbby:sylph-base-kafka 0.6.0-alpha3
A lightweight API test framework1 version - Latest release: almost 5 years ago - 3 dependent packages - 404 stars on GitHub
com.github.harbby:sylph-kafka09 0.6.0-alpha3
A lightweight API test framework1 version - Latest release: almost 5 years ago - 404 stars on GitHub
com.github.harbby:sylph-elasticsearch6 0.6.0-alpha3
A lightweight API test framework1 version - Latest release: almost 5 years ago - 404 stars on GitHub
com.github.harbby:sylph-yarn 0.6.0-alpha3
A lightweight API test framework1 version - Latest release: almost 5 years ago - 2 dependent packages - 404 stars on GitHub
com.github.harbby:sylph-connectors 0.6.0-alpha3
A lightweight API test framework1 version - Latest release: almost 5 years ago - 404 stars on GitHub
com.github.harbby:sylph-hdfs 0.6.0-alpha3
A lightweight API test framework1 version - Latest release: almost 5 years ago - 404 stars on GitHub
com.github.harbby:sylph-spi 0.6.0-alpha3
A lightweight API test framework1 version - Latest release: almost 5 years ago - 6 dependent packages - 404 stars on GitHub
com.hotels:circus-train-comparator 16.4.1
circus-train replicates data and hive metadata between various clusters32 versions - Latest release: about 2 years ago - 3 dependent packages - 1 dependent repositories - 84 stars on GitHub
com.vesoft:nebula-algorithm 3.1.0
Nebula Algorithm9 versions - Latest release: about 1 year ago - 2 dependent repositories - 8,874 stars on GitHub
com.github.harbby:sylph-elasticsearch5 0.6.0-alpha3
A lightweight API test framework1 version - Latest release: almost 5 years ago - 404 stars on GitHub
com.github.harbby:sylph-clickhouse 0.6.0-alpha3
A lightweight API test framework1 version - Latest release: almost 5 years ago - 404 stars on GitHub
io.prestosql:presto-test-jdbc-compatibility 341
Presto - Tests whether older Presto JDBC clients are compatible with current Presto server2 versions - Latest release: over 3 years ago - 7,473 stars on GitHub
Top 7.3% on repo1.maven.org
51 versions - Latest release: over 3 years ago - 4 dependent packages - 3 dependent repositories - 7,473 stars on GitHub
io.prestosql:presto-mongodb 350
Presto - mongodb Connector51 versions - Latest release: over 3 years ago - 4 dependent packages - 3 dependent repositories - 7,473 stars on GitHub
ai.h2o:h2o-test-support 3.46.0.1
null48 versions - Latest release: 2 months ago - 6,186 stars on GitHub
com.hurence.logisland:logisland-spark_2_1-engine_2.11 0.14.0 💰
LogIsland is an event mining platform based on Kafka to handle a huge amount of data in realtime.3 versions - Latest release: almost 6 years ago - 108 stars on GitHub
com.hazelcast.jet.contrib:pulsar 0.1
A Hazelcast Jet connector for Apache Pulsar which enables Hazelcast Jet pipelines to produceand c...1 version - Latest release: almost 4 years ago - 1 dependent repositories - 20 stars on GitHub
uk.gov.gchq.gaffer:access 2.2.1
A scalable Graph database framework29 versions - Latest release: 6 days ago - 4 dependent packages - 2 dependent repositories - 1,694 stars on GitHub
org.apache.reef:reef-poison 0.16.0
Fault injection for REEF7 versions - Latest release: almost 7 years ago - 2 dependent packages - 4 dependent repositories - 94 stars on GitHub
com.twitter:parquet-hive-binding-bundle 1.6.0
Parquet is a columnar storage format that supports nested data. This provides the java implementa...18 versions - Latest release: about 9 years ago - 1 dependent package - 10 dependent repositories - 1 stars on GitHub
io.qbeast:qbeast-core_2.12 0.6.0
qbeast-core13 versions - Latest release: 14 days ago - 1 dependent package - 138 stars on GitHub
io.qbeast:qbeast-spark_2.12 0.6.0
qbeast-spark13 versions - Latest release: 14 days ago - 138 stars on GitHub
com.hazelcast.jet.contrib:debezium 0.1
A Hazelcast Jet connector for Debezium which enables Hazelcast Jet pipelines to read CDC events f...1 version - Latest release: about 4 years ago - 1 dependent repositories - 20 stars on GitHub
io.prediction:common_2.10 0.9.6
common7 versions - Latest release: about 8 years ago - 1 dependent package - 4 dependent repositories - 12,552 stars on GitHub
com.yahoo.bullet:bullet-core 1.5.2
This is the core library that powers various components for Bullet - a real-time data query engine.13 versions - Latest release: over 2 years ago - 7 dependent packages - 9 dependent repositories - 37 stars on GitHub
Top 10.0% on repo1.maven.org
93 versions - Latest release: 6 days ago - 4 dependent packages - 3 dependent repositories - 1,694 stars on GitHub
uk.gov.gchq.gaffer:cache 2.2.1
A scalable Graph database framework93 versions - Latest release: 6 days ago - 4 dependent packages - 3 dependent repositories - 1,694 stars on GitHub
Top 7.9% on repo1.maven.org
73 versions - Latest release: over 8 years ago - 3 dependent packages - 6 dependent repositories - 5,280 stars on GitHub
com.hazelcast:hazelcast-code-generator 3.5.5
Hazelcast In-Memory DataGrid73 versions - Latest release: over 8 years ago - 3 dependent packages - 6 dependent repositories - 5,280 stars on GitHub
org.opencypher:spark-cypher-examples 0.3.2
Okapi is a compiler pipeline for Cypher queries, including a consumer API, which translates Cyphe...15 versions - Latest release: about 5 years ago - 329 stars on GitHub
ch.cern.spark:spark-avro_2.12 3.0.1
The Apache Software Foundation provides support for the Apache community of open-source software ...1 version - Latest release: over 3 years ago - 37,392 stars on GitHub
com.hazelcast.jet.contrib:kafka-connect 0.1
Generic Kafka Connect source provides ability to plug any Kafka Connect source for data ingestion...1 version - Latest release: about 4 years ago - 1 dependent package - 2 dependent repositories - 20 stars on GitHub
Top 6.2% on repo1.maven.org
41 versions - Latest release: 5 months ago - 2 dependent packages - 75 dependent repositories - 14,558 stars on GitHub
com.facebook.presto:presto-prometheus 0.285.1
Presto - Prometheus Connector41 versions - Latest release: 5 months ago - 2 dependent packages - 75 dependent repositories - 14,558 stars on GitHub
com.hurence.logisland:logisland-solr-client-service-api 0.14.0 💰
LogIsland is an event mining platform based on Kafka to handle a huge amount of data in realtime.1 version - Latest release: almost 6 years ago - 2 dependent packages - 108 stars on GitHub
com.hazelcast.jet.contrib:elasticsearch-7 0.2
A Hazelcast Jet connector for Elasticsearch (v7.x.x) for querying/indexing objects from/to Elasti...2 versions - Latest release: about 4 years ago - 20 stars on GitHub
org.apache.predictionio:apache-predictionio-data-hbase_2.10 0.13.0
apache-predictionio-data-hbase4 versions - Latest release: over 5 years ago - 12,552 stars on GitHub
org.apache.predictionio:apache-predictionio-data-s3_2.11 0.14.0
apache-predictionio-data-s34 versions - Latest release: about 5 years ago - 12,552 stars on GitHub
org.apache.predictionio:apache-predictionio-data-jdbc_2.11 0.14.0
apache-predictionio-data-jdbc5 versions - Latest release: about 5 years ago - 1 dependent package - 12,552 stars on GitHub
org.apache.predictionio:apache-predictionio-common_2.10 0.13.0
apache-predictionio-common6 versions - Latest release: over 5 years ago - 1 dependent package - 1 dependent repositories - 12,552 stars on GitHub
org.apache.predictionio:apache-predictionio-data-elasticsearch_2.10 0.13.0
apache-predictionio-data-elasticsearch4 versions - Latest release: over 5 years ago - 12,552 stars on GitHub
org.apache.predictionio:apache-predictionio-e2_2.10 0.13.0
apache-predictionio-e26 versions - Latest release: over 5 years ago - 12,552 stars on GitHub
org.apache.predictionio:apache-predictionio-data-hbase_2.11 0.14.0
apache-predictionio-data-hbase5 versions - Latest release: about 5 years ago - 1 dependent package - 12,552 stars on GitHub
org.apache.predictionio:apache-predictionio-data-localfs_2.10 0.13.0
apache-predictionio-data-localfs4 versions - Latest release: over 5 years ago - 12,552 stars on GitHub
org.apache.predictionio:apache-predictionio-e2_2.11 0.14.0
apache-predictionio-e25 versions - Latest release: about 5 years ago - 12,552 stars on GitHub
org.apache.predictionio:apache-predictionio-data-hdfs_2.10 0.13.0
apache-predictionio-data-hdfs4 versions - Latest release: over 5 years ago - 12,552 stars on GitHub
org.apache.predictionio:apache-predictionio-data_2.11 0.14.0
apache-predictionio-data5 versions - Latest release: about 5 years ago - 3 dependent packages - 12,552 stars on GitHub
Top 8.6% on repo1.maven.org
6 versions - Latest release: over 5 years ago - 7 dependent packages - 1 dependent repositories - 12,552 stars on GitHub
org.apache.predictionio:apache-predictionio-core_2.10 0.13.0
apache-predictionio-core6 versions - Latest release: over 5 years ago - 7 dependent packages - 1 dependent repositories - 12,552 stars on GitHub
org.apache.predictionio:apache-predictionio-data_2.10 0.13.0
apache-predictionio-data6 versions - Latest release: over 5 years ago - 3 dependent packages - 1 dependent repositories - 12,552 stars on GitHub
com.twitter:parquet-protobuf 1.6.0
Parquet is a columnar storage format that supports nested data. This provides the java implementa...13 versions - Latest release: about 9 years ago - 6 dependent repositories - 1 stars on GitHub
org.apache.drill.contrib:drill-mongo-storage 1.21.1
Apache Drill is an open source, low latency SQL query engine for Hadoop and NoSQL.24 versions - Latest release: about 1 year ago - 1 dependent repositories - 1,781 stars on GitHub
Top 6.8% on repo1.maven.org
14 versions - Latest release: over 3 years ago - 25 dependent packages - 11 dependent repositories - 948 stars on GitHub
org.bdgenomics.adam:adam-core-spark2_2.11 0.33.0
A fast, scalable genome analysis system14 versions - Latest release: over 3 years ago - 25 dependent packages - 11 dependent repositories - 948 stars on GitHub
org.bdgenomics.adam:adam-distribution_2.10 0.23.0
A fast, scalable genome analysis system10 versions - Latest release: over 6 years ago - 1 dependent repositories - 948 stars on GitHub
org.bdgenomics.adam:adam-distribution-spark2_2.11 0.33.0
A fast, scalable genome analysis system14 versions - Latest release: over 3 years ago - 948 stars on GitHub
org.bdgenomics.adam:adam-assembly-spark2_2.11 0.33.0
A fast, scalable genome analysis system14 versions - Latest release: over 3 years ago - 2 dependent packages - 948 stars on GitHub
ch.epfl.scala:spores-serialization_2.11 0.4.3
spores-serialization4 versions - Latest release: over 7 years ago - 28 stars on GitHub
com.hazelcast.jet.contrib:probabilistic 0.2
Collections of probabilistic aggregations2 versions - Latest release: about 4 years ago - 20 stars on GitHub
org.milyn:milyn-scribe-ibatis 1.7.1
Maintain uniform and consistent view of Scribe specific dependencies and build configuration.17 versions - Latest release: over 5 years ago - 2 dependent packages - 6 dependent repositories - 377 stars on GitHub
com.sksamuel.eels:eel-avro_2.10 0.12.0
eel-avro1 version - Latest release: over 8 years ago - 147 stars on GitHub
org.apache.wayang:wayang-tests-integration_2.11 0.7.1
Wayang integration Tests3 versions - Latest release: 9 months ago - 84 stars on GitHub
io.github.maanaim:hbase-om 1.4.0
A little Java-annotation based compact utility library for HBase that helps you: [1] convert obje...8 versions - Latest release: over 6 years ago - 0 stars on GitHub
com.hurence.logisland:logisland-outlier-detection-plugin 0.9.7 💰
LogIsland is an event mining platform based on Kafka to handle a huge amount of log files.4 versions - Latest release: over 7 years ago - 3 dependent packages - 108 stars on GitHub
com.sksamuel.eel:eel-core_2.11 0.10.0
eel-core1 version - Latest release: over 8 years ago - 147 stars on GitHub
com.baidu.hugegraph:hugegraph-test 0.11.2
hugegraph is a fast-speed, highly-scalable, transactional graph database developed by baidu4 versions - Latest release: over 3 years ago - 2,460 stars on GitHub
com.hazelcast.jet.contrib:elasticsearch-5 0.2
A Hazelcast Jet connector for Elasticsearch (v5.6.x) for querying/indexing objects from/to Elasti...2 versions - Latest release: about 4 years ago - 20 stars on GitHub
Top 4.6% on repo1.maven.org
96 versions - Latest release: 7 days ago - 5 dependent packages - 131 dependent repositories - 9,327 stars on GitHub
io.trino:trino-parquet 447
Trino - Parquet file format support96 versions - Latest release: 7 days ago - 5 dependent packages - 131 dependent repositories - 9,327 stars on GitHub
Top 4.1% on repo1.maven.org
96 versions - Latest release: 7 days ago - 6 dependent packages - 211 dependent repositories - 9,331 stars on GitHub
io.trino:trino-record-decoder 447
Trino - Record-based file format support96 versions - Latest release: 7 days ago - 6 dependent packages - 211 dependent repositories - 9,331 stars on GitHub
Top 3.3% on repo1.maven.org
96 versions - Latest release: 7 days ago - 12 dependent packages - 134 dependent repositories - 9,331 stars on GitHub
io.trino:trino-hive 447
Trino - Hive connector96 versions - Latest release: 7 days ago - 12 dependent packages - 134 dependent repositories - 9,331 stars on GitHub
Top 3.5% on repo1.maven.org
54 versions - Latest release: 7 days ago - 13 dependent packages - 85 dependent repositories - 9,327 stars on GitHub
io.trino:trino-hdfs 447
Trino - Legacy HDFS file system support54 versions - Latest release: 7 days ago - 13 dependent packages - 85 dependent repositories - 9,327 stars on GitHub
io.trino:trino-docs 447
Trino - Documentation96 versions - Latest release: 7 days ago - 9,327 stars on GitHub
org.apache.flink:flink-ml-core_2.12 2.0.0
The Apache Software Foundation provides support for the Apache community of open-source software ...1 version - Latest release: over 2 years ago - 3 dependent packages - 2 dependent repositories - 272 stars on GitHub
pl.touk.nussknacker:nussknacker-lite-kafka-runtime-bin-test_2.12 1.1.1
nussknacker-lite-kafka-runtime-bin-test2 versions - Latest release: over 2 years ago - 1 dependent package - 388 stars on GitHub
pl.touk.nussknacker:nussknacker-process_2.11 0.4.3
nussknacker-process21 versions - Latest release: over 2 years ago - 14 dependent packages - 389 stars on GitHub
pl.touk.nussknacker:nussknacker-flink-util_2.12 1.2.0
nussknacker-flink-util18 versions - Latest release: over 2 years ago - 15 dependent packages - 388 stars on GitHub
pl.touk.nussknacker:nussknacker-flink-util_2.11 0.4.3
nussknacker-flink-util21 versions - Latest release: over 2 years ago - 9 dependent packages - 388 stars on GitHub
pl.touk.nussknacker:nussknacker-bom_2.11 0.4.3
nussknacker-bom5 versions - Latest release: over 2 years ago - 388 stars on GitHub
pl.touk.nussknacker:nussknacker-interpreter_2.11 0.4.3
nussknacker-interpreter21 versions - Latest release: over 2 years ago - 16 dependent packages - 389 stars on GitHub
pl.touk.nussknacker:nussknacker-perf-test_2.11 0.0.6
nussknacker-perf-test3 versions - Latest release: almost 7 years ago - 388 stars on GitHub
pl.touk.nussknacker:nussknacker-lite-runtime_2.12 1.14.0
nussknacker-lite-runtime49 versions - Latest release: about 2 months ago - 6 dependent packages - 389 stars on GitHub
pl.touk.nussknacker:nussknacker-demo_2.12 0.3.0
nussknacker-demo7 versions - Latest release: over 3 years ago - 388 stars on GitHub
org.apache.iotdb:iotdb-grafana 0.12.6
Grafana data source connector for IoTDB21 versions - Latest release: almost 2 years ago - 1 dependent package - 13 dependent repositories - 3,106 stars on GitHub
Top 0.7% on repo1.maven.org
71 versions - Latest release: 2 months ago - 98 dependent packages - 863 dependent repositories - 21,983 stars on GitHub
org.apache.flink:flink-json 1.19.0
The Apache Software Foundation provides support for the Apache community of open-source software ...71 versions - Latest release: 2 months ago - 98 dependent packages - 863 dependent repositories - 21,983 stars on GitHub
Related Keywords
java
2,207
sql
1,428
scala
1,420
python
1,067
flink
955
spark
677
hadoop
658
stream-processing
489
jdbc
462
hive
460
analytics
434
kafka
363
parquet
338
r
322
machine-learning
316
distributed-systems
285
presto
274
data-science
256
database
256
etl
212
cpp
199
ai
185
search-engine
175
vespa
174
vector-search
174
tensorflow
174
serving-recommendation
174
serving
174
server
174
delta-lake
167
apache-flink
164
streaming
156
databases
155
trino
153
query-engine
153
prestodb
153
iceberg
153
distributed-database
153
datalake
153
real-time
149
distributed
149
decision-making
148
flink-kafka
148
gui
148
low-code
148
lowcode
148
marketers
148
touk
148
solr
133
cassandra
132
complex-event-processing
132
elasticsearch
132
influxdb
132
kafka-streams
132
pattern-recognition
132
graph
128
hacktoberfest
127
scalability
116
hazelcast
114
graph-database
109
low-latency
106
xml
104
distributed-computing
102
in-memory
100
caching
100
apache-spark
99
event-driven
94
pipelines
94
smooks
94
enterprise-integration
94
sax
94
geospatial
91
calcite
91
iot
77
tsdb
77
nosql
71
timeseries
71
bookkeeper
70
orc
67
kudu
67
accumulo
67
hbase
63
avro
63
aggregation
62
hive-metastore
61
bigdata
59
drill
57
gpu
57
bioinformatics
55
genomics
55
gbm
53
apache
48
data-processing
47
replication
44
cloud
43
data-engineering
43
hive-table
43
replicate-data
43
s3
43
deep-learning
41