Ecosyste.ms: Packages

An open API service providing package, version, and dependency metadata for many open source software ecosystems and registries.

Top 3.3% on repo1.maven.org
Top 0.3% dependent packages on repo1.maven.org
Top 0.1% dependent repos on repo1.maven.org
Top 6.8% forks on repo1.maven.org
Top 0.4% docker downloads on repo1.maven.org

repo1.maven.org : org.apache.tika:tika-parsers

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

purl: pkg:maven/org.apache.tika/tika-parsers
Keywords: content, extraction, java, metadata, tika
License: Apache-2.0
Latest release: 2 months ago
First release: almost 15 years ago
Namespace: org.apache.tika
Dependent packages: 310
Dependent repositories: 4,443
Stars: 1,877 on GitHub
Forks: 715 on GitHub
Docker dependents: 573
Docker downloads: 168,682,937
Total commits: 6,295
Committers: 165
Average commits per author: 38.152
Development Distribution Score (DDS): 0.765
More commit stats: commits.ecosyste.ms
See more repository details: repos.ecosyste.ms
Last synced: 5 days ago
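
The purl and commit figures listed above can be cross-checked with a short sketch. It assumes the simple purl shape shown on this page (pkg:type/namespace/name, optionally followed by @version, ?qualifiers, or #subpath) and does not handle percent-encoding. The helper names purl_to_maven_coordinate and average_commits_per_author are hypothetical and not part of any Ecosyste.ms or purl library.

```python
def purl_to_maven_coordinate(purl: str) -> str:
    """Turn a Maven purl into a 'groupId:artifactId' string (hypothetical helper)."""
    scheme, sep, remainder = purl.partition(":")
    if scheme != "pkg" or not sep:
        raise ValueError(f"not a purl: {purl!r}")
    # Drop the optional subpath (#...) and qualifiers (?...).
    remainder = remainder.split("#", 1)[0].split("?", 1)[0]
    package_type, _, path = remainder.strip("/").partition("/")
    if package_type != "maven":
        raise ValueError(f"not a Maven purl: {purl!r}")
    # Drop the optional version (@...).
    path = path.partition("@")[0]
    namespace, _, name = path.rpartition("/")
    if not namespace or not name:
        raise ValueError(f"Maven purl needs a namespace and a name: {purl!r}")
    return f"{namespace}:{name}"


def average_commits_per_author(total_commits: int, committers: int) -> float:
    """Average commits per committer, rounded to three decimals as on the page."""
    return round(total_commits / committers, 3)


if __name__ == "__main__":
    # Values copied from the package details above.
    print(purl_to_maven_coordinate("pkg:maven/org.apache.tika/tika-parsers"))
    print(average_commits_per_author(6295, 165))
```

With the values from this page, the first call yields org.apache.tika:tika-parsers, which matches the package name in the title line, and the second yields 38.152, which matches the listed average.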

org.jesterj:jesterj-ingest 1.0.0 💰
Library for writing JesterJ ingestion plans
4 versions - Latest release: about 1 year ago - 1 dependent repository - 27 stars on GitHub
com.composum.assets:composum-assets-commons-bundle 1.4.0
Asset integration services and API
6 versions - Latest release: about 1 year ago - 5 dependent packages - 2 dependent repositories - 1 star on GitHub
org.apache-extras.camel-extra:camel-wmq 3.20.2
Camel IBM Websphere MQ component
14 versions - Latest release: over 1 year ago - 35 stars on GitHub
com.centit.support:centit-es-client 5.3.2302
Elasticsearch client
8 versions - Latest release: over 1 year ago - 2 dependent packages - 5 dependent repositories
org.exist-db:exist-contentextraction 6.2.0 💰
eXist-db NoSQL Database Content Extraction Extension
15 versions - Latest release: over 1 year ago - 1 dependent package - 7 dependent repositories - 379 stars on GitHub
org.appng:appng-parent 1.24.6
Parent project for all appNG modules
25 versions - Latest release: over 1 year ago - 33 stars on GitHub
Top 7.8% on repo1.maven.org
org.hibernate:hibernate-search-elasticsearch 5.11.12.Final
Hibernate Search backend which has indexing operations forwarded to Elasticsearch
65 versions - Latest release: over 1 year ago - 21 dependent packages - 84 dependent repositories - 445 stars on GitHub
Top 6.0% on repo1.maven.org
org.hibernate:hibernate-search-orm 5.11.12.Final
Hibernate Search integration with Hibernate Core
144 versions - Latest release: over 1 year ago - 139 dependent packages - 2,066 dependent repositories - 440 stars on GitHub
Top 6.2% on repo1.maven.org
org.hibernate:hibernate-search-engine 5.11.12.Final
Core of the Object/Lucene mapper, query engine and index management
144 versions - Latest release: over 1 year ago - 88 dependent packages - 331 dependent repositories - 445 stars on GitHub
org.hibernate:hibernate-search-parent 5.11.12.Final
Hibernate Search Aggregator POM
164 versions - Latest release: over 1 year ago - 2 dependent repositories - 445 stars on GitHub
io.archivesunleashed:aut 1.2.0
An open-source toolkit for analyzing web archives.
27 versions - Latest release: over 1 year ago - 125 stars on GitHub
org.apache.tika:tika-nlp 1.28.5
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from v...
20 versions - Latest release: over 1 year ago - 1 dependent repository - 1,877 stars on GitHub
Top 6.5% on repo1.maven.org
org.apache.tika:tika-bundle 1.28.5
OSGi bundle that contains the tika-parsers component and all its upstream dependencies that aren'...
41 versions - Latest release: over 1 year ago - 8 dependent packages - 47 dependent repositories - 1,646 stars on GitHub
org.yunchen.gb:gb-plugin-tika-parser 1.3.0
6 versions - Latest release: almost 2 years ago
org.appng:appng-application-bom 1.24.5
the bill of materials for an appNG application, defining all provided dependencies
23 versions - Latest release: almost 2 years ago - 8 dependent packages - 8 dependent repositories - 33 stars on GitHub
org.appng:appng-search 1.24.5
Lucene based appNG Search
24 versions - Latest release: almost 2 years ago - 7 dependent packages - 5 dependent repositories - 33 stars on GitHub
org.docetproject:docet-maven-plugin 1.20.0
A flexible easy-to-integrate online documentation manager
10 versions - Latest release: almost 2 years ago - 2 stars on GitHub
com.alibaba.lindorm:lindorm-search-cell 8.10.2
Lindorm Search Content Extraction Library integrates Apache Tika content extraction framework int...
6 versions - Latest release: about 2 years ago - 2 dependent packages
com.alibaba.lindorm:lindorm-search-langid 8.10.2
This module is intended to be used while indexing documents. It is implemented as an UpdateProces...
6 versions - Latest release: about 2 years ago - 2 dependent packages
com.alibaba.lindorm:lindorm-search-dataimporthandler-extras 8.10.2
Lindorm Search DataImportHandler Extras
6 versions - Latest release: about 2 years ago - 2 dependent packages
com.alibaba.lindorm:lindorm-search-parent 8.10.2
Lindorm Search Parent POM
6 versions - Latest release: about 2 years ago
com.alibaba.lindorm:lucene-parent 8.10.2
Lucene parent POM
6 versions - Latest release: about 2 years ago
io.annot8:annot8-components-tika 1.2.2
Components that use Apache Tika to extract content from files
5 versions - Latest release: about 2 years ago - 2 stars on GitHub
org.apache.servicemix.bundles:org.apache.servicemix.bundles.any23 2.7_1
This OSGi bundle wraps apache-any23-core 2.7 jar file.
5 versions - Latest release: about 2 years ago
org.apache.any23:apache-any23-cli 2.7
Command line interface.
8 versions - Latest release: over 2 years ago - 2 dependent packages - 2 dependent repositories - 93 stars on GitHub
fr.pilato.elasticsearch.crawler:fscrawler-core 2.9
FS Crawler offers a simple way to index binary files into elasticsearch.
5 versions - Latest release: over 2 years ago - 4 dependent packages - 6 dependent repositories - 1,178 stars on GitHub
fr.pilato.elasticsearch.crawler:fscrawler-tika 2.9
FS Crawler offers a simple way to index binary files into elasticsearch.
5 versions - Latest release: over 2 years ago - 3 dependent packages - 6 dependent repositories - 1,178 stars on GitHub
fr.pilato.elasticsearch.crawler:fscrawler-parent 2.9
FS Crawler offers a simple way to index binary files into elasticsearch.
5 versions - Latest release: over 2 years ago - 1,178 stars on GitHub
com.alibaba.lindorm:lindorm-search-grandparent 8.9.1
Grandparent POM for Apache Lucene Core and Lindorm Search
3 versions - Latest release: over 2 years ago
org.fcrepo.bom:modeshape-bom-embedded 5.5.1.fcr
Bill of Material (BOM) for embedding ModeShape within JavaSE apps, libraries, and (non-AS) web apps
2 versions - Latest release: over 2 years ago - 1 star on GitHub
org.fcrepo:modeshape-distribution 5.5.1.fcr
ModeShape is a JCR repository implementation with support for federation and sequencing.
3 versions - Latest release: over 2 years ago - 1 star on GitHub
org.fcrepo:modeshape-extractor-tika 5.5.1.fcr
ModeShape text extractor that uses the Apache Tika library
3 versions - Latest release: over 2 years ago - 4 dependent packages - 1 star on GitHub
org.fcrepo:modeshape-jcr 5.5.1.fcr
ModeShape implementation of the JCR API
3 versions - Latest release: over 2 years ago - 26 dependent packages - 1 star on GitHub
org.fcrepo:modeshape-parent 5.5.1.fcr
ModeShape is a JCR repository implementation with support for federation and sequencing.
3 versions - Latest release: over 2 years ago - 1 star on GitHub
com.liferay.portal:release.dxp.bom.third.party 7.4.13
292 versions - Latest release: over 2 years ago - 1 dependent repository - 1,949 stars on GitHub
org.apache.oodt:cas-filemgr 1.9.1
The file management component of a Catalog and Archive Service. This component purposefully separ...
22 versions - Latest release: over 2 years ago - 14 dependent packages - 15 dependent repositories - 22 stars on GitHub
org.apache.oodt:oodt-core 1.9.1
The Apache Software Foundation provides support for the Apache community of open-source software ...
22 versions - Latest release: over 2 years ago - 28 stars on GitHub
com.composum.meta.sling.dependencies:sling-starter-dependencies-11 1.4
This specifies the versions of the OSGI bundles included in the Sling Starter as maven dependenci...
5 versions - Latest release: over 2 years ago - 1 dependent package - 1 dependent repository - 0 stars on GitHub
io.bigconnect:dw-text-extractor 4.2.1
Various DataWorker Plugins for BigConnect
1 version - Latest release: almost 3 years ago - 1 dependent package - 1 dependent repository - 0 stars on GitHub
io.bigconnect:dw-mime-type-detector 4.2.1
Various DataWorker Plugins for BigConnect
1 version - Latest release: almost 3 years ago - 2 dependent packages - 2 dependent repositories - 0 stars on GitHub
com.graysonnorland.pdfmantis:pdf-mantis 0.0.1
Simplified PDF Data Extraction
1 version - Latest release: almost 3 years ago - 5 stars on GitHub
com.github.euler-io:euler-tika 0.10.0
Euler - File Processing API - Tika module.
34 versions - Latest release: almost 3 years ago - 3 dependent packages - 2 dependent repositories - 1 star on GitHub
com.github.euler-io:euler-common 0.10.0
Euler - File Processing API - Common classes module.
34 versions - Latest release: almost 3 years ago - 5 dependent packages - 2 dependent repositories - 1 star on GitHub
com.github.storm-bit:vk-bot-sdk-kotlin 1.2.0
The Kotlin library for working with VK api
19 versions - Latest release: almost 3 years ago
Top 7.0% on repo1.maven.org
com.liferay.portal:release.portal.bom.third.party 7.4.2
140 versions - Latest release: almost 3 years ago - 4 dependent packages - 25 dependent repositories - 1,950 stars on GitHub
com.huemulsolutions.bigdata:huemul-bigdatagovernance_2.12 2.6.3
Enable full data quality and data lineage for BigData Projects. Huemul BigDataGovernance is a ...
1 version - Latest release: almost 3 years ago - 2 dependent repositories - 11 stars on GitHub
org.entando:entando-core-parent 6.4.8
Entando Core Maven Parent POM
4 versions - Latest release: almost 3 years ago - 6 dependent repositories
org.bidib.org.codehaus.izpack:izpack 5.2.0.M2
The IzPack parent module
2 versions - Latest release: almost 3 years ago - 316 stars on GitHub
org.bidib.org.codehaus.izpack:izpack-compiler 5.2.0.M2
The IzPack parent module
2 versions - Latest release: almost 3 years ago - 6 dependent packages - 1 dependent repository - 316 stars on GitHub
com.github.bogdanovmn.txtparser:text-parser 1.0.3
This library uses Apache Tika under the hood. It provides more convenient API.
2 versions - Latest release: about 3 years ago - 1 dependent repository - 0 stars on GitHub
com.huemulsolutions.bigdata:huemul-bigdatagovernance 2.6.3
Enable full data quality and data lineage for BigData Projects. Huemul BigDataGovernance is a ...
20 versions - Latest release: about 3 years ago - 1 dependent repository - 11 stars on GitHub
uk.ac.gate:gate-core 9.0.1
GATE - general architecture for text engineering - is open source software capable of solving alm...
18 versions - Latest release: about 3 years ago - 23 dependent packages - 145 dependent repositories - 73 stars on GitHub
com.goikosoft.crawler4j:crawler4j 4.5.11
crawler4j: Open Source Web Crawler for Java. Modified by Dario Goikoetxea to add POST capabilities
16 versions - Latest release: over 3 years ago - 1 dependent package - 1 dependent repository - 1 star on GitHub
Top 7.5% on repo1.maven.org
com.adobe.granite:com.adobe.granite.poi 2.0.28
The parent project for Granite, the Open Web Stack
5 versions - Latest release: over 3 years ago - 6 dependent packages - 15 dependent repositories
Top 7.1% on repo1.maven.org
com.adobe.cq.social:cq-social-commons 1.8.1
43 versions - Latest release: over 3 years ago - 15 dependent packages - 6 dependent repositories
com.day.cq:cq-quickstart-product-dependencies 5.6.1
Description
4 versions - Latest release: over 3 years ago - 24 dependent repositories
com.day.cq.wcm:cq-wcm-designimporter 1.0.14
Import HTML and collateral with a ZIP file and creates a canvas component
4 versions - Latest release: over 3 years ago - 4 dependent packages - 4 dependent repositories
Top 6.0% on repo1.maven.org
com.day.cq.dam:cq-dam-core 5.6.14
Bundle implementing the core DAM functionality for Communique 5
9 versions - Latest release: over 3 years ago - 13 dependent packages - 14 dependent repositories
com.day.cq.mcm:cq-mcm-landingpage 1.0.16
Bundle implementing the MCM Landing Page functionality
2 versions - Latest release: over 3 years ago - 3 dependent packages - 3 dependent repositories
Top 7.1% on repo1.maven.org
com.day.crx:crx-core 2.6.8
The CRX Core Library
277 versions - Latest release: over 3 years ago - 31 dependent packages - 4 dependent repositories
org.entando.entando:entando-engine 6.2.27
Entando Engine: an agile, modern and user-centric open source Portal platform.
16 versions - Latest release: over 3 years ago - 13 dependent packages - 57 dependent repositories - 0 stars on GitHub
org.entando.entando.plugins:entando-plugin-jacms 6.2.22
Allows registered users to manage dynamic contents and digital assets
11 versions - Latest release: over 3 years ago - 15 dependent packages - 22 dependent repositories - 0 stars on GitHub
com.strapdata.elasticsearch.plugin:ingest-attachment 6.2.3.31
Elasticsearch subproject :plugins:ingest-attachment
12 versions - Latest release: almost 4 years ago - 1,671 stars on GitHub
Top 7.9% on repo1.maven.org
org.opencms:opencms-core 11.0.2
OpenCms is an enterprise-ready, easy to use website content management system based on Java and X...
31 versions - Latest release: almost 4 years ago - 127 dependent packages - 22 dependent repositories - 488 stars on GitHub
com.antkorwin:mimetype 0.2
This library resolves a mime-type of file by the binary content.
2 versions - Latest release: almost 4 years ago - 0 stars on GitHub
ai.platon.pulsar:platon 3
The Apache Software Foundation provides support for the Apache community of open-source software ...
1 version - Latest release: almost 4 years ago - 1 star on GitHub
com.rd4j:thirdparty-bom 1.0.0
Third-party Maven BOM
2 versions - Latest release: almost 4 years ago - 0 stars on GitHub
it.unimi.di:mg4j-big 5.4.4
MG4J (Managing Gigabytes for Java) is a free full-text search engine for large document collectio...
9 versions - Latest release: about 4 years ago - 5 dependent packages - 3 dependent repositories - 1 star on GitHub
io.github.muxiaobai:test.tool.demo 1.0.0-RELEASE
java demo jar
1 version - Latest release: about 4 years ago - 0 stars on GitHub
de.envisia.ipp:akka-ipp_2.13 0.6.0
akka-ipp
2 versions - Latest release: about 4 years ago - 0 stars on GitHub
uk.ac.gate.mimir:mimir-core 6.1.1
GATE Mímir is a Multiparadigm Information Management Index and Repository, a tool for indexing da...
5 versions - Latest release: over 4 years ago - 4 dependent packages - 2 dependent repositories - 10 stars on GitHub
ai.stainless:grails-tika 0.1.0
Provides Tika libraries and services for Grails 4+ apps
1 version - Latest release: over 4 years ago - 0 stars on GitHub
io.quarkus:quarkus-universe-bom-deployment 0.28.1
Quarkus universe aggregates extensions from Quarkus Core and those developed by the community int...
26 versions - Latest release: over 4 years ago - 2 dependent repositories - 86 stars on GitHub
Top 3.4% on repo1.maven.org
io.quarkus:quarkus-tika 0.28.1
This is the parent of relocated artifacts, that are still released for compatibility reasons.
213 versions - Latest release: over 4 years ago - 9 dependent packages - 146 dependent repositories - 11,515 stars on GitHub
com.bowriverstudio:fscrawler-core 2.6
FS Crawler with custom OCR(Microsoft Computer Vision) offers a simple way to index binary files i...
1 version - Latest release: over 4 years ago - 2 dependent packages - 0 stars on GitHub
com.bowriverstudio:fscrawler-tika 2.6
FS Crawler with custom OCR(Microsoft Computer Vision) offers a simple way to index binary files i...
1 version - Latest release: over 4 years ago - 3 dependent packages - 0 stars on GitHub
com.bowriverstudio:fscrawler-parent 2.6
FS Crawler with custom OCR(Microsoft Computer Vision) offers a simple way to index binary files i...
1 version - Latest release: over 4 years ago - 0 stars on GitHub
io.quarkus:quarkus-platform-bom-deployment 0.0.1
Quarkus Platform aggregates extensions from Quarkus Core and those developed by the community int...
1 version - Latest release: over 4 years ago - 86 stars on GitHub
io.quarkus:quarkus-platform-bom 0.0.1
Quarkus Platform aggregates extensions from Quarkus Core and those developed by the community int...
1 version - Latest release: over 4 years ago - 1 dependent repository - 86 stars on GitHub
io.committed.krill:krill 1.1.0
Uses Apache Tika (https://tika.apache.org/) and PDFBox (https://pdfbox.apache.org/) with subseque...
7 versions - Latest release: over 4 years ago - 1 dependent package - 1 dependent repository - 4 stars on GitHub
com.github.thiagoleitecarvalho:convertertopdf 2.3.2
A 100% free component that converts several file formats to PDF
3 versions - Latest release: over 4 years ago - 11 stars on GitHub
org.scalanlp:epic_2.12 0.5.1
epic
6 versions - Latest release: almost 5 years ago - 1 dependent package - 1 dependent repository - 472 stars on GitHub
org.scalanlp:epic-html_2.12 0.5.1
epic-html
2 versions - Latest release: almost 5 years ago - 472 stars on GitHub
org.scalanlp:epic_2.11 0.5.1
epic
11 versions - Latest release: almost 5 years ago - 5 dependent packages - 1 dependent repository - 472 stars on GitHub
org.scalanlp:epic-html_2.11 0.5.1
epic-html
2 versions - Latest release: almost 5 years ago - 472 stars on GitHub
de.envisia.ipp:akka-ipp_2.12 0.5.0
akka-ipp
5 versions - Latest release: almost 5 years ago - 0 stars on GitHub
uk.gov.dstl.baleen:baleen-collectionreaders 2.7.0
Collection Readers for Baleen
7 versions - Latest release: about 5 years ago - 3 dependent packages - 2 dependent repositories - 147 stars on GitHub
org.imixs.workflow:imixs-adapters-documents 1.6.2
Imixs Documents Adapter
1 version - Latest release: about 5 years ago - 4 stars on GitHub
org.jboss.integration-platform:jboss-integration-platform-bom 8.6.0.Final
Bill Of Materials for SwitchYard, Drools, OptaPlanner, jBPM, Overlord, ...
64 versions - Latest release: about 5 years ago - 17 dependent packages - 61 dependent repositories - 6 stars on GitHub
org.apache.jmeter:ApacheJMeter_parent 5.1.1
Apache JMeter is open source software, a 100% pure Java desktop application designed to load test...
16 versions - Latest release: about 5 years ago - 3 dependent repositories
com.thinkbiganalytics.kylo:kylo-file-metadata-core 0.10.0
Kylo is an enterprise-ready, open source, data lake management software platform for Hadoop and S...
1 version - Latest release: over 5 years ago - 4 dependent repositories - 1,086 stars on GitHub
com.digitalpebble.stormcrawler:storm-crawler-tika 1.12.1 💰
Tika-based parser bolt for StormCrawler
37 versions - Latest release: over 5 years ago - 4 dependent repositories - 789 stars on GitHub
com.uttesh:exude 0.0.4
exude library will filter the stopping/stemming/swearing words from file/text.
4 versions - Latest release: over 5 years ago - 8 dependent repositories - 21 stars on GitHub
com.github.vantonov1:basalt-fulltext 1.0
Implementation of document-oriented repository on top of relational DBMS
1 version - Latest release: almost 6 years ago - 1 dependent package - 1 dependent repository - 0 stars on GitHub
com.github.vantonov1:basalt-content 1.0
Implementation of document-oriented repository on top of relational DBMS
1 version - Latest release: almost 6 years ago - 2 dependent packages - 1 dependent repository - 0 stars on GitHub
com.liferay:com.liferay.ce.portal.third.party 7.1.0
4 versions - Latest release: almost 6 years ago - 1,950 stars on GitHub
com.liferay:com.liferay.ce.portal.bom 7.1.0
4 versions - Latest release: almost 6 years ago - 1 dependent repository - 1,950 stars on GitHub
info.setmy.services:java-services 1.0.6
Base project for commons services, helpers, Utils, DAOs, repositories etc
7 versions - Latest release: about 6 years ago - 2 dependent packages
com.kasisoft.mgnl:ks-mgnl-dependencies 5.6-3
Magnolia related dependencies
4 versions - Latest release: about 6 years ago - 5 dependent packages
uk.ac.shef.dcs:jate 2.0-beta.11
JATE is a toolkit for developing and experimenting Automatic Term Extractions/Recognition algorit...
4 versions - Latest release: about 6 years ago - 1 dependent repository - 81 stars on GitHub