Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

Top 3.9% on repo1.maven.org
Top 1.4% dependent packages on repo1.maven.org
Top 1.0% dependent repos on repo1.maven.org
Top 6.8% forks on repo1.maven.org
Top 1.0% docker downloads on repo1.maven.org

repo1.maven.org : org.apache.tika:tika-parsers-standard-package

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

Registry - Source - Homepage - Documentation - JSON
purl: pkg:maven/org.apache.tika/tika-parsers-standard-package
Keywords: content, extraction, java, metadata, tika
License: Apache-2.0
Latest release: 3 months ago
First release: about 3 years ago
Namespace: org.apache.tika
Dependent packages: 54
Dependent repositories: 234
Stars: 1,877 on GitHub
Forks: 715 on GitHub
Docker dependents: 44
Docker downloads: 10,733,359
Total Commits: 6295
Committers: 165
Average commits per author: 38.152
Development Distribution Score (DDS): 0.765
More commit stats: commits.ecosyste.ms
See more repository details: repos.ecosyste.ms
Last synced: 10 days ago

org.opensearch.plugin:ingest-attachment 2.14.0
OpenSearch subproject :plugins:ingest-attachment
44 versions - Latest release: 29 days ago - 6,605 stars on GitHub
org.uitnet.testing:smart-testauto-framework 7.0.0
Smart Software Testing Automation Framework is a tool used to automate the testing of software ap...
54 versions - Latest release: about 1 month ago - 2 dependent repositories - 1 stars on GitHub
org.icij.extract:extract-lib 6.7.3
A cross-platform command line tool for parallelised content extraction and analysis.
71 versions - Latest release: about 2 months ago - 2 dependent packages - 2 dependent repositories - 218 stars on GitHub
io.kestra.plugin:plugin-tika 0.16.0
Extract text and metadata using Apache Tika in Kestra workflows.
12 versions - Latest release: 2 months ago - 0 stars on GitHub
ai.platon.pulsar:pulsar-ql 1.12.4
Scrape Web data at scale completely and accurately with high performance, distributed AI-RPA.
62 versions - Latest release: 2 months ago - 10 dependent packages - 4 dependent repositories - 402 stars on GitHub
ai.platon.pulsar:pulsar-all 1.12.4
Scrape Web data at scale completely and accurately with high performance, distributed AI-RPA.
52 versions - Latest release: 2 months ago - 3 dependent packages - 7 dependent repositories - 402 stars on GitHub
ai.platon.pulsar:pulsar 1.12.4
Scrape Web data at scale completely and accurately with high performance, distributed AI-RPA.
61 versions - Latest release: 2 months ago - 402 stars on GitHub
ai.platon.pulsar:pulsar-skeleton 1.12.4
Scrape Web data at scale completely and accurately with high performance, distributed AI-RPA.
61 versions - Latest release: 2 months ago - 15 dependent packages - 2 dependent repositories - 402 stars on GitHub
ai.platon.pulsar:pulsar-parse 1.12.4
Scrape Web data at scale completely and accurately with high performance, distributed AI-RPA.
61 versions - Latest release: 2 months ago - 6 dependent packages - 2 dependent repositories - 402 stars on GitHub
org.pageseeder.flint:pso-flint-berlioz-tika 9.9.3
Flint framework
16 versions - Latest release: 2 months ago - 0 stars on GitHub
com.databricks.labs:tika-ocr 0.1.6
Using Apache tika and tesseract to extact text from any document
5 versions - Latest release: 2 months ago - 1 stars on GitHub
org.apache.tika:tika-detector-siegfried 2.9.2
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from v...
7 versions - Latest release: 3 months ago - 1 dependent package - 1,961 stars on GitHub
Top 4.5% on repo1.maven.org
org.apache.tika:tika-java7 2.9.2
Java-7 reliant components, including FileTypeDetector implementations
48 versions - Latest release: 3 months ago - 26 dependent packages - 59 dependent repositories - 1,649 stars on GitHub
org.apache.tika:tika-server-standard 2.9.2
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from v...
16 versions - Latest release: 3 months ago - 2 dependent repositories - 1,877 stars on GitHub
Top 4.6% on repo1.maven.org
org.apache.tika:tika-app 2.9.2
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from v...
59 versions - Latest release: 3 months ago - 17 dependent packages - 160 dependent repositories - 1,877 stars on GitHub
org.apache.tika:tika-parsers-extended-integration-tests 2.9.2
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from v...
17 versions - Latest release: 3 months ago - 1,961 stars on GitHub
org.apache.tika:tika-parser-scientific-package 2.9.2
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from v...
14 versions - Latest release: 3 months ago - 1 dependent package - 8 dependent repositories - 1,961 stars on GitHub
org.apache.tika:tika-bom 2.9.2
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from v...
11 versions - Latest release: 3 months ago - 2 dependent repositories - 1,961 stars on GitHub
org.apache.camel:camel-tika 4.5.0
This component integrates with Apache Tika to extract content and metadata from thousands of file...
140 versions - Latest release: 3 months ago - 16 dependent packages - 43 dependent repositories - 22 stars on GitHub
org.teamapps:universal-db 0.6.18
Ultra fast TeamApps database
80 versions - Latest release: 4 months ago - 5 dependent packages - 4 dependent repositories - 6 stars on GitHub
org.apache.jackrabbit:jackrabbit-standalone-components 2.21.25
Jackrabbit components for Jackrabbit Standalone
49 versions - Latest release: 4 months ago - 4 dependent packages - 15 dependent repositories - 22 stars on GitHub
org.apache.jackrabbit:jackrabbit-jca 2.21.25
A resource adapter for Jackrabbit as specified by JCA 1.0 and 1.5.
226 versions - Latest release: 4 months ago - 3 dependent packages - 13 dependent repositories - 22 stars on GitHub
org.apache.jackrabbit:jackrabbit-webapp 2.21.25
Web application that hosts and serves a Jackrabbit content repository
226 versions - Latest release: 4 months ago - 7 dependent packages - 19 dependent repositories - 22 stars on GitHub
org.apache.jackrabbit:jackrabbit-core 2.21.25
Jackrabbit content repository implementation
238 versions - Latest release: 4 months ago - 233 dependent packages - 1,295 dependent repositories - 22 stars on GitHub
org.apache.jackrabbit:jackrabbit-parent 2.21.25
The Apache Jackrabbitβ„’ content repository is a fully conforming implementation of the Content Rep...
212 versions - Latest release: 4 months ago - 1 dependent repositories - 28 stars on GitHub
io.github.odys-z:album-lib 0.4.40
Album common lib for Android client and jserv
31 versions - Latest release: 4 months ago - 1 dependent package - 2 dependent repositories - 2 stars on GitHub
org.apache.nifi:nifi-media-processors 1.25.0
NiFi Extensions Bill of Materials
58 versions - Latest release: 5 months ago - 1 dependent package - 65 dependent repositories
org.apache.openmeetings:openmeetings-util 7.2.0
Module for utility classes being used by all OpenMeetings modules
29 versions - Latest release: 6 months ago - 2 dependent packages - 12 dependent repositories - 566 stars on GitHub
org.apache.openmeetings:openmeetings-parent 7.2.0
Parent project for all OpenMeetings Maven modules. Required to hold general settings
29 versions - Latest release: 6 months ago - 1 dependent repositories - 566 stars on GitHub
org.dspace:dspace-parent 7.6.1
DSpace open source software is a turnkey institutional repository application.
94 versions - Latest release: 7 months ago - 1 dependent repositories - 781 stars on GitHub
Top 5.1% on repo1.maven.org
org.dspace:dspace-api 7.6.1
DSpace core data model and service APIs.
95 versions - Latest release: 7 months ago - 45 dependent packages - 319 dependent repositories - 718 stars on GitHub
com.eurodyn.qlack.fuse:qlack-fuse-content-manager 3.6.4
QLACK is an ecosystem of software development libraries and utilities, targeting Java and Angular...
37 versions - Latest release: 7 months ago - 1 dependent package - 7 stars on GitHub
org.clulab:pdf2txt-tika_2.12 1.1.5
The pdf2txt-tika subproject implements an interface to the tika PDF converters.
6 versions - Latest release: 8 months ago - 1 dependent package - 4 stars on GitHub
org.clulab:pdf2txt-tika_2.11 1.1.5
The pdf2txt-tika subproject implements an interface to the tika PDF converters.
6 versions - Latest release: 8 months ago - 1 dependent package - 4 stars on GitHub
org.apache.jspwiki:jspwiki-tika-searchprovider 2.12.1
Apache JSPWiki Tika Search provider
11 versions - Latest release: 10 months ago - 1 dependent package - 6 dependent repositories - 93 stars on GitHub
org.apache.jspwiki:jspwiki-builder 2.12.1
Apache JSPWiki is a leading open source WikiWiki engine, feature-rich and built around standard J...
20 versions - Latest release: 10 months ago - 1 dependent repositories - 93 stars on GitHub
io.muenchendigital.digiwf:digiwf-engine-service 0.18.0
Workflow Engine used by DigiWF
19 versions - Latest release: 11 months ago - 1 dependent package - 1 dependent repositories - 9 stars on GitHub
io.muenchendigital.digiwf:digiwf-email-integration-core 0.18.0
E-Mail integration used by DigiWF
14 versions - Latest release: 11 months ago - 2 dependent packages - 1 dependent repositories - 9 stars on GitHub
org.craftercms:crafter-search-elasticsearch 4.0.6
Crafter Search Elasticsearch
77 versions - Latest release: 11 months ago - 2 dependent packages - 5 dependent repositories - 7 stars on GitHub
org.dkpro.core:dkpro-core-io-tika-asl 2.4.0
DKPro Core is a collection of software components for natural language processing (NLP) based on ...
14 versions - Latest release: 12 months ago - 2 dependent packages - 2 dependent repositories - 196 stars on GitHub
uk.bl.wa.discovery:digipres-tika 3.3.0
Sonatype helps open source projects to set up Maven repositories on https://oss.sonatype.org/
7 versions - Latest release: about 1 year ago - 1 dependent package - 1 dependent repositories - 112 stars on GitHub
de.hs-heilbronn.mi:crawler4j-core 5.0.2
Open Source Web Crawler for Java
22 versions - Latest release: about 1 year ago - 3 dependent packages - 1 dependent repositories - 17 stars on GitHub
org.jesterj:jesterj-ingest 1.0.0 πŸ’°
Library for writing JesterJ ingestion plans
4 versions - Latest release: about 1 year ago - 1 dependent repositories - 27 stars on GitHub
org.exist-db:exist-contentextraction 6.2.0 πŸ’°
eXist-db NoSQL Database Content Extraction Extension
15 versions - Latest release: over 1 year ago - 1 dependent package - 7 dependent repositories - 379 stars on GitHub
io.annot8:annot8-components-tika 1.2.2
Components that use Apache Tika to extract content from files
5 versions - Latest release: about 2 years ago - 2 stars on GitHub
org.apache.servicemix.bundles:org.apache.servicemix.bundles.any23 2.7_1
This OSGi bundle wraps apache-any23-core 2.7 jar file.
5 versions - Latest release: over 2 years ago
org.apache.any23:apache-any23-cli 2.7
Command line interface.
8 versions - Latest release: over 2 years ago - 2 dependent packages - 2 dependent repositories - 93 stars on GitHub
fr.pilato.elasticsearch.crawler:fscrawler-tika 2.9
FS Crawler offers a simple way to index binary files into elasticsearch.
5 versions - Latest release: over 2 years ago - 3 dependent packages - 6 dependent repositories - 1,178 stars on GitHub
fr.pilato.elasticsearch.crawler:fscrawler-parent 2.9
FS Crawler offers a simple way to index binary files into elasticsearch.
5 versions - Latest release: over 2 years ago - 1,178 stars on GitHub
com.digitalpebble.stormcrawler:storm-crawler-tika 1.12.1 πŸ’°
Tika-based parser bolt for StormCrawler
37 versions - Latest release: over 5 years ago - 4 dependent repositories - 789 stars on GitHub
org.apache.any23:apache-any23-core 0.9.0
Core Any23 library implementation.
13 versions - Latest release: over 10 years ago - 19 dependent packages - 63 dependent repositories - 93 stars on GitHub
org.apache.any23:apache-any23-encoding 0.9.0
Encoding detection library.
12 versions - Latest release: over 10 years ago - 7 dependent packages - 38 dependent repositories - 93 stars on GitHub
org.apache.any23:apache-any23-mime 0.9.0
MIME Type detection library.
12 versions - Latest release: over 10 years ago - 5 dependent packages - 4 dependent repositories - 93 stars on GitHub
org.apache.any23:apache-any23 0.9.0
Anything To Triples (any23) is a library, a web service and a command line tool that extracts str...
13 versions - Latest release: over 10 years ago - 93 stars on GitHub