An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

packagist.org "text-extraction" keyword

View the packages on the packagist.org package registry that are tagged with the "text-extraction" keyword.

centertap/tika-all-the-files 2.0.0 💰
Mediawiki extension that provides extraction of searchable text and metadata from uploaded files,...
4 versions - Latest release: over 1 year ago - 97 downloads total - 1 stars on GitHub - 1 maintainer
manofstrong/sitescrapper v0.0.1
A Package to Scrape Websites from their Sitemaps and Extract Relevant Content from the Webpage an...
1 version - Latest release: almost 6 years ago - 71 downloads total - 6 stars on GitHub - 1 maintainer
Top 2.9% on packagist.org
vaites/php-apache-tika v1.4.0
Apache Tika bindings for PHP: extracts text from documents and images (with OCR), metadata and mo...
42 versions - Latest release: 4 months ago - 2 dependent packages - 16 dependent repositories - 1.27 million downloads total - 117 stars on GitHub - 1 maintainer
oneofftech/parse-client v0.2.0
Parse PDF document keeping the structure.
2 versions - Latest release: 5 months ago - 315 downloads total - 0 stars on GitHub - 1 maintainer
dayrev/extractor v1.2.2
Web Page Content Extractor
7 versions - Latest release: over 8 years ago - 11 downloads total - 2 stars on GitHub - 1 maintainer
vasilgerginski/filamentphp-text-extractor v1.3.0
Extract and manage translatable text from Filament models with an intuitive admin interface
5 versions - Latest release: 2 months ago - 6 downloads total - 0 stars on GitHub - 1 maintainer
apache-solr-for-typo3/tika 13.0.0 💰
Apache Tika for TYPO3
30 versions - Latest release: 7 months ago - 2 dependent repositories - 520 thousand downloads total - 9 stars on GitHub - 3 maintainers