document-automation | pypi.org keywords

pypi.org "document-automation" keyword

View the packages on the pypi.org package registry that are tagged with the "document-automation" keyword.

tikara 0.1.6

The metadata and text content extractor for almost every file type.
6 versions - Latest release: 3 months ago - 214 downloads last month - 1 stars on GitHub - 1 maintainer

Related Keywords

apache-tika 1 content-detection 1 content-extraction 1 content-indexing 1 content-intelligence 1 content-management 1 content-parsing 1 content-processing 1 content-type 1 data-extraction 1 data-parsing 1 data-processing 1 document-ai 1 document-analysis 1 document-classification 1 document-converter 1 document-extraction 1 document-indexing 1 document-intelligence 1 document-management 1 document-metadata 1 document-ocr 1 document-parsing 1 document-processing 1 document-reader 1 document-text 1 document-understanding 1 docx 1 excel 1 file-analysis 1 file-conversion 1 file-format 1 file-identification 1 file-parsing 1 file-processing 1 file-reader 1 file-type 1 format-detection 1 format-identification 1 image-extraction 1 information-extraction 1 language-detection 1 metadata 1 mime-type 1 ocr 1 office-documents 1 pdf 1 pdf-parsing 1 powerpoint 1 structured-data 1 text-analytics 1 text-extraction 1 text-mining 1 text-parsing 1 text-processing 1 text-recognition 1 tika 1 unstructured-data 1 word-documents 1 image-to-text 1 java 1 llm 1 metadata-extraction 1 ml 1 natural-language-processing 1 pdf-to-text 1 retrieval-augmented-generation 1

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Packages

pypi.org "document-automation" keyword

tikara 0.1.6