pypi.org "word-documents" keyword
View the packages on the pypi.org package registry that are tagged with the "word-documents" keyword.
table-extractor 0.3 💰
Extract normalized tables from CSVs, Excel Spreadsheets, PDFs, Word Docs, and Web Pages7 versions - Latest release: over 1 year ago - 1 dependent repositories - 158 downloads last month - 1 stars on GitHub - 1 maintainer
tikara 0.1.6
The metadata and text content extractor for almost every file type.6 versions - Latest release: 3 months ago - 214 downloads last month - 1 stars on GitHub - 1 maintainer
Related Keywords
docx
2
excel
2
retrieval-augmented-generation
1
pdf-parsing
1
pdf
1
office-documents
1
ocr
1
mime-type
1
metadata
1
language-detection
1
information-extraction
1
image-extraction
1
format-identification
1
format-detection
1
file-type
1
file-reader
1
file-processing
1
file-parsing
1
file-identification
1
file-format
1
pdf-to-text
1
natural-language-processing
1
ml
1
metadata-extraction
1
llm
1
java
1
image-to-text
1
unstructured-data
1
tika
1
text-recognition
1
text-processing
1
text-parsing
1
text-mining
1
text-extraction
1
text-analytics
1
structured-data
1
powerpoint
1
file-conversion
1
data-processing
1
data-parsing
1
data-extraction
1
content-type
1
content-processing
1
content-parsing
1
content-management
1
content-intelligence
1
content-indexing
1
content-extraction
1
content-detection
1
apache-tika
1
tsv
1
doc
1
csv
1
python
1
table
1
file-analysis
1
document-understanding
1
document-text
1
document-reader
1
document-processing
1
document-parsing
1
document-ocr
1
document-metadata
1
document-management
1
document-intelligence
1
document-indexing
1
document-extraction
1
document-converter
1
document-classification
1
document-automation
1
document-analysis
1
document-ai
1