An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "table-extraction" keyword

View the packages on the pypi.org package registry that are tagged with the "table-extraction" keyword.

Top 0.8% on pypi.org
pdfplumber 0.11.6
Plumb a PDF for detailed information about each char, rectangle, and line.
72 versions - Latest release: 22 days ago - 118 dependent packages - 1,210 dependent repositories - 3.59 million downloads last month - 7,545 stars on GitHub - 1 maintainer
Top 1.4% on pypi.org
pymupdf 1.25.5
A high performance Python library for data extraction, analysis, conversion & manipulation of PDF...
131 versions - Latest release: 18 days ago - 206 dependent packages - 1,798 dependent repositories - 8.36 million downloads last month - 6,889 stars on GitHub - 1 maintainer
pdfplumber-aemc 0.11.3
Plumb a PDF for detailed information about each char, rectangle, and line.
16 versions - Latest release: 12 months ago - 1 dependent repositories - 471 downloads last month - 7,545 stars on GitHub - 1 maintainer
pdfautonup 1.11.0
Convert PDF files to 'n-up' PDF files, guessing the output layout.
23 versions - Latest release: 4 months ago - 1 dependent package - 1 dependent repositories - 946 downloads last month - 6,889 stars on GitHub - 1 maintainer
aqpymupdf 1.23.7
A high performance Python library for data extraction, analysis, conversion & manipulation of PDF...
1 version - Latest release: about 1 year ago - 67 downloads last month - 6,889 stars on GitHub - 1 maintainer
pdfmod 0.1.5
A tool for PDF file manipulation.
1 version - Latest release: 5 months ago - 62 downloads last month - 6,368 stars on GitHub - 1 maintainer
docext 0.1.12
Onprem information extraction from documents
10 versions - Latest release: 9 days ago - 753 downloads last month - 100 stars on GitHub - 1 maintainer
pdftablr 0.1.0
Python3 implementation of Kyle Cronan's pdftable module, with unit tests
1 version - Latest release: over 7 years ago - 1 dependent repositories - 102 downloads last month - 2 stars on GitHub - 1 maintainer
kreuzberg 3.1.3
A text extraction library supporting PDFs, images, office documents and more
19 versions - Latest release: 9 days ago - 6.53 thousand downloads last month - 1,736 stars on GitHub - 1 maintainer
quipucamayoc 0.1.2
Tools to extract information from digitized historical documents
2 versions - Latest release: about 3 years ago - 1 dependent repositories - 85 downloads last month - 28 stars on GitHub - 1 maintainer
Top 7.1% on pypi.org
img2table 1.4.1
img2table is a table identification and extraction Python Library for PDF and images, based on Op...
56 versions - Latest release: 2 months ago - 1 dependent package - 4 dependent repositories - 31.5 thousand downloads last month - 693 stars on GitHub - 1 maintainer
depdf 0.2.2
PDF table & paragraph extractor
4 versions - Latest release: almost 5 years ago - 1 dependent repositories - 193 downloads last month - 11 stars on GitHub - 1 maintainer
extractable 1.0.2
Extract tables from PDFs
124 versions - Latest release: 11 months ago - 1 dependent repositories - 3.66 thousand downloads last month - 22 stars on GitHub - 1 maintainer
table-transformer 1.0.6
Table Transformer
5 versions - Latest release: 7 months ago - 408 downloads last month - 2,546 stars on GitHub - 1 maintainer
markdrop 0.3.1
A comprehensive PDF processing toolkit that converts PDFs to markdown with advanced AI-powered fe...
19 versions - Latest release: 3 months ago - 886 downloads last month - 84 stars on GitHub - 1 maintainer
extracttable 2.4.0
Extract table data from images and scanned PDFs. Easily convert image to excel, convert pdf to table
16 versions - Latest release: almost 3 years ago - 1 dependent repositories - 1.43 thousand downloads last month - 273 stars on GitHub - 1 maintainer
pyany2json 0.1.3
Python binding to Any2Json
4 versions - Latest release: 12 months ago - 118 downloads last month - 0 stars on GitHub - 1 maintainer
tablecv 0.1.1
Table extraction from image.
2 versions - Latest release: over 1 year ago - 236 downloads last month - 3 stars on GitHub - 1 maintainer
Top 1.8% on pypi.org
pymupdfb 1.24.10
MuPDF shared libraries for PyMuPDF.
18 versions - Latest release: 8 months ago - 4 dependent packages - 133 dependent repositories - 1.62 million downloads last month - 6,195 stars on GitHub - 1 maintainer
Related Keywords
pdf 12 python 9 ocr 9 text-processing 6 tesseract 6 xps 5 text-shaping 5 pymupdf 5 pdf-documents 5 mupdf 5 font 5 extract-data 5 epub 5 data-science 5 pdf-parsing 3 rag 2 opencv 2 document-processing 2 table-detection 2 image-to-text 2 generation 1 content-description 1 advanced-ml-models 1 high-quality-image-extraction 1 document-structure-preservation 1 local-files 1 advanced-pdf-processing 1 url-support 1 ai-powered-pdf-processing 1 image-extraction 1 pdf-to-markdown-ai 1 pdf-to-markdown-tool 1 pdf-to-markdown-converter 1 pdf-to-table 1 pdf-to-image 1 pdf-to-text 1 pdf-to-markdown 1 image-analysis 1 llm 1 ai 1 converter 1 table-extract-python 1 table-extract 1 table 1 opencv-table-extraction 1 opencv-table 1 opencv-python 1 python table extraction 1 python table image 1 opencv python 1 servier 1 semi-structured-data 1 excel 1 tabular-data 1 pdf-table-extract 1 image-table-recognition 1 extracttable 1 table-to-text 1 pypi-package 1 open-source 1 markitdown 1 marker 1 markdrop 1 docling 1 pandoc 1 textmining 1 python3-library 1 csv 1 python3 1 vlms 1 unstructured-data 1 pdf-extractor 1 onpremise 1 onprem-vision 1 onprem-ocr 1 onprem 1 ocr-onpremise 1 nlp 1 machine-learning 1 llms 1 llm-ocr 1 extraction 1 document-data-extraction 1 document-analysis 1 document 1 nup 1 markdown 1 table-structure-recognition 1 table-functional-analysis 1 Computer Vision 1 Table Transformer 1 TATR 1 pdftk 1 paragraph-extraction 1 pdf-to-html 1 image-processing 1 textract 1 table-ocr 1 poppler 1 ocr-python 1