npmjs.org : read-pdf2llm
High-performance PDF text extractor (with OCR fallback) for Node.js, optimized for LLM pipelines. Uses PDFium, Tesseract, and a C++ addon.
purl: pkg:npm/read-pdf2llm
Keywords: pdf, ocr, nodejs, native-addon, tesseract, pdfium, llm, text-extraction
License: MIT
Latest release: about 2 months ago
First release: about 2 months ago
Downloads: 95 last month
Last synced: 18 days ago