An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

npmjs.org "text extraction" keyword

View the packages on the npmjs.org package registry that are tagged with the "text extraction" keyword.

teserver 0.0.5
A simple Docker server for extracting text from pdf, doc, docx, xls, xlsx, csv, pptx, png, jpg, g...
3 versions - Latest release: over 8 years ago - 1 dependent repositories - 7 downloads last month - 8 stars on GitHub - 1 maintainer
chatified 1.0.3
A tool to build a project structure tree and extract text from all files in a directory into a si...
4 versions - Latest release: over 2 years ago - 3 downloads last month - 0 stars on GitHub - 1 maintainer
resume-ranger 1.1.1
A Node.js package for parsing and extracting information from resumes in various formats.
12 versions - Latest release: over 1 year ago - 34 downloads last month - 1 maintainer
@pghoya2956/google-drive-mcp-server 0.5.2
MCP server for interacting with Google Drive and Sheets, with PDF text extraction and Excel file ...
4 versions - Latest release: 2 months ago - 21 downloads last month - 0 stars on GitHub - 1 maintainer
sprd-pdf-v2 2.0.0
Advanced PDF text extraction library with improved error handling and TypeScript support
1 version - Latest release: 8 months ago - 1 downloads last month - 1 maintainer
sprd-pdf-text 1.0.0
A simple PDF text extractor
1 version - Latest release: 8 months ago - 0 downloads last month - 1 maintainer
mimeograph 1.3.6
CoffeeScript lib for PDF OCR and text extraction.
11 versions - Latest release: over 8 years ago - 2 dependent packages - 2 dependent repositories - 9 downloads last month - 28 stars on GitHub - 3 maintainers
gutenbergscraper 1.0.3
A Scraper for Project Gutenberg allowing you to use it for scraping data into datasets, very cust...
4 versions - Latest release: 7 months ago - 5 downloads last month - 1 maintainer
cognivision 1.0.1
AI Image Recognition API for Node.js
2 versions - Latest release: over 1 year ago - 0 downloads last month - 1 maintainer
sprd-pdf 1.0.0
A library for extracting text from PDF files
1 version - Latest release: 8 months ago - 1 downloads last month - 1 maintainer
sinsintro-pdf-extractor 1.0.2
A library to extract text and images from PDFs
3 versions - Latest release: about 1 year ago - 2 downloads last month - 1 maintainer
grunt-text-grab 0.1.2
Extract chunks of text from files using regular expressions and save them to multiple formats.
3 versions - Latest release: over 11 years ago - 2 dependent packages - 1 dependent repositories - 17 downloads last month - 0 stars on GitHub - 1 maintainer
pdf-worker-package 1.0.1
A simple and robust PDF text extraction utility using pdfjs-dist
2 versions - Latest release: 3 months ago - 9 downloads last month - 1 maintainer
@push.rocks/smartpdf 4.1.1
A library for creating PDFs dynamically from HTML or websites with additional features like mergi...
14 versions - Latest release: about 2 months ago - 3 dependent packages - 419 downloads last month - 1 maintainer
any-extractor 2.0.2 💰
A universal text extractor for files.
8 versions - Latest release: 5 months ago - 5 downloads last month - 0 stars on GitHub - 1 maintainer
@s2s/language-service-client 0.0.5
Client for the language service
5 versions - Latest release: about 6 years ago - 1 dependent package - 1 dependent repositories - 0 downloads last month - 2 stars on GitHub - 1 maintainer
cloudflare-docx-parser 1.0.6
A lightweight library to parse .docx files in Cloudflare Workers
7 versions - Latest release: 8 months ago - 5 downloads last month - 0 stars on GitHub - 1 maintainer
pdfnano 0.1.1
A pure JavaScript/TypeScript PDF parser with no external dependencies
2 versions - Latest release: 5 months ago - 119 downloads last month - 0 stars on GitHub - 1 maintainer
Related Keywords
pdf 8 typescript 6 parser 3 ocr 3 image extraction 2 node 2 nodejs 2 javascript 2 node.js 2 docx 2 AI 2 book data scraper node 1 gutenberg project scraper 1 nodejs scraping library 1 nodejs text scraping 1 scraping books text 1 scraper npm nodejs 1 scraping books to csv 1 book data extraction 1 scrape gutenberg library 1 html extraction nodejs 1 scraper books nodejs 1 books from gutenberg scraper 1 scrape book text 1 scraping books npm 1 scraping with cheerio npm 1 scraper nodejs parallel 1 gutenberg html data 1 scrape books from gutenberg 1 nodejs project gutenberg 1 scraper project gutenberg 1 scrape nodejs 1 scrape gutenberg project data 1 gutenberg text scraper 1 web scraping tools npm 1 scraper library node 1 scraper parallel request 1 book data nodejs 1 scraper html data 1 nodejs scraper npm 1 scraper for gutenberg project 1 scrape gutenberg text 1 project gutenberg scraping 1 scraper for ebooks 1 scrape to csv 1 books scraper npm 1 gutenberg metadata scraper 1 scrape html nodejs 1 gutenberg scraping tool 1 scrape books json 1 parallel data scraping 1 scraper nodejs npm 1 scraper nodejs project 1 scraping with cheerio 1 scraping with axios 1 scraping project gutenberg books 1 nodejs scraper tool 1 gutenberg web scraping 1 book download tool 1 gutenberg library scrape 1 data scraping tool 1 scraper for nodejs 1 scraper javascript 1 scrape books into csv 1 scrape gutenberg content 1 scrape html books 1 data extraction tool nodejs 1 gutenberg node scraper 1 scraper npm package 1 scraper with axios cheerio 1 scrape gutenberg with nodejs 1 scraper for gutenberg books 1 gutenberg content extractor 1 scraping books nodejs 1 scrape gutenberg project 1 npm book scraper 1 scrape gutenberg books nodejs 1 gutenberg content extraction 1 scrape from gutenberg nodejs 1 gutenberg scraper npm 1 scraper node npm 1 books scraper 1 scrape gutenberg books 1 scraper typescript node 1 scraper csv json 1 scrape books retry 1 scraper retry 1 pdf-worker-package 1 pdfjs-dist 1 pdf-worker 1 pdfjs 1 regexp 1 gruntplugin 1 react 1 visual intelligence 1 OCR 1 emotion recognition 1 scene analysis 1 object detection 1 image processing 1