npmjs.org : js-harvester
Harvester is a lightweight and highly optimized javascript library for extracting data from the DOM tree. It supports extraction of tag texts with specified types and attributes. it's tiny and has no dependencies and also works with Puppeteer
Registry
-
Source
- Homepage
- JSON
purl: pkg:npm/js-harvester
Keywords:
puppeteer
, playwright
, lightweight
, optimized
, web-scraping
, web
, scraping
, data-extraction
, data
, extraction
, html-parsing
, html
, parsing
, dom-parsing
, dom
, harvesting
, data-harvesting
, template-based-scraping
, template-based
, template-extraction
, template
, pattern-based-scraping
, pattern-based
, visual-scraping-template
, declarative-scraping
, fuzzy-scraping
, fuzzy
, approximate-scraping
, approximate
, resilient-scraping
, resilient
, flexible-scraping
, flexible
, structure-agnostic-scraping
, semantic-scraping
, tree-template-scraping
, tree-template
, pseudo-tree-template
, string-template-scraping
, string-template
, indentation-based-template
, visual-template
, javascript-scraping
, javascript
, npm-package
, browser-scraping
, nodejs-scraping
, node-js
, nodejs
, dom-traversal
, dom-manipulation
, frontend-scraping
, hierarchical-data-extraction
, nested-data-extraction
, attribute-extraction
, text-extraction
, web-automation
, content-extraction
, web-data-extraction
License: MIT
Latest release: 4 months ago
First release: 6 months ago
Downloads: 31 last month
Stars: 23 on GitHub
Forks: 1 on GitHub
See more repository details: repos.ecosyste.ms
Last synced: 2 days ago