npmjs.org "web-data-extraction" keyword
View the packages on the npmjs.org package registry that are tagged with the "web-data-extraction" keyword.
js-harvester 0.3.14
Harvester is a lightweight and highly optimized javascript library for extracting data from the D...27 versions - Latest release: 4 months ago - 31 downloads last month - 23 stars on GitHub - 1 maintainer
@mendable/firecrawl-js 4.3.7
JavaScript SDK for Firecrawl API177 versions - Latest release: 3 days ago - 773 thousand downloads last month - 60,856 stars on GitHub - 4 maintainers
@mendable/firecrawl 4.3.7
JavaScript SDK for Firecrawl API42 versions - Latest release: 3 days ago - 1.15 thousand downloads last month - 60,856 stars on GitHub - 4 maintainers
firecrawl 4.3.7
JavaScript SDK for Firecrawl API102 versions - Latest release: 3 days ago - 13.9 thousand downloads last month - 60,856 stars on GitHub - 2 maintainers
gnews-scraper 1.2.3
GNewsScraper is a TypeScript package that scrapes article data from Google News based on a keywor...5 versions - Latest release: about 2 years ago - 1 dependent repositories - 24 downloads last month - 13 stars on GitHub - 1 maintainer
extract-site-metadata 1.3.1
Metadata extractor for the sprawling web11 versions - Latest release: over 3 years ago - 1 dependent package - 2 dependent repositories - 40 downloads last month - 0 stars on GitHub - 1 maintainer
@lightfeed/extractor 0.2.0
Use LLMs to robustly extract and enrich structured data from HTML and markdown5 versions - Latest release: 2 months ago - 42 downloads last month - 46 stars on GitHub - 1 maintainer
@lightfeed/sdk 0.1.7
Lightfeed SDK for Node.js1 version - Latest release: 4 months ago - 10 downloads last month - 5 stars on GitHub - 1 maintainer
lightfeed 0.1.6 deprecated
Lightfeed API Client for Node.js6 versions - Latest release: 4 months ago - 238 downloads last month - 5 stars on GitHub - 1 maintainer
lightfeed-extract 0.1.5 deprecated
Use LLMs to robustly extract and enrich structured data from HTML and markdown5 versions - Latest release: 5 months ago - 357 downloads last month - 37 stars on GitHub - 1 maintainer
Related Keywords
crawler
8
web-scraping
8
ai-agents
7
llm
7
webscraping
7
data-extraction
6
html-to-markdown
5
web-data
5
web
5
scraping
5
structured-data
5
markdown
5
llm-scraper
4
llm-extraction
4
data-engineering
4
data-pipeline
4
etl
4
rag
4
web-extraction
4
scraper
4
web-search
3
web-scraper
3
web-crawler
3
ai-search
3
ai-scraping
3
ai-crawler
3
ai
3
sdk
3
html
3
api
3
extraction
3
mendable
3
firecrawl
3
web-automation
2
openai
2
gemini
2
article-extractor
2
google-gemini
2
html-parser
2
nlp
2
rss-feed
2
lightfeed
2
business-intelligence
2
data-integration
2
embedding-search
2
extract
2
knowledge-base
2
vector-database
2
web-data-management
2
approximate-scraping
1
open-graph-protocol
1
metadata-extraction
1
approximate
1
schema.org
1
opengraph
1
metadata
1
seo
1
web-crawling
1
typescript
1
news-scraping
1
keyword-search
1
json-parsing
1
google-news-scraper
1
google-news
1
gnews-api
1
data-scraping
1
article-extraction
1
google crawler
1
news crawler
1
tree-template
1
puppeteer
1
playwright
1
lightweight
1
optimized
1
data
1
html-parsing
1
parsing
1
dom-parsing
1
dom
1
harvesting
1
data-harvesting
1
template-based-scraping
1
template-based
1
template-extraction
1
template
1
pattern-based-scraping
1
pattern-based
1
visual-scraping-template
1
declarative-scraping
1
fuzzy-scraping
1
fuzzy
1
pseudo-tree-template
1
string-template-scraping
1
string-template
1
indentation-based-template
1
visual-template
1
javascript-scraping
1
javascript
1
npm-package
1
browser-scraping
1