An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

npmjs.org "web-scraping" keyword

View the packages on the npmjs.org package registry that are tagged with the "web-scraping" keyword.

Top 2.5% on npmjs.org
@crawlee/cli 3.15.2
The scalable web crawling and scraping library for JavaScript/Node.js. Enables development of dat...
1,448 versions - Latest release: 11 days ago - 2 dependent packages - 32 dependent repositories - 186 thousand downloads last month - 10,928 stars on GitHub - 10 maintainers
Top 1.8% on npmjs.org
@crawlee/puppeteer 3.15.2
The scalable web crawling and scraping library for JavaScript/Node.js. Enables development of dat...
1,556 versions - Latest release: 11 days ago - 5 dependent packages - 34 dependent repositories - 195 thousand downloads last month - 20,257 stars on GitHub - 10 maintainers
Top 1.5% on npmjs.org
@crawlee/browser-pool 3.15.2
Rotate multiple browsers using popular automation libraries such as Playwright or Puppeteer.
1,552 versions - Latest release: 11 days ago - 11 dependent packages - 35 dependent repositories - 225 thousand downloads last month - 10,928 stars on GitHub - 10 maintainers
Top 1.6% on npmjs.org
@crawlee/playwright 3.15.2
The scalable web crawling and scraping library for JavaScript/Node.js. Enables development of dat...
1,554 versions - Latest release: 11 days ago - 6 dependent packages - 34 dependent repositories - 210 thousand downloads last month - 10,928 stars on GitHub - 10 maintainers
Top 1.7% on npmjs.org
@crawlee/http 3.15.2
The scalable web crawling and scraping library for JavaScript/Node.js. Enables development of dat...
1,361 versions - Latest release: 11 days ago - 6 dependent packages - 33 dependent repositories - 198 thousand downloads last month - 20,257 stars on GitHub - 10 maintainers
Top 2.5% on npmjs.org
@crawlee/templates 3.15.2
Templates for the crawlee projects
1,435 versions - Latest release: 11 days ago - 2 dependent packages - 32 dependent repositories - 187 thousand downloads last month - 20,257 stars on GitHub - 10 maintainers
@crawlee/impit-client 3.15.2
impit-based HTTP client implementation for Crawlee. Impersonates browser requests to avoid bot de...
263 versions - Latest release: 11 days ago - 3.36 thousand downloads last month - 20,257 stars on GitHub - 5 maintainers
n8n-nodes-anchorbrowser 0.1.5
n8n node for Anchor Browser API - browser automation and control
6 versions - Latest release: about 3 hours ago - 437 downloads last month - 1 maintainer
ts-jobspy 1.3.1
TypeScript job scraper for LinkedIn, Indeed, Glassdoor, ZipRecruiter & more - rewritten from pyth...
5 versions - Latest release: 2 months ago - 34 downloads last month - 0 stars on GitHub - 1 maintainer
@mseep/scrapi-mcp 0.0.3
MCP server for using ScrAPI to scrape web pages.
1 version - Latest release: 6 months ago - 3 downloads last month - 11 stars on GitHub - 1 maintainer
html-content-processor 1.0.5
A professional library for processing, cleaning, filtering, and converting HTML content to Markdo...
4 versions - Latest release: 5 months ago - 18 downloads last month - 0 stars on GitHub - 1 maintainer
flamescraper 1.0.11
Deep web scraper with recursive subpages for each param and unknown-pagination loop
12 versions - Latest release: almost 2 years ago - 1 dependent package - 42 downloads last month - 0 stars on GitHub - 1 maintainer
@anysiteio/mcp 0.7.0
A comprehensive MCP server with 57 tools for LinkedIn, Instagram, Twitter/X, Reddit, and web scra...
11 versions - Latest release: about 11 hours ago - 939 downloads last month - 44 stars on GitHub - 1 maintainer
crawlyx 2.2.5
Crawlyx is an open-source command-line interface (CLI) based web crawler built using Node.js. It ...
16 versions - Latest release: 7 months ago - 330 downloads last month - 11 stars on GitHub - 1 maintainer
n8n-nodes-url-to-html 1.0.2
n8n node for converting URLs to HTML using pdfmunk API
3 versions - Latest release: about 1 month ago - 30 downloads last month - 0 stars on GitHub - 1 maintainer
kalpana-agent 1.1.16
Kalpana (कल्पना) - AI development assistant with multi-runtime containerized execution, web autom...
27 versions - Latest release: about 1 month ago - 248 downloads last month - 0 stars on GitHub - 1 maintainer
gnews-scraper 1.2.3
GNewsScraper is a TypeScript package that scrapes article data from Google News based on a keywor...
5 versions - Latest release: over 2 years ago - 1 dependent repositories - 24 downloads last month - 13 stars on GitHub - 1 maintainer
tyre-tracker 1.0.1
programatically retrieve product data from Canadian Tire website
2 versions - Latest release: over 2 years ago - 30 downloads last month - 0 stars on GitHub - 1 maintainer
@bluggie/nodescrapy 0.1.6
Web crawler in NodeJS
7 versions - Latest release: over 3 years ago - 1 dependent package - 1 dependent repositories - 63 downloads last month - 2 stars on GitHub - 1 maintainer
scrape-them-all 2.0.0
🚀 An easy-to-handle Node.js scraper that allow you to scrape them all in a record time.
4 versions - Latest release: almost 5 years ago - 1 dependent repositories - 49 downloads last month - 10 stars on GitHub - 1 maintainer
rabbit-browser 1.1.0
Browser automation tool for detecting interactive elements on web pages
2 versions - Latest release: 7 months ago - 12 downloads last month - 0 stars on GitHub - 1 maintainer
@lyuboslavlyubenov/se-scraper 1.9.12
A module using puppeteer to scrape several search engines such as Google, Bing and Duckduckgo
9 versions - Latest release: almost 4 years ago - 1 dependent package - 46 downloads last month - 561 stars on GitHub - 1 maintainer
@promptbook/website-crawler 0.102.0
Promptbook: Turn your company's scattered knowledge into AI ready books
346 versions - Latest release: 20 days ago - 6 thousand downloads last month - 134 stars on GitHub - 1 maintainer
anchorbrowser 0.8.3
The official TypeScript library for the Anchorbrowser API
15 versions - Latest release: 1 day ago - 3.13 thousand downloads last month - 0 stars on GitHub - 1 maintainer
kargo-takip 1.0.3 💰
Türkiye kargo takip modülü - Web scraping tabanlı kargo takip sistemi
2 versions - Latest release: 3 months ago - 6 downloads last month - 1 stars on GitHub - 1 maintainer
@screenshotbuddy/node-curl-impersonate 1.0.6
A wrapper around cURL-impersonate, a binary which can be used to bypass TLS fingerprinting.
6 versions - Latest release: 6 months ago - 12 downloads last month - 0 stars on GitHub - 1 maintainer
crawlee-proxyport 1.0.1
proxyport rotating proxy integration for Crawlee web scraping framework
2 versions - Latest release: over 2 years ago - 7 downloads last month - 6 stars on GitHub - 1 maintainer
n8n-nodes-browser-use-cloud 1.0.4
n8n node for Browser Use Cloud API - Automate web tasks with AI agents
4 versions - Latest release: 1 day ago - 156 downloads last month - 1 stars on GitHub - 1 maintainer
rag-system-pgvector 2.3.1
A complete Retrieval-Augmented Generation system using pgvector, LangChain, and LangGraph for Nod...
10 versions - Latest release: 1 day ago - 472 downloads last month - 1 maintainer
scan-link 1.0.3
Scan given website recursively and report 404 links
4 versions - Latest release: over 1 year ago - 74 downloads last month - 1 stars on GitHub - 1 maintainer
competitor-intel-toolkit 1.0.0
Automated competitive intelligence: live web scraping + AI analysis. Know what your competitors a...
1 version - Latest release: 2 days ago - 1 maintainer
curlcookie 0.0.5
Parse cookie jar stored by curl
4 versions - Latest release: over 5 years ago - 2 dependent packages - 14 downloads last month - 1 stars on GitHub - 1 maintainer
@mseep/mult-fetch-mcp-server 1.3.2
An MCP protocol-based web content fetching tool that supports multiple modes and formats, can be ...
1 version - Latest release: 7 months ago - 7 downloads last month - 11 stars on GitHub - 1 maintainer
@mnmkng/scraper-tools 0.1.2
Tools shared by Apify actor-scrapers.
12 versions - Latest release: over 6 years ago - 1 dependent package - 1 dependent repositories - 150 downloads last month - 125 stars on GitHub - 1 maintainer
mcp-fetch 0.5.0
A Model Context Protocol server providing tools for HTTP requests, GraphQL queries, WebSocket con...
8 versions - Latest release: 11 months ago - 119 downloads last month - 1 stars on GitHub - 1 maintainer
@bonginkan/maria 4.4.8
🚀 MARIA v4.4.8 - Enterprise AI Development Platform with identity system and character voice impl...
227 versions - Latest release: 2 days ago - 3.11 thousand downloads last month - 2 stars on GitHub - 1 maintainer
osu-droid-scraping 1.1.5
Gather information of an osu!droid user via web scraping.
11 versions - Latest release: 12 months ago - 83 downloads last month - 0 stars on GitHub - 1 maintainer
firecrawl-simple-mcp 1.0.2
Model Context Protocol (MCP) server for Firecrawl Simple - provides web scraping and crawling cap...
3 versions - Latest release: 7 months ago - 22 downloads last month - 0 stars on GitHub - 1 maintainer
@apify-scrapers/shared 1.0.0
Shared utilities and constants for Apify scrapers
1 version - Latest release: 4 months ago - 13 downloads last month - 1 maintainer
chrome-automation-mcp 1.2.14
MCP server for browser automation with custom scripts
22 versions - Latest release: 2 days ago - 325 downloads last month - 0 stars on GitHub - 1 maintainer
search-agent 1.0.0
Oblien Search SDK - AI-powered web search, content extraction, and website crawling. Full documen...
1 version - Latest release: 2 days ago - 1 maintainer
playwright-vision-mcp 2.1.0
n8n MCP server with Playwright browser automation capabilities
2 versions - Latest release: 3 days ago - 86 downloads last month - 0 stars on GitHub - 1 maintainer
@citation-js/plugin-zotero-translation-server 0.2.0
Citation.js Plugin for Zotero Translation Server instances
4 versions - Latest release: over 3 years ago - 1 dependent package - 2 dependent repositories - 149 downloads last month - 5 stars on GitHub - 1 maintainer
@scrape-it-all/trampos 0.2.2
Get details of jobs available on trampos.co
6 versions - Latest release: almost 6 years ago - 2 dependent packages - 143 downloads last month - 0 stars on GitHub - 1 maintainer
@lightfeed/extractor 0.2.1
Use LLMs to robustly extract and enrich structured data from HTML and markdown
6 versions - Latest release: about 1 month ago - 153 downloads last month - 46 stars on GitHub - 1 maintainer
@iflow-mcp/fetcher-mcp 0.3.5
MCP server for fetching web content using Playwright browser
1 version - Latest release: 3 days ago - 2 maintainers
@decodo/n8n-nodes-decodo 1.5.0
Decodo n8n integration
12 versions - Latest release: 3 days ago - 433 downloads last month - 8 stars on GitHub - 1 maintainer
@langgraph-js/crawler 1.7.0
A powerful web crawler designed specifically for LLM applications, capable of extracting clean, r...
17 versions - Latest release: 4 months ago - 125 downloads last month - 5 stars on GitHub - 1 maintainer
@langgraph-js/crawler-mcp 1.5.3
A powerful web crawler designed specifically for LLM applications, capable of extracting clean, r...
3 versions - Latest release: 5 months ago - 25 downloads last month - 5 stars on GitHub - 1 maintainer
tg-scraper 1.0.1
Telegram Channel Scraper
1 version - Latest release: about 1 year ago - 8 downloads last month - 1 maintainer
@fanboynz/network-scanner 2.0.31
A Puppeteer-based network scanner for analyzing web traffic, generating adblock filter rules, and...
96 versions - Latest release: 3 days ago - 1.52 thousand downloads last month - 6 stars on GitHub - 1 maintainer
@browserbasehq/stagehand 3.0.0
An AI web browsing framework focused on simplicity and extensibility.
428 versions - Latest release: 5 days ago - 2.04 million downloads last month - 18,620 stars on GitHub - 9 maintainers
@crawlee/linkedom 3.15.2
The scalable web crawling and scraping library for JavaScript/Node.js. Enables development of dat...
1,057 versions - Latest release: 11 days ago - 1 dependent package - 8 dependent repositories - 188 thousand downloads last month - 20,257 stars on GitHub - 5 maintainers
@browserbasehq/orca 3.0.0-test.1
An AI web browsing framework focused on simplicity and extensibility.
10 versions - Latest release: 6 days ago - 1.57 thousand downloads last month - 18,620 stars on GitHub - 9 maintainers
playread 1.1.10
Web content extraction and automation via Playwright MCP
12 versions - Latest release: 4 days ago - 1 thousand downloads last month - 1 maintainer
@rpidanny/odysseus 2.6.0
Odysseus is a web scraping library built on top of Playwright, designed to handle dynamic web pag...
15 versions - Latest release: over 1 year ago - 753 downloads last month - 1 stars on GitHub - 1 maintainer
Top 2.7% on npmjs.org
html-metadata 3.0.1
Scrapes metadata of several different standards
23 versions - Latest release: 6 months ago - 22 dependent packages - 97 dependent repositories - 2.52 thousand downloads last month - 174 stars on GitHub - 2 maintainers
@iflow-mcp/puremd-mcp 1.0.3
Model Context Protocol (MCP) server for pure.md, the markdown delivery network for LLMs
1 version - Latest release: 4 days ago - 46 stars on GitHub - 2 maintainers
stepwright 1.0.7
A powerful web scraping library built with Playwright
8 versions - Latest release: 4 days ago - 350 downloads last month - 5 stars on GitHub - 1 maintainer
html-metadatas 1.5.0 unpublished
Scrapes metadata of several different standards
1 version - Latest release: about 9 years ago - 3 dependent packages - 1 dependent repositories - 14 downloads last month - 175 stars on GitHub - 1 maintainer
brave-real-browser-mcp-server 2.13.0
Universal AI IDE MCP Server - Auto-detects and supports all AI IDEs (Claude Desktop, Cursor, Wind...
87 versions - Latest release: 4 days ago - 7.99 thousand downloads last month - 1 maintainer
@iflow-mcp/hyperbrowser 1.1.0
Hyperbrowser Model Context Protocol Server
1 version - Latest release: 4 days ago - 631 stars on GitHub - 2 maintainers
vuagen 1.0.3
Vuagen is a simple and flexible User-Agent generator for browser automation, testing, and web scr...
3 versions - Latest release: 3 months ago - 19 downloads last month - 3 stars on GitHub - 1 maintainer
google-reviews-api 1.0.6
A simple Node.js library to fetch Google Maps reviews
7 versions - Latest release: 6 months ago - 58 downloads last month - 0 stars on GitHub - 1 maintainer
panini-scraper 1.1.0
A TypeScript library for scraping Panini Brasil product information
3 versions - Latest release: 5 days ago - 161 downloads last month - 0 stars on GitHub - 1 maintainer
mult-fetch-mcp-server 1.0.0
一个基于 MCP 协议的网页内容获取工具,支持多种模式和格式,可与 Claude 等 AI 助手集成
1 version - Latest release: 8 months ago - 6 downloads last month - 13 stars on GitHub - 1 maintainer
@cyanheads/jinaai-mcp-server 1.0.4 💰
A Model Context Protocol (MCP) server that provides intelligent web reading capabilities using th...
3 versions - Latest release: 3 months ago - 40 downloads last month - 0 stars on GitHub - 1 maintainer
sigaa-api-cefetmg 1.0.35
API for CEFETMG-SIGAA plataform, forked from sigaa-api project.
1 version - Latest release: about 1 month ago - 10 downloads last month - 52 stars on GitHub - 1 maintainer
@ozipoetra/ts-curl-impersonate 1.0.11
A personal typescript wrapper around cURL-impersonate.
1 version - Latest release: 5 months ago - 9 downloads last month - 1 maintainer
stonkinator 1.0.0
A low level stock data aggregation tool, a boring lib for others to build upon
1 version - Latest release: over 4 years ago - 20 downloads last month - 1 maintainer
@iflow-mcp/logo-mcp 1.0.0
一个智能Logo提取和处理的MCP服务器,支持从网站URL自动识别并提取Logo图标
1 version - Latest release: 3 months ago - 39 downloads last month - 0 stars on GitHub - 2 maintainers
naver-finance-crawl-mcp 1.0.0
Naver Finance web crawler with MCP server support - crawl Korean stock market data
1 version - Latest release: 5 days ago - 1 maintainer
pentestic-playwright-mcp 1.0.0
Enhanced Playwright MCP Server with 40+ browser automation tools
1 version - Latest release: 5 days ago - 1 maintainer
@adobe/spacecat-shared-html-analyzer 1.0.3
Analyze HTML content visibility for AI crawlers and citations - compare static HTML vs fully rend...
4 versions - Latest release: 5 days ago - 52 downloads last month - 7 stars on GitHub - 31 maintainers
@iflow-mcp/mcp-webresearch 0.1.8
MCP server for web research
2 versions - Latest release: 5 days ago - 78 downloads last month - 281 stars on GitHub - 2 maintainers
@iflow-mcp/mcpwebresearchserver 0.1.7
MCP server for web research
1 version - Latest release: 5 days ago - 281 stars on GitHub - 2 maintainers
design-clone-mcp 1.0.0
MCP server for cloning website designs with Puppeteer - One command, just works
1 version - Latest release: 5 days ago - 1 maintainer
@sashbot/browser-navigator 1.0.3
Automated browser navigation and visual mapping system for web applications
4 versions - Latest release: 4 months ago - 29 downloads last month - 1 stars on GitHub - 1 maintainer
mcp-jinaai-grounding 0.0.2
MCP server for JinaAI grounding
2 versions - Latest release: 9 months ago - 140 downloads last month - 1 stars on GitHub - 1 maintainer
@mseep/puremd-mcp 1.0.3
Model Context Protocol (MCP) server for pure.md, the markdown delivery network for LLMs
1 version - Latest release: 7 months ago - 11 downloads last month - 46 stars on GitHub - 1 maintainer
puremd-mcp 1.0.3
Model Context Protocol (MCP) server for pure.md, the markdown delivery network for LLMs
4 versions - Latest release: 7 months ago - 178 downloads last month - 46 stars on GitHub - 1 maintainer
devchrome-mcp 1.8.2
MCP (Model Context Protocol) сервер для работы с браузером через Puppeteer
15 versions - Latest release: 5 days ago - 181 downloads last month - 0 stars on GitHub - 1 maintainer
qa-agent 2.2.6
AI-powered QA agent using LLM models for automated testing and web interaction
24 versions - Latest release: 6 days ago - 472 downloads last month - 1 stars on GitHub - 1 maintainer
doki-doki-hci 0.0.2
A web automation tool for HCI
2 versions - Latest release: 3 months ago - 8 downloads last month - 1 maintainer
curl-cffi 0.1.42
A powerful HTTP client for Node.js based on libcurl with browser fingerprinting capabilities.
43 versions - Latest release: 6 days ago - 339 downloads last month - 9 stars on GitHub - 1 maintainer
telegram-scraper 1.0.2
A simple Telegram channel scraper
3 versions - Latest release: over 1 year ago - 2.47 thousand downloads last month - 30 stars on GitHub - 1 maintainer
@iflow-mcp/webscraping-ai-mcp-server 1.0.2
Model Context Protocol server for WebScraping.AI API. Provides LLM-powered web scraping tools wit...
1 version - Latest release: 6 days ago - 31 stars on GitHub - 2 maintainers
@webstandard/robots 2.0.0
A standards-compliant generator for producing robots.txt files
2 versions - Latest release: 6 days ago - 93 downloads last month - 1 maintainer
@victorsouzaleal/googlethis 1.8.1 💰
A simple yet powerful module to retrieve organic search results and much more from Google.
1 version - Latest release: over 1 year ago - 795 downloads last month - 0 stars on GitHub - 1 maintainer
aisy 0.1.0
AISY:An intelligent AI-powered search engine that breaks down complex queries into sub-questions
1 version - Latest release: 11 months ago - 9 downloads last month - 1 maintainer
mcp-docs-collector 0.0.2
MCP server for collecting documentation from various sources
2 versions - Latest release: 5 months ago - 6 downloads last month - 1 maintainer
a1hul-mcp 0.1.6
MCP server for extracting content from web pages
7 versions - Latest release: 5 months ago - 22 downloads last month - 0 stars on GitHub - 1 maintainer
webmiddle-service-cheerio-to-virtual 0.3.0 deprecated
> Service that converts a resource, whose content is parsed by the [Cheerio](https://github.com/...
5 versions - Latest release: over 7 years ago - 4 dependent packages - 1 dependent repositories - 35 downloads last month - 14 stars on GitHub - 1 maintainer
webmiddle-service-jsonselect-to-json 0.3.0 deprecated
> Service that converts a resource, whose content is parsed by the [JSONSelect](https://github.c...
5 versions - Latest release: over 7 years ago - 3 dependent packages - 1 dependent repositories - 23 downloads last month - 14 stars on GitHub - 1 maintainer
webmiddle-service-resume 0.3.0 deprecated
> A service that makes its children **resumable** by caching the result.
4 versions - Latest release: over 7 years ago - 3 dependent packages - 1 dependent repositories - 25 downloads last month - 14 stars on GitHub - 1 maintainer
webmiddle-component-cheerio-to-virtual 0.4.0 deprecated
> Component that converts a resource, whose content is parsed by the [Cheerio](https://github.com...
2 versions - Latest release: about 7 years ago - 2 dependent packages - 1 dependent repositories - 9 downloads last month - 14 stars on GitHub - 1 maintainer
webmiddle-service-pipe 0.3.0 deprecated
> Executes a sequence of services, piping their results (resources) to the subsequent services in...
5 versions - Latest release: over 7 years ago - 6 dependent packages - 1 dependent repositories - 23 downloads last month - 14 stars on GitHub - 1 maintainer
webmiddle-service-cheerio-to-json 0.3.0 deprecated
> Service that converts a resource, whose content is parsed by the [Cheerio](https://github.com/...
5 versions - Latest release: over 7 years ago - 3 dependent packages - 1 dependent repositories - 41 downloads last month - 14 stars on GitHub - 1 maintainer
webmiddle-component-arraymap 0.4.0 deprecated
> Maps an array into an array of resources by executing a callback on each item.
2 versions - Latest release: about 7 years ago - 1 dependent package - 1 dependent repositories - 15 downloads last month - 14 stars on GitHub - 1 maintainer
webmiddle-service-arraymap 0.3.0 deprecated
> Maps an array into an array of resources by executing a callback on each item.
5 versions - Latest release: over 7 years ago - 3 dependent packages - 1 dependent repositories - 11 downloads last month - 14 stars on GitHub - 1 maintainer