Super Fetch MCP Server
S

Super Fetch MCP Server

The SuperFetch MCP server is a web content extraction tool based on the Model Context Protocol. It can intelligently crawl web pages, extract readable content using Mozilla Readability, and return it in an AI-friendly JSONL or Markdown format. It supports structured content blocks, built-in caching, and security protection.
2 points
5.0K

What is SuperFetch MCP Server?

SuperFetch is a Model Context Protocol (MCP) server dedicated to intelligent web page content crawling and extraction. It is not just a simple web crawler. Instead, it uses Mozilla Readability technology to automatically identify and extract the main content of web pages, removing irrelevant elements such as advertisements, navigation bars, and footers, and then converts the content into a format that is easy for AI assistants to process. In simple terms, it allows AI to read web pages like humans, focusing only on truly valuable content rather than the cluttered information of the entire page.

How to use SuperFetch?

Using SuperFetch is very simple and does not require installing any software. You just need to add the configuration in the AI assistant tool you are using (such as Claude Desktop, VS Code, Cursor, etc.). It supports one-click installation and can be used within a few seconds. There are two main ways to use it: 1. **JSONL format**: Obtain structured content blocks, suitable for AI analysis and processing. 2. **Markdown format**: Obtain a clean document format, suitable for human reading and document generation.

Applicable scenarios

SuperFetch is particularly suitable for the following scenarios: • **AI assistants need to read web page content**: Enable AIs such as Claude and ChatGPT to access and understand web page information. • **Document generation and summarization**: Automatically extract the main content of articles to generate summaries or documents. • **RAG (Retrieval Augmented Generation) system**: Provide a high-quality web content source for the knowledge base. • **Content analysis and research**: Extract structured information from multiple web pages in batches for analysis. • **Technical document processing**: Extract technical content such as code examples and API documents.

Main features

Intelligent content extraction
Use Mozilla Readability technology to automatically identify and extract the core content of web pages, intelligently removing irrelevant elements such as advertisements, navigation bars, and footers, and only retaining truly valuable information.
Multiple output formats
Supports two output formats: JSONL (structured data) and Markdown (document format), meeting the needs of different scenarios. JSONL is suitable for AI processing, and Markdown is suitable for human reading.
Structured content blocks
Decompose web page content into semantic blocks: headings, paragraphs, lists, code blocks, tables, images, quotes, etc., allowing AI to better understand and process the content structure.
Built-in cache system
Automatically cache the extracted content, reducing duplicate requests and improving response speed. The cache time and maximum number of cache entries can be configured to optimize performance.
Robust crawling mechanism
Supports retry mechanism, timeout control, and redirection handling to ensure successful content acquisition even in unstable network conditions.
Security-first design
Built-in security mechanisms such as SSRF protection, URL verification, and header cleaning to prevent malicious requests and attacks and ensure safe use.
Easy integration
Supports one-click installation and is compatible with all mainstream MCP clients (Claude Desktop, VS Code, Cursor, etc.) without complex configuration.
Advantages
Intelligent content extraction: Automatically identify and extract core content, removing irrelevant elements
AI-friendly format: Structured output optimized specifically for AI processing
Easy to use: One-click installation, no technical background required
Highly configurable: Supports multiple configuration options to adapt to different needs
Safe and reliable: Built-in multiple security protection mechanisms
Excellent performance: Cache and retry mechanisms ensure stable and efficient operation
Limitations
Dynamic content limitation: Limited support for content dynamically loaded by JavaScript
Login page limitation: Unable to access content on pages that require login
Anti-crawler websites: May be blocked by the anti-crawler mechanisms of some websites
Large file processing: Extra processing may be required for extremely large web page content
Format conversion: Format conversion of some complex web pages may not be perfect

How to use

Select your AI tool
Select the corresponding configuration method according to the AI assistant tool you are using. It supports mainstream tools such as Claude Desktop, VS Code, Cursor, Codex IDE, and Windsurf.
Add configuration
Add the SuperFetch configuration to the tool's configuration file. Most tools support one-click installation, and you just need to copy the configuration code.
Restart the tool
After saving the configuration file, restart your AI tool for the configuration to take effect.
Start using
Now you can use the SuperFetch tool in your AI assistant. You can request to extract web page content and choose the JSONL or Markdown format.

Usage examples

Technical blog content analysis
You are researching a certain technical topic and need to analyze multiple relevant blog articles. Using SuperFetch, you can quickly extract the core content of these articles, remove advertisements and navigation, and only retain the technical discussion part.
Product documentation conversion
You need to convert the online documentation of a certain product into a local Markdown file for offline reading or localization processing.
News summary generation
You need to quickly understand the headline news content of multiple news websites and generate a daily summary.
Academic paper collection
Researchers need to collect abstracts and key findings of academic papers in related fields.

Frequently Asked Questions

What is the difference between SuperFetch and ordinary web crawling?
Do I need programming knowledge to use it?
Which websites are supported?
Which format should I choose, JSONL or Markdown?
Is there a size limit for the extracted content?
How to ensure safe use?
Does it support Chinese websites?
What should I do if I encounter a website that cannot be accessed?

Related resources

Official GitHub repository
View source code, submit issues, and participate in contributions
MCP Protocol official website
Learn detailed information and specifications of the Model Context Protocol
NPM package page
View package versions, download statistics, and update logs
MCP Registry
Discover more MCP servers and tools
Problem feedback
Report bugs or propose feature suggestions

Installation

Copy the following command to your Client for configuration
{
  "mcpServers": {
    "superFetch": {
      "command": "npx",
      "args": ["-y", "@j0hanz/superfetch@latest", "--stdio"]
    }
  }
}

{
  "mcpServers": {
    "superFetch": {
      "command": "npx",
      "args": ["-y", "@j0hanz/superfetch@latest", "--stdio"],
      "disabled": false,
      "autoApprove": []
    }
  }
}
Note: Your key is sensitive information, do not share it with anyone.

Alternatives

R
Rsdoctor
Rsdoctor is a build analysis tool specifically designed for the Rspack ecosystem, fully compatible with webpack. It provides visual build analysis, multi - dimensional performance diagnosis, and intelligent optimization suggestions to help developers improve build efficiency and engineering quality.
TypeScript
5.9K
5 points
N
Next Devtools MCP
The Next.js development tools MCP server provides Next.js development tools and utilities for AI programming assistants such as Claude and Cursor, including runtime diagnostics, development automation, and document access functions.
TypeScript
7.7K
5 points
T
Testkube
Testkube is a test orchestration and execution framework for cloud-native applications, providing a unified platform to define, run, and analyze tests. It supports existing testing tools and Kubernetes infrastructure.
Go
5.4K
5 points
M
MCP Windbg
An MCP server that integrates AI models with WinDbg/CDB for analyzing Windows crash dump files and remote debugging, supporting natural language interaction to execute debugging commands.
Python
6.7K
5 points
R
Runno
Runno is a collection of JavaScript toolkits for securely running code in multiple programming languages in environments such as browsers and Node.js. It achieves sandboxed execution through WebAssembly and WASI, supports languages such as Python, Ruby, JavaScript, SQLite, C/C++, and provides integration methods such as web components and MCP servers.
TypeScript
5.2K
5 points
P
Praisonai
PraisonAI is a production-ready multi-AI agent framework with self-reflection capabilities, designed to create AI agents to automate the solution of various problems from simple tasks to complex challenges. It simplifies the construction and management of multi-agent LLM systems by integrating PraisonAI agents, AG2, and CrewAI into a low-code solution, emphasizing simplicity, customization, and effective human-machine collaboration.
Python
5.9K
5 points
N
Netdata
Netdata is an open-source real-time infrastructure monitoring platform that provides second-level metric collection, visualization, machine learning-driven anomaly detection, and automated alerts. It can achieve full-stack monitoring without complex configuration.
Go
6.9K
5 points
M
MCP Server
The Mapbox MCP Server is a model context protocol server implemented in Node.js, providing AI applications with access to Mapbox geospatial APIs, including functions such as geocoding, point - of - interest search, route planning, isochrone analysis, and static map generation.
TypeScript
6.9K
4 points
G
Gitlab MCP Server
Certified
The GitLab MCP server is a project based on the Model Context Protocol that provides a comprehensive toolset for interacting with GitLab accounts, including code review, merge request management, CI/CD configuration, and other functions.
TypeScript
19.4K
4.3 points
M
Markdownify MCP
Markdownify is a multi-functional file conversion service that supports converting multiple formats such as PDFs, images, audio, and web page content into Markdown format.
TypeScript
30.0K
5 points
N
Notion Api MCP
Certified
A Python-based MCP Server that provides advanced to-do list management and content organization functions through the Notion API, enabling seamless integration between AI models and Notion.
Python
18.8K
4.5 points
D
Duckduckgo MCP Server
Certified
The DuckDuckGo Search MCP Server provides web search and content scraping services for LLMs such as Claude.
Python
58.3K
4.3 points
U
Unity
Certified
UnityMCP is a Unity editor plugin that implements the Model Context Protocol (MCP), providing seamless integration between Unity and AI assistants, including real - time state monitoring, remote command execution, and log functions.
C#
25.2K
5 points
F
Figma Context MCP
Framelink Figma MCP Server is a server that provides access to Figma design data for AI programming tools (such as Cursor). By simplifying the Figma API response, it helps AI more accurately achieve one - click conversion from design to code.
TypeScript
54.9K
4.5 points
M
Minimax MCP Server
The MiniMax Model Context Protocol (MCP) is an official server that supports interaction with powerful text-to-speech, video/image generation APIs, and is suitable for various client tools such as Claude Desktop and Cursor.
Python
38.8K
4.8 points
C
Context7
Context7 MCP is a service that provides real-time, version-specific documentation and code examples for AI programming assistants. It is directly integrated into prompts through the Model Context Protocol to solve the problem of LLMs using outdated information.
TypeScript
80.5K
4.7 points
AIBase
Zhiqi Future, Your AI Solution Think Tank
© 2026AIBase