Html2md MCP

This MCP server converts web page HTML into concise Markdown, retaining core content such as tables and images. The output is typically 90-95% smaller than the source HTML. A browser mode handles JavaScript-rendered websites and pages that require authentication.

What is the HTML to Markdown MCP Server?

This is a tool server specifically designed for Claude AI. It can automatically convert the HTML content of any web page into a clean and readable Markdown format. It acts like an intelligent web content extractor, removing irrelevant elements such as advertisements, navigation bars, and scripts, and only retaining the core article content, tables, and images. This greatly reduces the file size and makes it easier for AI to understand and process web information.

How to use the HTML to Markdown service?

To use it, paste a web page link into a Claude conversation or call the dedicated tool directly. The server fetches the page, strips irrelevant content, converts the remainder to Markdown, and returns the result to Claude. You can choose whether to include images and tables, and you can enable browser mode for dynamic pages that need JavaScript to load.
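The clean-then-convert step described above can be illustrated with a minimal sketch. This is not the server's implementation (the project lists Trafilatura as its extraction library); it uses only the Python standard library, and the names `MarkdownSketch`, `html_to_markdown_sketch`, and the `SKIP_TAGS` list are hypothetical. It handles only headings, paragraphs, and images, and ignores tables and links.

```python
from html.parser import HTMLParser

# Tags whose entire content is discarded (an assumed, illustrative list).
SKIP_TAGS = {"script", "style", "nav", "footer", "aside"}
# Tags treated as text blocks in this sketch.
BLOCK_TAGS = {"h1", "h2", "h3", "p"}


class MarkdownSketch(HTMLParser):
    """Collects headings, paragraphs and images as Markdown blocks."""

    def __init__(self, include_images=True):
        super().__init__()
        self.include_images = include_images
        self.lines = []        # finished Markdown blocks
        self.buffer = []       # text fragments of the block being read
        self.prefix = ""       # "# "-style prefix for headings
        self.skip_depth = 0    # > 0 while inside a skipped tag

    def handle_starttag(self, tag, attrs):
        if tag in SKIP_TAGS:
            self.skip_depth += 1
        elif self.skip_depth:
            return
        elif tag in BLOCK_TAGS:
            # Start a fresh block; text outside blocks is dropped.
            self.buffer = []
            self.prefix = "" if tag == "p" else "#" * int(tag[1]) + " "
        elif tag == "img" and self.include_images:
            a = dict(attrs)
            self.lines.append(f"![{a.get('alt') or ''}]({a.get('src') or ''})")

    def handle_endtag(self, tag):
        if tag in SKIP_TAGS:
            self.skip_depth = max(0, self.skip_depth - 1)
        elif not self.skip_depth and tag in BLOCK_TAGS:
            self._flush()

    def handle_data(self, data):
        if not self.skip_depth:
            self.buffer.append(data)

    def _flush(self):
        # Collapse runs of whitespace and emit the block if it has text.
        text = " ".join("".join(self.buffer).split())
        if text:
            self.lines.append(self.prefix + text)
        self.buffer, self.prefix = [], ""


def html_to_markdown_sketch(html, include_images=True):
    """Convert an HTML string to a rough Markdown string."""
    parser = MarkdownSketch(include_images)
    parser.feed(html)
    parser.close()
    return "\n\n".join(parser.lines)
```

Calling `html_to_markdown_sketch(page_html, include_images=False)` mirrors the option to leave images out of the result.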

Applicable scenarios

It is well suited to organizing and analyzing research papers, technical documents, news articles, and product descriptions. When an AI needs to read a web page, the tool supplies only the relevant text, so the model is not distracted by unrelated page elements.

Main Features

Intelligent Content Extraction
Automatically identify and retain the core content of the web page (article text, tables, images), and remove irrelevant elements such as advertisements, navigation bars, and footers.
Efficient Compression
Reduce the HTML file size by 90-95% by generating concise Markdown, which significantly cuts token usage during AI processing.
Browser Mode
Use a real browser (Chromium, Firefox, or WebKit) to fetch pages that are rendered dynamically by JavaScript, including sites built with frameworks such as React and Vue.
Authenticated Page Access
You can use your browser profile and cookies to access private pages that require login (such as corporate intranets and social media).
Flexible Configuration
Choose whether to include images, tables, and links, and set the timeout and file size limit to suit your needs.
Cache Acceleration
Optionally enable the cache function. When accessing the same web page repeatedly, read directly from the cache to significantly improve the processing speed.
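The cache feature could work along the following lines. This is a sketch under assumptions, not the server's actual cache: the class `PageCache`, the function `convert_with_cache`, and the 300-second default time-to-live are all hypothetical.

```python
import time


class PageCache:
    """In-memory cache mapping a URL to converted Markdown, with a time-to-live."""

    def __init__(self, ttl_seconds=300.0, clock=time.monotonic):
        self.ttl_seconds = ttl_seconds
        self.clock = clock          # injectable so tests can control time
        self._entries = {}          # url -> (stored_at, markdown)

    def get(self, url):
        entry = self._entries.get(url)
        if entry is None:
            return None
        stored_at, markdown = entry
        if self.clock() - stored_at > self.ttl_seconds:
            del self._entries[url]  # expired: drop it and report a miss
            return None
        return markdown

    def put(self, url, markdown):
        self._entries[url] = (self.clock(), markdown)


def convert_with_cache(url, cache, convert):
    """Return cached Markdown for url, calling convert(url) only on a miss."""
    cached = cache.get(url)
    if cached is not None:
        return cached
    markdown = convert(url)
    cache.put(url, markdown)
    return markdown
```

The expensive fetch-and-convert step is passed in as `convert`, so repeated requests for the same URL within the time-to-live never reach the network.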
Advantages
Significantly reduce token consumption when AI processes web page content (savings of 90-95%)
Provide clean and structured content to improve the accuracy of AI understanding
Support JavaScript dynamic web pages and pages that require authentication
Flexible configuration, and the output content can be adjusted according to needs
Fast processing, usually completing a conversion in 2-10 seconds
One-click Docker deployment for simple installation
Limitations
Some extremely complex web page layouts may not be perfectly extracted
The browser mode requires additional installation of Playwright and browsers
Processing very large web pages (over 50MB) may require configuration adjustment
A network connection is required to fetch web page content
Some websites with strict anti-crawling mechanisms may not be accessible
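The 90-95% reduction quoted above can be checked for any given page by comparing sizes before and after conversion. The helper below is a hypothetical illustration that measures UTF-8 bytes; byte size is only a rough proxy for token count, which depends on the model's tokenizer.

```python
def size_reduction_percent(html, markdown):
    """Percentage by which the Markdown is smaller than the HTML, in UTF-8 bytes."""
    html_bytes = len(html.encode("utf-8"))
    md_bytes = len(markdown.encode("utf-8"))
    if html_bytes == 0:
        raise ValueError("html must not be empty")
    return 100.0 * (html_bytes - md_bytes) / html_bytes
```

For example, a 1,000-byte page that converts to 80 bytes of Markdown gives a reduction of 92%.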

How to Use

Install the Server
Choose an installation method: Docker (the simplest), the uv package manager, or pip. Browser mode additionally requires Playwright.
Configure Claude Desktop
Edit the configuration file of Claude Desktop and add the MCP server configuration. Select the correct configuration file path according to your operating system.
Restart Claude Desktop
Completely close and restart the Claude Desktop application to make the configuration take effect.
Start Using
Directly enter the web page link in the Claude conversation or use the html_to_markdown tool command to convert the web page content.
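Behind the scenes, an MCP client invokes a tool by sending a JSON-RPC `tools/call` request. The sketch below builds such a request for the `html_to_markdown` tool named above. The argument names `url` and `include_images` are assumptions for illustration; the server's actual input schema may use different names, so check the tool listing reported by the server.

```python
import json


def build_tool_call(url, request_id=1, **options):
    """Build an MCP tools/call request for the html_to_markdown tool."""
    # Assumption: the tool accepts a "url" argument plus optional flags.
    arguments = {"url": url}
    arguments.update(options)
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": "html_to_markdown", "arguments": arguments},
    }


if __name__ == "__main__":
    request = build_tool_call("https://example.com/article", include_images=False)
    print(json.dumps(request, indent=2))
```

In normal use Claude Desktop constructs this request itself; the sketch only shows the shape of the message.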

Usage Examples

Research Paper Organization
Convert academic paper web pages into clean Markdown for AI to summarize core ideas and research methods.
Technical Document Learning
Convert complex technical document pages into a readable format to help AI understand API usage or technical concepts.
News Article Analysis
Remove the advertisements and sidebars of news websites and only retain the article text for content analysis.
Private Page Access
Access corporate intranets or social media pages that require login and extract private content.

Frequently Asked Questions

Why is the converted content sometimes incomplete?
How do I access websites that require login?
What if the conversion process is too slow?
What is the supported size of web page files?
What if I encounter a Playwright error during installation?
Why can't I see this tool in Claude Desktop?

Related Resources

Official MCP Documentation
Official documentation and specifications of the Model Context Protocol
GitHub Repository
Project source code and latest updates
Trafilatura Library
Documentation of the used web content extraction library
Playwright Documentation
Documentation of the browser automation tool
Docker Installation Guide
Docker installation and basic usage tutorial

Installation

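Copy one of the following configurations into your client's configuration file. The first uses Docker, the second uses uv with a Unix-style path, and the third uses uv with a Windows path.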
{
  "mcpServers": {
    "html2md": {
      "command": "docker",
      "args": [
        "run",
        "-i",
        "--rm",
        "html2md"
      ]
    }
  }
}

{
  "mcpServers": {
    "html2md": {
      "command": "uv",
      "args": [
        "--directory",
        "/absolute/path/to/html2md",
        "run",
        "html2md"
      ]
    }
  }
}

{
  "mcpServers": {
    "html2md": {
      "command": "uv",
      "args": [
        "--directory",
        "C:\\absolute\\path\\to\\html2md",
        "run",
        "html2md"
      ]
    }
  }
}
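Rather than editing the file by hand, an entry like the ones above can be merged into an existing configuration with a short script. This is a sketch, not part of the project: the function names are hypothetical, the macOS and Windows paths are the usual Claude Desktop locations, and the fallback path for other platforms is an assumption.

```python
import json
import os
import sys
from pathlib import Path


def default_config_path():
    """Best-guess location of claude_desktop_config.json for this platform."""
    if sys.platform == "darwin":
        return Path.home() / "Library" / "Application Support" / "Claude" / "claude_desktop_config.json"
    if sys.platform.startswith("win"):
        return Path(os.environ.get("APPDATA", "")) / "Claude" / "claude_desktop_config.json"
    # Assumption: other platforms follow an XDG-style layout.
    return Path.home() / ".config" / "Claude" / "claude_desktop_config.json"


def add_server(config, name, command, args):
    """Return a copy of config with one entry added under mcpServers."""
    merged = dict(config)
    servers = dict(merged.get("mcpServers", {}))
    servers[name] = {"command": command, "args": list(args)}
    merged["mcpServers"] = servers
    return merged


def install(path, name, command, args):
    """Read the config at path (if any), add the server, and write it back."""
    config = json.loads(path.read_text(encoding="utf-8")) if path.exists() else {}
    path.parent.mkdir(parents=True, exist_ok=True)
    updated = add_server(config, name, command, args)
    path.write_text(json.dumps(updated, indent=2), encoding="utf-8")
```

For the Docker variant, `install(default_config_path(), "html2md", "docker", ["run", "-i", "--rm", "html2md"])` would add the entry while keeping any servers already configured. Restart Claude Desktop afterwards for the change to take effect.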

Alternatives

Next Devtools MCP
The Next.js development tools MCP server provides Next.js development tools and utilities for AI programming assistants such as Claude and Cursor, including runtime diagnostics, development automation, and document access functions.
TypeScript
6.4K
5 points
Praisonai
PraisonAI is a production-ready multi-AI agent framework with self-reflection capabilities, designed to create AI agents to automate the solution of various problems from simple tasks to complex challenges. It simplifies the construction and management of multi-agent LLM systems by integrating PraisonAI agents, AG2, and CrewAI into a low-code solution, emphasizing simplicity, customization, and effective human-machine collaboration.
Python
5.7K
5 points
Maverick MCP
MaverickMCP is a personal stock analysis server based on FastMCP 2.0 that provides professional-level financial data analysis, technical indicator calculation, and portfolio optimization tools for MCP clients such as Claude Desktop. It comes preloaded with data for 520 S&P 500 stocks, supports multiple technical analysis strategies and parallel processing, and runs locally without complex authentication.
Python
8.2K
4 points
Blueprint MCP
Blueprint MCP is a chart generation tool based on the Arcade ecosystem. It uses technologies such as Nano Banana Pro to automatically generate visual charts such as architecture diagrams and flowcharts by analyzing codebases and system architectures, helping developers understand complex systems.
Python
9.4K
4 points
Klavis
Klavis AI is an open-source project that provides a simple and easy-to-use MCP (Model Context Protocol) service on Slack, Discord, and Web platforms. It includes various functions such as report generation, YouTube tools, and document conversion, supporting non-technical users and developers to use AI workflows.
TypeScript
15.8K
5 points
Devtools Debugger MCP
The Node.js Debugger MCP server provides complete debugging capabilities based on the Chrome DevTools protocol, including breakpoint setting, stepping execution, variable inspection, and expression evaluation.
TypeScript
10.4K
4 points
Scrapling
Scrapling is an adaptive web scraping library that can automatically learn website changes and re-locate elements. It supports multiple scraping methods and AI integration, providing high-performance parsing and a developer-friendly experience.
Python
12.9K
5 points
Mcpjungle
MCPJungle is a self-hosted MCP gateway used to centrally manage and proxy multiple MCP servers, providing a unified tool access interface for AI agents.
Go
0
4.5 points
Notion Api MCP
Certified
A Python-based MCP Server that provides advanced to-do list management and content organization functions through the Notion API, enabling seamless integration between AI models and Notion.
Python
17.7K
4.5 points
Markdownify MCP
Markdownify is a multi-functional file conversion service that supports converting multiple formats such as PDFs, images, audio, and web page content into Markdown format.
TypeScript
29.9K
5 points
Gitlab MCP Server
Certified
The GitLab MCP server is a project based on the Model Context Protocol that provides a comprehensive toolset for interacting with GitLab accounts, including code review, merge request management, CI/CD configuration, and other functions.
TypeScript
19.4K
4.3 points
Duckduckgo MCP Server
Certified
The DuckDuckGo Search MCP Server provides web search and content scraping services for LLMs such as Claude.
Python
59.2K
4.3 points
Unity
Certified
UnityMCP is a Unity editor plugin that implements the Model Context Protocol (MCP), providing seamless integration between Unity and AI assistants, including real-time state monitoring, remote command execution, and log functions.
C#
25.2K
5 points
Figma Context MCP
Framelink Figma MCP Server is a server that provides access to Figma design data for AI programming tools (such as Cursor). By simplifying the Figma API response, it helps AI more accurately achieve one-click conversion from design to code.
TypeScript
54.8K
4.5 points
Gmail MCP Server
A Gmail automatic authentication MCP server designed for Claude Desktop, supporting Gmail management through natural language interaction, including complete functions such as sending emails, label management, and batch operations.
TypeScript
18.6K
4.5 points
Minimax MCP Server
The official MiniMax Model Context Protocol (MCP) server supports interaction with text-to-speech and video/image generation APIs, and works with client tools such as Claude Desktop and Cursor.
Python
38.8K
4.8 points