Html2md MCP
The MCP server for HTML to Markdown conversion can convert web page HTML into a concise Markdown format, retaining core content such as tables and images. The compression rate reaches 90 - 95%. It supports browser mode to process JavaScript websites and authenticated pages.
rating : 2 points
downloads : 4.7K
What is the HTML to Markdown MCP Server?
This is a tool server specifically designed for Claude AI. It can automatically convert the HTML content of any web page into a clean and readable Markdown format. It acts like an intelligent web content extractor, removing irrelevant elements such as advertisements, navigation bars, and scripts, and only retaining the core article content, tables, and images. This greatly reduces the file size and makes it easier for AI to understand and process web information.How to use the HTML to Markdown service?
It's very simple to use: directly enter the web page link in the Claude conversation or use a dedicated tool command. The server will automatically fetch the web page, clean up the content, convert it to Markdown, and then return the result to Claude. You can also choose whether to include images and tables, and even process dynamic web pages that require JavaScript to load.Applicable scenarios
It is very suitable for organizing and analyzing content such as research papers, technical documents, news articles, and product descriptions. When you need AI to read and understand web page content, this tool can provide the cleanest and most relevant text information, preventing AI from being disturbed by irrelevant web elements.Main Features
Intelligent Content Extraction
Automatically identify and retain the core content of the web page (article text, tables, images), and remove irrelevant elements such as advertisements, navigation bars, and footers.
Efficient Compression
Reduce the HTML file size by 90 - 95%, generate a concise Markdown format, and significantly save the token usage during AI processing.
Browser Mode
Support using real browsers (Chromium/Firefox/WebKit) to fetch web pages dynamically loaded by JavaScript, and perfectly handle modern websites such as React and Vue.
Authenticated Page Access
You can use your browser profile and cookies to access private pages that require login (such as corporate intranets and social media).
Flexible Configuration
You can customize whether to include images, tables, and links, set the timeout time and file size limit to meet different usage needs.
Cache Acceleration
Optionally enable the cache function. When accessing the same web page repeatedly, read directly from the cache to significantly improve the processing speed.
Advantages
Significantly reduce the token consumption when AI processes web page content (save 90 - 95%)
Provide clean and structured content to improve the accuracy of AI understanding
Support JavaScript dynamic web pages and pages that require authentication
Flexible configuration, and the output content can be adjusted according to needs
Fast processing speed, usually completing the conversion in 2 - 10 seconds
One - click Docker deployment, simple and convenient installation
Limitations
Some extremely complex web page layouts may not be perfectly extracted
The browser mode requires additional installation of Playwright and browsers
Processing very large web pages (over 50MB) may require configuration adjustment
A network connection is required to fetch web page content
Some websites with strict anti - crawling mechanisms may not be accessible
How to Use
Install the Server
Choose the installation method that suits you: use Docker (the simplest), the uv package manager, or pip for installation. If you want to use the browser mode, you also need to install Playwright.
Configure Claude Desktop
Edit the configuration file of Claude Desktop and add the MCP server configuration. Select the correct configuration file path according to your operating system.
Restart Claude Desktop
Completely close and restart the Claude Desktop application to make the configuration take effect.
Start Using
Directly enter the web page link in the Claude conversation or use the html_to_markdown tool command to convert the web page content.
Usage Examples
Research Paper Organization
Convert academic paper web pages into clean Markdown for AI to summarize core ideas and research methods.
Technical Document Learning
Convert complex technical document pages into a readable format to help AI understand API usage or technical concepts.
News Article Analysis
Remove the advertisements and sidebars of news websites and only retain the article text for content analysis.
Private Page Access
Access corporate intranets or social media pages that require login and extract private content.
Frequently Asked Questions
Why is the converted content sometimes incomplete?
How to access websites that require login?
What if the conversion process is too slow?
What is the supported size of web page files?
What if I encounter a Playwright error during installation?
Why can't I see this tool in Claude Desktop?
Related Resources
Official MCP Documentation
Official documentation and specifications of the Model Context Protocol
GitHub Repository
Project source code and latest updates
Trafilatura Library
Documentation of the used web content extraction library
Playwright Documentation
Documentation of the browser automation tool
Docker Installation Guide
Docker installation and basic usage tutorial

Notion Api MCP
Certified
A Python-based MCP Server that provides advanced to-do list management and content organization functions through the Notion API, enabling seamless integration between AI models and Notion.
Python
17.7K
4.5 points

Markdownify MCP
Markdownify is a multi-functional file conversion service that supports converting multiple formats such as PDFs, images, audio, and web page content into Markdown format.
TypeScript
29.9K
5 points

Gitlab MCP Server
Certified
The GitLab MCP server is a project based on the Model Context Protocol that provides a comprehensive toolset for interacting with GitLab accounts, including code review, merge request management, CI/CD configuration, and other functions.
TypeScript
19.4K
4.3 points

Duckduckgo MCP Server
Certified
The DuckDuckGo Search MCP Server provides web search and content scraping services for LLMs such as Claude.
Python
59.2K
4.3 points

Unity
Certified
UnityMCP is a Unity editor plugin that implements the Model Context Protocol (MCP), providing seamless integration between Unity and AI assistants, including real - time state monitoring, remote command execution, and log functions.
C#
25.2K
5 points

Figma Context MCP
Framelink Figma MCP Server is a server that provides access to Figma design data for AI programming tools (such as Cursor). By simplifying the Figma API response, it helps AI more accurately achieve one - click conversion from design to code.
TypeScript
54.8K
4.5 points

Gmail MCP Server
A Gmail automatic authentication MCP server designed for Claude Desktop, supporting Gmail management through natural language interaction, including complete functions such as sending emails, label management, and batch operations.
TypeScript
18.6K
4.5 points

Minimax MCP Server
The MiniMax Model Context Protocol (MCP) is an official server that supports interaction with powerful text-to-speech, video/image generation APIs, and is suitable for various client tools such as Claude Desktop and Cursor.
Python
38.8K
4.8 points
