Aistudio MCP Server

The AI Studio MCP Server is a model context protocol server integrated with Google AI Studio/Gemini API, providing content generation functions that support files, conversation history, and system prompts.

What is the AI Studio MCP Server?

The AI Studio MCP server is a Model Context Protocol (MCP) service that integrates Google AI Studio / Gemini API and provides content generation capabilities. It supports file uploads, conversation history records, and system prompts.

How to use the AI Studio MCP Server?

You can start the server with a single command-line invocation once the API key is configured. It accepts multiple file types, such as images, PDFs, and audio, and system prompts can be combined with them to steer content generation more precisely.

Applicable Scenarios

It is suitable for various application scenarios such as content generation, document analysis, image recognition, and speech transcription. For example, converting PDFs to Markdown format, analyzing image content, transcribing audio files, etc.

Main Features

Multi-file Support
Supports multiple file types (images, PDFs, audio, text, etc.) and allows uploading multiple files simultaneously for content generation.
System Prompt Support
Allows setting system prompts to guide the behavior and output style of the AI, improving the accuracy of content generation.
Conversation History
Supports saving and recalling conversation history for continuous, multi-turn interaction.
Flexible Configuration
Parameters such as the model, timeout period, and maximum output tokens can be customized through environment variables.
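The environment-variable configuration described above can be illustrated with a minimal Python sketch. The variable names are the ones that appear in the Installation configuration on this page. The fallback values simply copy that sample, and the units (milliseconds for the timeout, megabytes for the total file size) are assumptions rather than documented behavior. This is not the server's own code.

```python
import os
from dataclasses import dataclass


@dataclass(frozen=True)
class GeminiSettings:
    api_key: str
    model: str
    timeout_ms: int              # assumed to be milliseconds
    max_output_tokens: int
    max_files: int
    max_total_file_size_mb: int  # assumed to be megabytes
    temperature: float


def load_settings(env=None):
    """Read settings from a mapping (defaults to os.environ).

    Fallbacks mirror the sample configuration on this page; they are
    illustrative, not the server's documented defaults.
    """
    env = os.environ if env is None else env
    api_key = env.get("GEMINI_API_KEY", "")
    if not api_key:
        raise ValueError("GEMINI_API_KEY is not set")
    return GeminiSettings(
        api_key=api_key,
        model=env.get("GEMINI_MODEL", "gemini-2.5-flash"),
        timeout_ms=int(env.get("GEMINI_TIMEOUT", "600000")),
        max_output_tokens=int(env.get("GEMINI_MAX_OUTPUT_TOKENS", "16384")),
        max_files=int(env.get("GEMINI_MAX_FILES", "10")),
        max_total_file_size_mb=int(env.get("GEMINI_MAX_TOTAL_FILE_SIZE", "50")),
        temperature=float(env.get("GEMINI_TEMPERATURE", "0.2")),
    )
```

Passing a plain dictionary instead of `os.environ` makes the parsing easy to try out without touching the real environment.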

Advantages and Limitations

Advantages
Supports multiple file types, including images, PDFs, and audio.
Provides system prompts to make content generation more controllable.
Supports conversation history for continuous interaction.
Easy to configure and use, so developers can integrate it quickly.
Limitations
Depends on the Google AI Studio API and requires payment of relevant fees.
There are limitations on file size and quantity, which may affect complex tasks.
Does not support Chinese input and requires additional language conversion processing.
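Because file count and total size are limited, a client may want to check a batch of files before sending it. The following is a minimal sketch of such a pre-flight check. The function name `check_file_limits` is hypothetical, and the default limits (10 files, 50 MB in total) are taken from the sample configuration on this page, with the megabyte unit being an assumption.

```python
import os


def check_file_limits(paths, max_files=10, max_total_mb=50):
    """Return (ok, reason) for a list of local file paths.

    Hypothetical client-side helper; the server applies its own limits.
    """
    if len(paths) > max_files:
        return False, f"too many files: {len(paths)} > {max_files}"
    total = sum(os.path.getsize(p) for p in paths)
    limit = max_total_mb * 1024 * 1024
    if total > limit:
        return False, f"total size {total} bytes exceeds {limit} bytes"
    return True, "ok"
```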

How to Use

Install the Server
Install the AI Studio MCP server using npm or run it directly via npx.
Configure the API Key
Set the Google AI Studio API key so that the server can access the Gemini API.
Start the Server
Once the server is running, connect to it and send requests through an MCP client.
Send a Request
Use the MCP client to send requests to the server, including user prompts, system prompts, and file information.
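To make the request step concrete, here is a sketch of the JSON-RPC 2.0 message an MCP client sends to invoke a tool. The `tools/call` method and the `name`/`arguments` parameters come from the MCP specification. The tool name `generate_content` and the argument keys `user_prompt`, `system_prompt`, and `files` are assumptions for illustration and should be checked against the tool list the server actually reports. In practice an MCP client library builds and sends this message for you.

```python
import json


def build_tool_call(request_id, user_prompt, system_prompt=None, files=None):
    """Build an MCP tools/call request as a plain dict.

    Tool name and argument keys are hypothetical, not confirmed by this page.
    """
    arguments = {"user_prompt": user_prompt}
    if system_prompt:
        arguments["system_prompt"] = system_prompt
    if files:
        arguments["files"] = [{"path": p} for p in files]
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": "generate_content", "arguments": arguments},
    }


def encode_message(message):
    """Serialize one message as a single newline-terminated JSON line,
    the framing used by the MCP stdio transport."""
    return json.dumps(message) + "\n"


if __name__ == "__main__":
    request = build_tool_call(
        1,
        "Convert this PDF to Markdown.",
        system_prompt="Preserve headings and tables.",
        files=["report.pdf"],
    )
    print(encode_message(request), end="")
```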

Usage Examples

PDF to Markdown Conversion
Convert a PDF file to a structured Markdown format for subsequent editing and processing.
Image Analysis
Provide a detailed description of the uploaded image, including objects, colors, text, etc.
Audio Transcription
Convert an audio file to text, suitable for scenarios such as meeting records and interview transcriptions.

Frequently Asked Questions

What dependencies does the AI Studio MCP server require?
What file types are supported?
How to set system prompts?
What is the maximum file limit of the server?

Related Resources

Official Documentation
The official documentation of Google AI Studio to understand API details and usage methods.
GitHub Repository
The source code and development documentation of the AI Studio MCP server.
Tutorial Video
A video tutorial on how to use the AI Studio MCP server.

Installation

Copy the following configuration into your MCP client's configuration file:
{
  "mcpServers": {
    "aistudio": {
      "command": "npx",
      "args": ["-y", "aistudio-mcp-server"],
      "env": {
        "GEMINI_API_KEY": "your_api_key_here",
        "GEMINI_MODEL": "gemini-2.5-flash",
        "GEMINI_TIMEOUT": "600000",
        "GEMINI_MAX_OUTPUT_TOKENS": "16384",
        "GEMINI_MAX_FILES": "10",
        "GEMINI_MAX_TOTAL_FILE_SIZE": "50",
        "GEMINI_TEMPERATURE": "0.2"
      }
    }
  }
}
Note: Your API key is sensitive information. Do not share it with anyone.
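One way to avoid pasting the key by hand is to generate the client configuration from an environment variable. The sketch below rebuilds the same `mcpServers` structure shown above. The helper name `build_client_config` is hypothetical, and only the key and model are included for brevity. Where the resulting JSON should be saved depends on the MCP client and is not covered here.

```python
import json
import os


def build_client_config(api_key, model="gemini-2.5-flash"):
    """Return the mcpServers entry for this server as a dict."""
    return {
        "mcpServers": {
            "aistudio": {
                "command": "npx",
                "args": ["-y", "aistudio-mcp-server"],
                "env": {
                    "GEMINI_API_KEY": api_key,
                    "GEMINI_MODEL": model,
                },
            }
        }
    }


if __name__ == "__main__":
    # Falls back to the placeholder when the variable is not set.
    key = os.environ.get("GEMINI_API_KEY", "your_api_key_here")
    print(json.dumps(build_client_config(key), indent=2))
```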
AIbase
Zhiqi Future, Your AI Solution Think Tank
© 2025 AIbase