AI Research Agent MCP

An autonomous AI research agent built on the Model Context Protocol (MCP). From a single prompt it can complete the full research process: web search, knowledge base retrieval, code writing, chart generation, and report writing.

Categories: Research and data; Knowledge management and memory. Tags: #AI Research, #Code Execution, #Knowledge Retrieval, #Report Generation. Language: Python

rating : 2.5 points

downloads : 6.1K

update time : 2026-03-13


What is an AI Research Engineer?

An AI Research Engineer is an intelligent research assistant that can understand your research needs and automatically execute multi-step research processes. All you need to do is provide a research topic or question, and it will:

1. Search the web for the latest information
2. Query your personal knowledge base
3. Write and run code for analysis
4. Generate visual charts
5. Create a complete research report
6. Self-assess the quality of the research results

All operations are completed in a secure sandbox environment to ensure the safety of code execution.

How to use an AI Research Engineer?

It's very simple to use:

1. Install and configure in Claude Desktop or Cursor IDE
2. Enter your research requirements
3. Wait for the AI to automatically complete all the work
4. View the generated research reports and files

For example, you can enter: 'Research the market trends of electric vehicles in 2026 and create a growth forecast chart', and the AI will automatically complete all research, analysis, and report generation work.

Applicable Scenarios

This tool is particularly suitable for the following scenarios:

• AI engineers conducting technical research
• Entrepreneurs analyzing market opportunities
• Students completing research projects
• Analysts generating data reports
• Developers validating prototypes
• Knowledge workers organizing information

Whether it's technical research, market analysis, academic research, or data visualization, you can quickly obtain professional-level research results.

Main Features

Intelligent Web Research

Automatically search the web for the latest information through DuckDuckGo or Brave Search, then extract the main content of each page and organize it into structured data.

Personal Knowledge Base Retrieval

Use RAG (retrieval-augmented generation) to search your personal notes, documents, and knowledge base, and combine that personal knowledge with web information to produce personalized research results.

Secure Code Execution

Safely execute Python code in an isolated sandbox environment that supports data analysis, visualization, and modeling, and automatically capture output and charts.

Automatic Report Generation

Automatically organize research results into a structured Markdown report, including data sources, analysis processes, code, and visual charts.

Self-Quality Assessment

Automatically assess the quality of research results, scoring them on multiple dimensions such as clarity, data accuracy, and completeness, and suggest improvements.

Structured File Management

Automatically organize research output files, storing reports, code, charts, and data by date and task for easy review and reuse.

Advantages

🚀 Complete complex research with one click: Automates the entire workflow from search to report

🔒 Secure and reliable: Code is executed in an isolated sandbox to protect system security

📚 Personalized research: Provides customized results by drawing on your personal knowledge base

💾 Local-first: Uses local embedding models by default to protect privacy; no API key is required

📊 Rich visualization: Automatically generates professional-level charts and visualizations

🔄 Reproducible research: Records the full research process so that results can be reproduced

Limitations

⚠️ Dependent on network connection: Web search requires network access (a local knowledge base can be configured)

⚠️ Code limitations: Only Python is supported, restricted to the libraries allowed in the sandbox

⚠️ Configuration required: Installation and environment configuration are needed before first use

⚠️ Research depth: Manual verification may be required in complex professional fields

⚠️ File format: Mainly supports text formats; processing of complex documents is limited

How to Use

Installation Preparation

Ensure that Python 3.10+, Claude Desktop or Cursor IDE, and Git are installed. Using uv is recommended to speed up the installation.

Clone and Install

Download the project code and install the dependency packages. Using uv can significantly speed up the installation.

Configure Claude Desktop

Edit the Claude Desktop configuration file and add the MCP server configuration. Be sure to use absolute paths.

Start Research

Restart Claude Desktop, enter your research requirements in the chat box, and the AI will automatically start working.

Usage Examples

Market Trend Analysis

Analyze the market development trends of a certain industry or technology, and generate data reports and prediction charts.

Technical Comparison Research

Compare the advantages and disadvantages of different technical solutions, and conduct quantitative analysis and visual display.

Financial Model Construction

Build a financial calculator or investment analysis model, and conduct data analysis and visualization.

Academic Literature Review

Collect and organize the research status of an academic field, and generate a literature review report.

Frequently Asked Questions

Do I need a paid API key?

Is code execution safe?

Which file formats are supported for the knowledge base?

Where are the research results saved?

How to add a personal knowledge base?

Does it support Chinese search and research?

Can the research process be reproduced?

What should I do if I encounter installation problems?

Related Resources

GitHub Repository

Project source code, latest version, and issue tracking

Video Demonstration

Complete function demonstration video, showing the whole process from installation to use

MCP Official Documentation

Official documentation and specifications of the Model Context Protocol

Claude Desktop

Download the Claude Desktop client

uv Installation Guide

Installation and use of the fast Python package management tool uv

Installation

Copy the following configuration into your MCP client's configuration file:

{
  "mcpServers": {
    "research-engineer": {
      "command": "/absolute/path/to/python",
      "args": [
        "/absolute/path/to/ai-research-agent-mcp/server/src/server.py"
      ],
      "env": {
        "BRAVE_API_KEY": "your_brave_api_key_here_or_remove_this_line",
        "ANTHROPIC_API_KEY": "your_anthropic_api_key_here_or_remove_this_line",
        "SEARCH_PROVIDER": "duckduckgo",
        "MAX_SEARCH_RESULTS": "10",
        "EMBEDDING_MODEL": "all-MiniLM-L6-v2",
        "USE_LOCAL_EMBEDDINGS": "true",
        "VECTOR_DB_PATH": "/absolute/path/to/ai-research-agent-mcp/data/vector_db",
        "CHUNK_SIZE": "1000",
        "CHUNK_OVERLAP": "200",
        "SANDBOX_TIMEOUT": "30",
        "SANDBOX_MAX_MEMORY_MB": "512",
        "ALLOWED_PACKAGES": "numpy,pandas,matplotlib,seaborn,scipy,scikit-learn",
        "RESEARCH_RUNS_DIR": "/absolute/path/to/ai-research-agent-mcp/research_runs",
        "KNOWLEDGE_BASE_DIR": "/absolute/path/to/ai-research-agent-mcp/knowledge_base",
        "LOG_LEVEL": "INFO",
        "LOG_FILE": "/absolute/path/to/ai-research-agent-mcp/logs/research_engineer.log"
      }
    }
  }
}

To uninstall later, delete the research-engineer entry from mcpServers:

{
  "mcpServers": {
    // Remove this entire block:
    // "research-engineer": { ... }
  }
}

Note: Your API key is sensitive information. Do not share it with anyone.

🚀 MCP-Powered AI Research Engineer

Transform a single prompt into a comprehensive research report, integrating web data, personal notes, code, and charts.

An autonomous AI agent that conducts research, writes code, and generates comprehensive reports using the Model Context Protocol (MCP).

🚀 Quick Start

Get up and running in 5 minutes.

Prerequisites

Python 3.10 or higher (Python 3.11+ recommended)
Claude Desktop or Cursor IDE (MCP-compatible client)
Git (for cloning the repository)
uv (optional but recommended) or pip
(Optional) API keys for enhanced features

Installation Steps

1. Clone the Repository

git clone https://github.com/prabureddy/ai-research-agent-mcp.git
cd ai-research-agent-mcp

2. Install Dependencies

Option A: Using uv (Recommended - 10-100x faster!)

# Navigate to server directory
cd server

# Install uv if you haven't already
# macOS/Linux:
curl -LsSf https://astral.sh/uv/install.sh | sh
# Windows:
# powershell -c "irm https://astral.sh/uv/install.ps1 | iex"

# Create and activate virtual environment
uv venv
source .venv/bin/activate  # Windows: .venv\Scripts\activate

# Install required packages (much faster than pip!)
uv pip install -r requirements.txt

Option B: Using pip (Traditional)

# Navigate to server directory
cd server

# Create and activate virtual environment
python3.11 -m venv venv
source venv/bin/activate  # Windows: venv\Scripts\activate

# Install required packages
pip install -r requirements.txt

3. Configure Environment

# Return to project root
cd ..

# Copy example environment file
cp .env.example .env

# Edit .env with your preferred text editor (optional)
nano .env  # or vim, code, etc.

Environment Configuration (Optional):

# Optional: For Brave Search (better than DuckDuckGo)
BRAVE_API_KEY=your_brave_api_key_here

# Optional: For future Anthropic integrations
ANTHROPIC_API_KEY=your_anthropic_api_key_here

# Optional: Customize paths
RESEARCH_RUNS_DIR=./research_runs
KNOWLEDGE_BASE_DIR=./knowledge_base

# RAG uses local embeddings by default (no API key needed!)
USE_LOCAL_EMBEDDINGS=true
EMBEDDING_MODEL=all-MiniLM-L6-v2

⚠️ Important Note: Never commit your .env file to version control. It's already included in .gitignore.

Note: The system uses local sentence-transformers embeddings by default, so no API keys are required for RAG features!

4. Create Required Directories

# Create directories for data storage
mkdir -p research_runs knowledge_base data/vector_db logs

5. Configure Claude Desktop

macOS: Edit ~/Library/Application Support/Claude/claude_desktop_config.json
Windows: Edit %APPDATA%\Claude\claude_desktop_config.json
Linux: Edit ~/.config/Claude/claude_desktop_config.json
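If you script your setup, the three locations above can be resolved programmatically. The following is a minimal sketch; the function name `claude_config_path` and its parameters are illustrative and not part of the project.

```python
from pathlib import PurePosixPath, PureWindowsPath


def claude_config_path(system: str, home: str, appdata: str = "") -> str:
    """Return the Claude Desktop config path for a platform.

    `system` is the value returned by platform.system():
    "Darwin" (macOS), "Windows", or "Linux".
    """
    filename = "claude_desktop_config.json"
    if system == "Darwin":
        return str(
            PurePosixPath(home) / "Library" / "Application Support" / "Claude" / filename
        )
    if system == "Windows":
        # %APPDATA% is passed in explicitly so the function stays testable.
        return str(PureWindowsPath(appdata) / "Claude" / filename)
    if system == "Linux":
        return str(PurePosixPath(home) / ".config" / "Claude" / filename)
    raise ValueError(f"Unsupported platform: {system}")
```

On a real machine you would call it as `claude_config_path(platform.system(), os.path.expanduser("~"), os.environ.get("APPDATA", ""))`.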

Find your absolute paths first:

# In your project directory, run:
pwd
# Example output: /Users/yourname/Projects/ai-research-agent-mcp

# Find your Python path (if using venv):
which python  # or: which python3.11
# Example output: /Users/yourname/Projects/ai-research-agent-mcp/server/venv/bin/python3.11

Configuration Template: Add the following configuration, replacing the paths and environment variables with your actual values:

{
  "mcpServers": {
    "research-engineer": {
      "command": "/absolute/path/to/python",
      "args": [
        "/absolute/path/to/ai-research-agent-mcp/server/src/server.py"
      ],
      "env": {
        "BRAVE_API_KEY": "your_brave_api_key_here_or_remove_this_line",
        "ANTHROPIC_API_KEY": "your_anthropic_api_key_here_or_remove_this_line",
        "SEARCH_PROVIDER": "duckduckgo",
        "MAX_SEARCH_RESULTS": "10",
        "EMBEDDING_MODEL": "all-MiniLM-L6-v2",
        "USE_LOCAL_EMBEDDINGS": "true",
        "VECTOR_DB_PATH": "/absolute/path/to/ai-research-agent-mcp/data/vector_db",
        "CHUNK_SIZE": "1000",
        "CHUNK_OVERLAP": "200",
        "SANDBOX_TIMEOUT": "30",
        "SANDBOX_MAX_MEMORY_MB": "512",
        "ALLOWED_PACKAGES": "numpy,pandas,matplotlib,seaborn,scipy,scikit-learn",
        "RESEARCH_RUNS_DIR": "/absolute/path/to/ai-research-agent-mcp/research_runs",
        "KNOWLEDGE_BASE_DIR": "/absolute/path/to/ai-research-agent-mcp/knowledge_base",
        "LOG_LEVEL": "INFO",
        "LOG_FILE": "/absolute/path/to/ai-research-agent-mcp/logs/research_engineer.log"
      }
    }
  }
}

⚠️ Important Notes:

Use absolute paths for all file paths (no ~ or relative paths)
If using a virtual environment, use the Python path from inside the venv
Remove or leave empty any API keys you don't have (DuckDuckGo works without keys)
All paths in env must be absolute paths
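A quick way to catch the path mistakes listed above is to check a server entry before saving it. This is a hedged sketch rather than a tool shipped with the project: `check_server_entry` and `PATH_KEYS` are hypothetical names, and the set of path-valued keys is taken from the configuration template.

```python
from pathlib import PurePosixPath, PureWindowsPath

# Keys from the configuration template whose values are filesystem paths.
PATH_KEYS = {"VECTOR_DB_PATH", "RESEARCH_RUNS_DIR", "KNOWLEDGE_BASE_DIR", "LOG_FILE"}


def is_absolute(path: str) -> bool:
    """True for absolute POSIX paths (/...) or Windows paths with a drive (C:/...)."""
    return PurePosixPath(path).is_absolute() or PureWindowsPath(path).is_absolute()


def check_server_entry(entry: dict) -> list[str]:
    """Return a list of human-readable problems found in one mcpServers entry."""
    problems = []
    command = entry.get("command", "")
    # A bare command such as "uv" is resolved via PATH, so only
    # path-like commands (and their script arguments) are checked.
    if "/" in command or "\\" in command:
        if not is_absolute(command):
            problems.append(f"command is not absolute: {command}")
        for arg in entry.get("args", []):
            if arg.endswith(".py") and not is_absolute(arg):
                problems.append(f"script path is not absolute: {arg}")
    for key, value in entry.get("env", {}).items():
        if key in PATH_KEYS and not is_absolute(value):
            problems.append(f"{key} is not absolute: {value}")
    return problems
```

An empty list means the entry passes these checks; paths starting with `~` or `./` are reported.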

Example for macOS/Linux (with venv):

{
  "mcpServers": {
    "research-engineer": {
      "command": "/Users/yourname/Projects/ai-research-agent-mcp/server/venv/bin/python3.11",
      "args": [
        "/Users/yourname/Projects/ai-research-agent-mcp/server/src/server.py"
      ],
      "env": {
        "SEARCH_PROVIDER": "duckduckgo",
        "MAX_SEARCH_RESULTS": "10",
        "USE_LOCAL_EMBEDDINGS": "true",
        "EMBEDDING_MODEL": "all-MiniLM-L6-v2",
        "VECTOR_DB_PATH": "/Users/yourname/Projects/ai-research-agent-mcp/data/vector_db",
        "RESEARCH_RUNS_DIR": "/Users/yourname/Projects/ai-research-agent-mcp/research_runs",
        "KNOWLEDGE_BASE_DIR": "/Users/yourname/Projects/ai-research-agent-mcp/knowledge_base",
        "LOG_FILE": "/Users/yourname/Projects/ai-research-agent-mcp/logs/research_engineer.log"
      }
    }
  }
}

Example for macOS/Linux (with uv):

{
  "mcpServers": {
    "research-engineer": {
      "command": "uv",
      "args": [
        "run",
        "--directory",
        "/Users/yourname/Projects/ai-research-agent-mcp/server",
        "python",
        "src/server.py"
      ],
      "env": {
        "SEARCH_PROVIDER": "duckduckgo",
        "MAX_SEARCH_RESULTS": "10",
        "USE_LOCAL_EMBEDDINGS": "true",
        "EMBEDDING_MODEL": "all-MiniLM-L6-v2",
        "VECTOR_DB_PATH": "/Users/yourname/Projects/ai-research-agent-mcp/data/vector_db",
        "RESEARCH_RUNS_DIR": "/Users/yourname/Projects/ai-research-agent-mcp/research_runs",
        "KNOWLEDGE_BASE_DIR": "/Users/yourname/Projects/ai-research-agent-mcp/knowledge_base",
        "LOG_FILE": "/Users/yourname/Projects/ai-research-agent-mcp/logs/research_engineer.log"
      }
    }
  }
}

Example for Windows:

{
  "mcpServers": {
    "research-engineer": {
      "command": "C:/Users/yourname/Projects/ai-research-agent-mcp/server/venv/Scripts/python.exe",
      "args": [
        "C:/Users/yourname/Projects/ai-research-agent-mcp/server/src/server.py"
      ],
      "env": {
        "SEARCH_PROVIDER": "duckduckgo",
        "MAX_SEARCH_RESULTS": "10",
        "USE_LOCAL_EMBEDDINGS": "true",
        "EMBEDDING_MODEL": "all-MiniLM-L6-v2",
        "VECTOR_DB_PATH": "C:/Users/yourname/Projects/ai-research-agent-mcp/data/vector_db",
        "RESEARCH_RUNS_DIR": "C:/Users/yourname/Projects/ai-research-agent-mcp/research_runs",
        "KNOWLEDGE_BASE_DIR": "C:/Users/yourname/Projects/ai-research-agent-mcp/knowledge_base",
        "LOG_FILE": "C:/Users/yourname/Projects/ai-research-agent-mcp/logs/research_engineer.log"
      }
    }
  }
}

6. Restart Claude Desktop

Completely quit and restart Claude Desktop for changes to take effect.

Verify Installation

In Claude Desktop, type:

List available tools

You should see: web_search, web_research, execute_code, create_research_run, etc.

Your First Research Task

Try this simple task:

Research the current state of electric vehicles in 2026. 
Include market size, major players, and growth trends. 
Create a simple visualization showing EV adoption over time.

The agent will:

Search the web for EV data
Write Python code to create a chart
Present findings with sources

✨ Features

✅ Autonomous multi-step research
✅ Web search and content extraction
✅ RAG over personal knowledge base
✅ Safe code execution with output capture
✅ Structured report generation
✅ Self-evaluation and quality metrics
✅ Comprehensive logging and tracing
✅ Reproducible research runs

📦 Installation

⚡ One‑liner install

Option 1: Using uv (Recommended - Fast!)

git clone https://github.com/prabureddy/ai-research-agent-mcp.git \
  && cd ai-research-agent-mcp/server \
  && uv venv \
  && source .venv/bin/activate \
  && uv pip install -r requirements.txt

Option 2: Using pip (Traditional)

git clone https://github.com/prabureddy/ai-research-agent-mcp.git \
  && cd ai-research-agent-mcp/server \
  && python3 -m venv venv \
  && source venv/bin/activate \
  && pip3 install -r requirements.txt

💻 Usage Examples

Basic Research Task

Simple Query

Research the pros and cons of electric scooters vs bikes for urban commuting.

The agent will:

Search the web for relevant information
Organize findings
Present a summary

Comprehensive Research with Code

Deep dive: Compare electric scooters vs bikes for my 5-mile daily commute. 
Build a cost calculator in Python that shows total cost of ownership over 3 years.
Include purchase price, maintenance, electricity/none, and create visualizations.

The agent will:

Research costs, maintenance, and usage data
Build a Python cost calculator
Create comparison charts
Write a comprehensive report
Save everything to a research run directory
Self-evaluate the work

Tool Usage Examples

Web Research

Search only:

Use web_search to find the latest news about AI regulation in 2026

Comprehensive research with scraping:

Use web_research to gather detailed information about multifamily real estate cap rates, 
and scrape the top 5 results for full content

Scrape specific URL:

Scrape this article and summarize the key points: https://example.com/article

Knowledge Base (RAG)

Index your notes:

Index all files in my knowledge_base directory so I can query them later

Query knowledge base:

Query my knowledge base for information about real estate investment strategies

Combine web + knowledge base:

Research current EV market trends using both web search and my personal notes 
in the knowledge base

Code Execution

Simple calculation:

Write Python code to calculate the compound annual growth rate (CAGR) 
for an investment that grew from $10,000 to $25,000 over 5 years
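
For reference, the code the agent writes for a prompt like this would likely resemble the sketch below. It is an illustration of the CAGR formula, (end / start)^(1 / years) - 1, not output captured from the tool.

```python
def cagr(start_value: float, end_value: float, years: float) -> float:
    """Compound annual growth rate as a fraction (0.20 means 20%)."""
    if start_value <= 0 or years <= 0:
        raise ValueError("start_value and years must be positive")
    return (end_value / start_value) ** (1 / years) - 1


rate = cagr(10_000, 25_000, 5)
print(f"CAGR: {rate:.2%}")  # prints "CAGR: 20.11%"
```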

Data analysis:

Create a Python script that:
1. Generates sample sales data for 12 months
2. Calculates moving averages
3. Creates a line chart with trend line
4. Prints summary statistics
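
A script answering this prompt might look roughly like the following sketch. It covers steps 1, 2, and 4 with the standard library only; the charting step is left out because it needs matplotlib, and the sample data is synthetic.

```python
import random
import statistics


def moving_average(values: list[float], window: int) -> list[float]:
    """Simple moving average; returns len(values) - window + 1 points."""
    if window <= 0 or window > len(values):
        raise ValueError("window must be between 1 and len(values)")
    return [
        sum(values[i:i + window]) / window
        for i in range(len(values) - window + 1)
    ]


# Synthetic monthly sales: an upward trend plus random noise.
random.seed(42)
sales = [round(1000 + 50 * month + random.uniform(-100, 100), 2) for month in range(12)]
trend = moving_average(sales, window=3)

print(f"mean:   {statistics.mean(sales):.2f}")
print(f"median: {statistics.median(sales):.2f}")
print(f"stdev:  {statistics.stdev(sales):.2f}")
print(f"3-month moving average: {[round(v, 2) for v in trend]}")
```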

Financial modeling:

Build a mortgage calculator in Python that:
- Takes loan amount, interest rate, and term
- Calculates monthly payment
- Shows amortization schedule
- Creates a chart showing principal vs interest over time
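
The core of such a calculator is the standard fixed-rate payment formula, M = P · r(1 + r)^n / ((1 + r)^n - 1), with monthly rate r and n monthly payments. The sketch below illustrates it without the charting step; the function names are illustrative.

```python
def monthly_payment(principal: float, annual_rate: float, years: int) -> float:
    """Fixed monthly payment for a fully amortizing loan."""
    n = years * 12
    r = annual_rate / 12
    if r == 0:
        return principal / n  # interest-free loan: equal principal slices
    factor = (1 + r) ** n
    return principal * r * factor / (factor - 1)


def amortization_schedule(principal: float, annual_rate: float, years: int):
    """Return a list of (month, interest, principal_paid, remaining_balance)."""
    payment = monthly_payment(principal, annual_rate, years)
    r = annual_rate / 12
    balance = principal
    schedule = []
    for month in range(1, years * 12 + 1):
        interest = balance * r
        principal_paid = payment - interest
        balance -= principal_paid
        schedule.append((month, interest, principal_paid, max(balance, 0.0)))
    return schedule


print(f"{monthly_payment(100_000, 0.06, 30):.2f}")  # prints 599.55
```

Early payments are mostly interest and late payments mostly principal, which is the pattern a principal-versus-interest chart would show.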

Best Practices

1. Be Specific

❌ Vague:

Research AI

✅ Specific:

Research the current state of large language models in 2026, focusing on:
- Model sizes and capabilities
- Training costs
- Commercial applications
- Regulatory challenges

2. Request Structure

❌ Unstructured:

Tell me about real estate

✅ Structured:

Research multifamily real estate investment in 2026:
1. Current market conditions
2. Financial modeling
3. Risk analysis
4. Recommendations

3. Combine Tools

✅ Effective:

Research electric vehicle adoption rates using:
1. Web search for latest statistics
2. My knowledge base for past analysis
3. Python code to project future adoption
4. Visualizations of trends

4. Request Evaluation

✅ Quality-focused:

After completing the analysis, evaluate your work and tell me:
- What data sources were most valuable?
- What are the limitations of this analysis?
- What would make this analysis more robust?

📚 Documentation

Architecture

System Overview

┌─────────────────────────────────────────────────────────────┐
│                    Claude Desktop / Cursor                   │
│                     (MCP Client/Host)                        │
└────────────────────────┬────────────────────────────────────┘
                         │ MCP Protocol (stdio)
                         │
┌────────────────────────▼────────────────────────────────────┐
│                    MCP Server (Python)                       │
│  ┌──────────────────────────────────────────────────────┐  │
│  │              Tool Registry & Router                   │  │
│  └──────────────────────────────────────────────────────┘  │
│                                                              │
│  ┌──────────────┐  ┌──────────────┐  ┌──────────────┐     │
│  │ Web Research │  │   RAG Tool   │  │Code Sandbox  │     │
│  │              │  │              │  │              │     │
│  │ • Search     │  │ • Embeddings │  │ • Restricted │     │
│  │ • Scrape     │  │ • ChromaDB   │  │   Python     │     │
│  │ • Extract    │  │ • Query      │  │ • Safe Exec  │     │
│  └──────────────┘  └──────────────┘  └──────────────┘     │
│                                                              │
│  ┌──────────────┐  ┌──────────────┐                        │
│  │  Workspace   │  │  Evaluator   │                        │
│  │              │  │              │                        │
│  │ • File I/O   │  │ • Metrics    │                        │
│  │ • Organize   │  │ • Critique   │                        │
│  │ • Manage     │  │ • Quality    │                        │
│  └──────────────┘  └──────────────┘                        │
└─────────────────────────────────────────────────────────────┘
                         │
                         ▼
        ┌────────────────────────────────────┐
        │      External Services              │
        │                                     │
        │  • DuckDuckGo / Brave Search       │
        │  • OpenAI Embeddings API (optional)│
        │  • Web Scraping (HTTP)             │
        └────────────────────────────────────┘
                         │
                         ▼
        ┌────────────────────────────────────┐
        │      Local Storage                  │
        │                                     │
        │  • research_runs/                  │
        │  • knowledge_base/                 │
        │  • data/vector_db/                 │
        │  • logs/                           │
        └────────────────────────────────────┘

Core Components

1. MCP Server (server/src/server.py)

Responsibilities:

Expose tools via MCP protocol
Route tool calls to appropriate handlers
Handle errors and logging
Manage server lifecycle

Technology:

Python 3.10+
MCP SDK (mcp package)
Async/await for I/O operations
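
To make the registry-and-router idea concrete, here is a plain-Python stand-in. It deliberately does not use the MCP SDK, so `ToolRegistry`, its methods, and the placeholder handler are assumptions for illustration rather than the project's actual code.

```python
import asyncio
from typing import Any, Awaitable, Callable

ToolHandler = Callable[[dict[str, Any]], Awaitable[Any]]


class ToolRegistry:
    """Maps tool names to async handlers and routes calls to them."""

    def __init__(self) -> None:
        self._handlers: dict[str, ToolHandler] = {}

    def register(self, name: str):
        def decorator(handler: ToolHandler) -> ToolHandler:
            self._handlers[name] = handler
            return handler
        return decorator

    def list_tools(self) -> list[str]:
        return sorted(self._handlers)

    async def call(self, name: str, arguments: dict[str, Any]) -> dict[str, Any]:
        handler = self._handlers.get(name)
        if handler is None:
            return {"ok": False, "error": f"Unknown tool: {name}"}
        try:
            return {"ok": True, "result": await handler(arguments)}
        except Exception as exc:  # report handler failures instead of crashing
            return {"ok": False, "error": str(exc)}


registry = ToolRegistry()


@registry.register("web_search")
async def web_search(arguments: dict[str, Any]) -> str:
    # Placeholder handler: a real one would query a search provider.
    return f"searched for {arguments['query']}"


print(asyncio.run(registry.call("web_search", {"query": "EV adoption"})))
```

Returning errors as data keeps one failing tool from taking down the server process.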

2. Web Research Tool (server/src/tools/web_research.py)

Responsibilities:

Search the web for information
Scrape and extract clean content
Handle rate limiting and retries

Components:

Search Providers: DuckDuckGo (default, no API key), Brave Search (optional)
Content Extraction: Trafilatura for main content, BeautifulSoup for metadata
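
The README does not say how retries are implemented. One common approach is exponential backoff, sketched below with the standard library; `retry_with_backoff` and its parameters are hypothetical names, and the `flaky` function only simulates a rate-limited request.

```python
import time
from typing import Callable, TypeVar

T = TypeVar("T")


def retry_with_backoff(
    fn: Callable[[], T],
    attempts: int = 3,
    base_delay: float = 0.5,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call fn until it succeeds, doubling the wait after each failure."""
    if attempts < 1:
        raise ValueError("attempts must be at least 1")
    for attempt in range(attempts):
        try:
            return fn()
        except Exception:
            if attempt == attempts - 1:
                raise  # out of attempts: surface the last error
            sleep(base_delay * 2 ** attempt)
    raise AssertionError("unreachable")


# Demo: fail twice, then succeed. Delays are recorded instead of slept.
calls: list[int] = []
delays: list[float] = []


def flaky() -> str:
    calls.append(1)
    if len(calls) < 3:
        raise ConnectionError("rate limited")
    return "ok"


result = retry_with_backoff(flaky, attempts=3, base_delay=0.5, sleep=delays.append)
print(result, delays)  # prints: ok [0.5, 1.0]
```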

3. RAG Tool (server/src/tools/rag_tool.py)

Responsibilities:

Index documents into vector database
Semantic search over knowledge base
Support multiple file formats (Markdown, PDF, DOCX)

Components:

Vector Database: ChromaDB (persistent, local)
Embeddings: sentence-transformers all-MiniLM-L6-v2 (local, default); OpenAI text-embedding-3-small (optional)
Chunking strategy: 1000 chars with 200 char overlap
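
The chunking strategy can be illustrated with a fixed-size sliding window. This is a minimal sketch under the assumption that chunks are cut purely by character count; the project's own splitter may also respect sentence or paragraph boundaries.

```python
def chunk_text(text: str, chunk_size: int = 1000, overlap: int = 200) -> list[str]:
    """Split text into chunk_size-character pieces that share `overlap` characters."""
    if chunk_size <= 0 or not 0 <= overlap < chunk_size:
        raise ValueError("need chunk_size > 0 and 0 <= overlap < chunk_size")
    step = chunk_size - overlap  # each chunk starts 800 chars after the previous one
    chunks = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        if start + chunk_size >= len(text):
            break  # this chunk already reaches the end of the text
    return chunks


sample = "".join(chr(97 + i % 26) for i in range(2500))
chunks = chunk_text(sample)
print([len(c) for c in chunks])  # prints [1000, 1000, 900]
```

The overlap means the last 200 characters of one chunk reappear at the start of the next, so a sentence that straddles a boundary is still retrievable as a whole from at least one chunk.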

4. Code Sandbox (server/src/tools/code_sandbox.py)

Responsibilities:

Execute Python code safely
Capture output and plots
Enforce resource limits

Security Layers:

RestrictedPython: AST-level code restrictions
Resource Limits: Memory and CPU constraints
Timeout: Execution time limits
Allowed Packages: Whitelist of safe libraries (numpy, pandas, matplotlib, etc.)
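
As an illustration of the allowed-packages layer only, the sketch below scans source code for imports outside a whitelist using the standard `ast` module. It is a simpler stand-in, not RestrictedPython and not the project's implementation, and it assumes scikit-learn is matched by its import name `sklearn`.

```python
import ast

# Import names corresponding to the ALLOWED_PACKAGES setting.
ALLOWED_PACKAGES = {"numpy", "pandas", "matplotlib", "seaborn", "scipy", "sklearn"}


def find_disallowed_imports(source: str, allowed: set[str] = ALLOWED_PACKAGES) -> list[str]:
    """Return top-level module names imported by `source` that are not allowed."""
    blocked: list[str] = []
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, ast.Import):
            names = [alias.name for alias in node.names]
        elif isinstance(node, ast.ImportFrom):
            # Relative imports have no module name; treat them as disallowed.
            names = [node.module or "<relative>"]
        else:
            continue
        for name in names:
            top = name.split(".")[0]
            if top not in allowed and top not in blocked:
                blocked.append(top)
    return blocked


code = "import os\nimport numpy as np\nfrom subprocess import run\nfrom matplotlib import pyplot"
print(find_disallowed_imports(code))  # prints ['os', 'subprocess']
```

A static check like this cannot see dynamic imports such as `__import__("os")`, which is one reason the sandbox layers it with runtime restrictions, resource limits, and a timeout.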

5. Workspace Tool (server/src/tools/workspace.py)

Responsibilities:

Organize research outputs
Manage file I/O
Track research runs

Directory Structure:

research_runs/
└── YYYY-MM-DD_HHMMSS_task-name/
    ├── metadata.json
    ├── report.md
    ├── evaluation.json
    ├── sources.json
    ├── code/
    │   └── *.py
    ├── charts/
    │   └── *.png
    └── data/
        └── *.json
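
The layout above can be created with a few lines of `pathlib`. This is a hedged sketch: the helper names are hypothetical, and the slug rule (lowercase, non-alphanumerics collapsed to hyphens) is an assumption inferred from the example directory names in this README.

```python
import re
import tempfile
from datetime import datetime
from pathlib import Path


def run_dir_name(task_name: str, now: datetime) -> str:
    """Build a YYYY-MM-DD_HHMMSS_task-name directory name."""
    slug = re.sub(r"[^a-z0-9]+", "-", task_name.lower()).strip("-")
    return f"{now:%Y-%m-%d_%H%M%S}_{slug}"


def create_run_dir(base: Path, task_name: str, now: datetime) -> Path:
    """Create the run directory with its code/, charts/, and data/ subfolders."""
    run_dir = base / run_dir_name(task_name, now)
    for sub in ("code", "charts", "data"):
        (run_dir / sub).mkdir(parents=True, exist_ok=True)
    return run_dir


# Demo in a temporary directory so nothing is left behind.
with tempfile.TemporaryDirectory() as tmp:
    run_dir = create_run_dir(
        Path(tmp), "Programming Languages", datetime(2026, 2, 7, 14, 30, 22)
    )
    created = sorted(p.name for p in run_dir.iterdir())

print(run_dir.name)  # prints 2026-02-07_143022_programming-languages
print(created)       # prints ['charts', 'code', 'data']
```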

6. Evaluator Tool (server/src/tools/evaluator.py)

Responsibilities:

Quality assessment
Self-critique generation
Metrics tracking

Quality Metrics (0-10 scale):

Clarity, Data Grounding, Completeness, Code Quality, Actionability, Confidence
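
One way to picture how these six scores could be combined is sketched below. The snake_case key names, the unweighted mean, and the `weakest` field are all assumptions for illustration; the README does not specify how evaluation.json is structured.

```python
from statistics import mean

METRICS = (
    "clarity",
    "data_grounding",
    "completeness",
    "code_quality",
    "actionability",
    "confidence",
)


def summarize_evaluation(scores: dict[str, float]) -> dict:
    """Validate 0-10 scores and add an overall mean and the weakest metric."""
    missing = [m for m in METRICS if m not in scores]
    if missing:
        raise ValueError(f"missing metrics: {missing}")
    for metric in METRICS:
        if not 0 <= scores[metric] <= 10:
            raise ValueError(f"{metric} must be between 0 and 10")
    return {
        "scores": {m: scores[m] for m in METRICS},
        "overall": round(mean(scores[m] for m in METRICS), 2),
        "weakest": min(METRICS, key=lambda m: scores[m]),
    }


example = summarize_evaluation(
    {
        "clarity": 8,
        "data_grounding": 6,
        "completeness": 7,
        "code_quality": 9,
        "actionability": 7,
        "confidence": 5,
    }
)
print(example["overall"], example["weakest"])
```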

Configuration

Environment Variables

See .env.example for all available configuration options:

# Search Configuration
BRAVE_API_KEY=...           # Optional: Better search than DuckDuckGo
SEARCH_PROVIDER=duckduckgo  # duckduckgo or brave
MAX_SEARCH_RESULTS=10

# RAG Configuration (uses local embeddings by default)
USE_LOCAL_EMBEDDINGS=true
EMBEDDING_MODEL=all-MiniLM-L6-v2
VECTOR_DB_PATH=./data/vector_db
CHUNK_SIZE=1000
CHUNK_OVERLAP=200

# Code Sandbox Configuration
SANDBOX_TIMEOUT=30
SANDBOX_MAX_MEMORY_MB=512

# Directory Configuration
RESEARCH_RUNS_DIR=./research_runs
KNOWLEDGE_BASE_DIR=./knowledge_base

# Logging
LOG_LEVEL=INFO
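
Because every value arrives as a string, a loader has to convert and validate these settings. The sketch below shows one possible shape using the defaults listed above; the `Settings` class and `load_settings` function are hypothetical and are not taken from the project's config.py.

```python
from dataclasses import dataclass
from typing import Mapping


@dataclass(frozen=True)
class Settings:
    search_provider: str
    max_search_results: int
    use_local_embeddings: bool
    embedding_model: str
    chunk_size: int
    chunk_overlap: int
    sandbox_timeout: int
    sandbox_max_memory_mb: int


def load_settings(env: Mapping[str, str]) -> Settings:
    """Build Settings from environment-style strings, applying documented defaults."""
    settings = Settings(
        search_provider=env.get("SEARCH_PROVIDER", "duckduckgo").lower(),
        max_search_results=int(env.get("MAX_SEARCH_RESULTS", "10")),
        use_local_embeddings=env.get("USE_LOCAL_EMBEDDINGS", "true").lower() == "true",
        embedding_model=env.get("EMBEDDING_MODEL", "all-MiniLM-L6-v2"),
        chunk_size=int(env.get("CHUNK_SIZE", "1000")),
        chunk_overlap=int(env.get("CHUNK_OVERLAP", "200")),
        sandbox_timeout=int(env.get("SANDBOX_TIMEOUT", "30")),
        sandbox_max_memory_mb=int(env.get("SANDBOX_MAX_MEMORY_MB", "512")),
    )
    if settings.search_provider not in {"duckduckgo", "brave"}:
        raise ValueError(f"unknown SEARCH_PROVIDER: {settings.search_provider}")
    if not 0 <= settings.chunk_overlap < settings.chunk_size:
        raise ValueError("CHUNK_OVERLAP must be smaller than CHUNK_SIZE")
    return settings


defaults = load_settings({})
print(defaults.search_provider, defaults.chunk_size, defaults.chunk_overlap)
```

In a real process you would pass `os.environ`; taking a mapping as a parameter keeps the function easy to test.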

Cursor IDE Configuration (Alternative)

1. Open Cursor Settings

Press Cmd+, (Mac) or Ctrl+, (Windows/Linux)

2. Search for "MCP"

Find the MCP Servers configuration section.

3. Add Server Configuration

Add the same configuration as for Claude Desktop (see step 5 of the Quick Start for detailed examples).

Basic Example:

{
  "research-engineer": {
    "command": "/absolute/path/to/python",
    "args": [
      "/absolute/path/to/ai-research-agent-mcp/server/src/server.py"
    ],
    "env": {
      "SEARCH_PROVIDER": "duckduckgo",
      "USE_LOCAL_EMBEDDINGS": "true",
      "VECTOR_DB_PATH": "/absolute/path/to/data/vector_db",
      "RESEARCH_RUNS_DIR": "/absolute/path/to/research_runs",
      "KNOWLEDGE_BASE_DIR": "/absolute/path/to/knowledge_base"
    }
  }
}

Using uv:

{
  "research-engineer": {
    "command": "uv",
    "args": [
      "run",
      "--directory",
      "/absolute/path/to/ai-research-agent-mcp/server",
      "python",
      "src/server.py"
    ],
    "env": {
      "SEARCH_PROVIDER": "duckduckgo",
      "USE_LOCAL_EMBEDDINGS": "true"
    }
  }
}

🔧 Technical Details

Project Structure

Complete file and directory structure:

ai-research-agent-mcp/
│
├── README.md                          # This file - complete documentation
├── LICENSE                            # MIT License
├── .gitignore                         # Git ignore rules
├── .env.example                       # Example environment variables
│
├── server/                            # MCP Server implementation
│   ├── requirements.txt               # Python dependencies
│   ├── pyproject.toml                 # Project metadata and build config
│   │
│   └── src/                           # Source code
│       ├── __init__.py                # Package initialization
│       ├── server.py                  # Main MCP server entry point
│       ├── config.py                  # Configuration management
│       │
│       └── tools/                     # Tool implementations
│           ├── __init__.py            # Tools package initialization
│           ├── web_research.py        # Web search and scraping
│           ├── rag_tool.py            # Vector RAG for knowledge base
│           ├── code_sandbox.py        # Safe Python code execution
│           ├── workspace.py           # File and workspace management
│           └── evaluator.py           # Quality evaluation and critique
│
├── agent/                             # Agent orchestration
│   └── prompts/                       # System prompts and templates
│       └── research_agent.md          # Main research agent prompt
│
├── config/                            # Configuration files
│   └── claude_desktop_config.json     # Example Claude Desktop config
│
├── examples/                          # Example tasks and outputs
│   └── example_research_task.md       # Detailed example with expected output
│
├── knowledge_base/                    # Personal knowledge base (user content)
│   └── example_notes.md               # Example notes for RAG
│
├── research_runs/                     # Research output directory (created at runtime)
│   └── YYYY-MM-DD_HHMMSS_task-name/   # Individual research run
│       ├── metadata.json              # Run metadata
│       ├── report.md                  # Final report
│       ├── evaluation.json            # Self-evaluation
│       ├── sources.json               # Data sources
│       ├── code/                      # Generated code
│       │   └── *.py
│       ├── charts/                    # Visualizations
│       │   └── *.png
│       └── data/                      # Data files
│           └── *.json
│
├── data/                              # Data storage (created at runtime)
│   └── vector_db/                     # ChromaDB vector database
│
└── logs/                              # Log files (created at runtime)
    └── research_engineer.log          # Application logs

Key Files Explained

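File	Purpose
server/src/server.py	Main MCP server with tool registry
server/src/config.py	Configuration loading and validation
server/src/tools/web_research.py	Web search and scraping
server/src/tools/rag_tool.py	Vector database and semantic search
server/src/tools/code_sandbox.py	Safe Python code execution
server/src/tools/workspace.py	File I/O and research run management
server/src/tools/evaluator.py	Quality metrics and self-critique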

📸 See It In Action

📺 Watch the full video demo on YouTube

Example Task → Output:

"Research top 3 programming languages in 2026 and create a comparison chart"

→ research_runs/2026-02-07_143022_programming-languages/
  ├── report.md              # Full analysis with sources
  ├── comparison_chart.png   # Visual comparison
  ├── data.json             # Raw statistics
  └── code/analysis.py      # Generated code

What the agent does: Searches web → Writes code → Creates charts → Generates report → Self-evaluates (all in ~60 seconds)

Troubleshooting

Common Issues and Solutions

ImportError: attempted relative import with no known parent package

Problem:

ImportError: attempted relative import with no known parent package

Solution: The server has been updated to handle both direct execution and module execution. Run:

cd server
python3.11 src/server.py

Filelock Version Incompatibility

Problem:

TypeError: BaseFileLock.__init__() got an unexpected keyword argument 'mode'

Solution:

pip3 install --upgrade filelock

Server Not Starting in Claude Desktop

Problem: Claude Desktop shows "Server not found" or the server doesn't appear in the tools list.

Checklist:

✅ Verify the path in claude_desktop_config.json is absolute
✅ Check that Python 3.11 is installed: which python3.11
✅ Ensure all dependencies are installed: pip3 install -r requirements.txt
✅ Restart Claude Desktop completely (quit and reopen)
✅ Check logs for errors
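The first checklist item can be automated. The sketch below inspects a parsed `claude_desktop_config.json` and reports a missing `research-engineer` entry or a relative script path. It assumes the entry uses the usual `command` and `args` keys and that any argument ending in `.py` is the server script. The helper names are hypothetical.

```python
import json
from pathlib import PurePosixPath, PureWindowsPath


def is_absolute(path: str) -> bool:
    """True if the path is absolute under POSIX or Windows rules."""
    return PurePosixPath(path).is_absolute() or PureWindowsPath(path).is_absolute()


def check_server_entry(config: dict, name: str = "research-engineer") -> list[str]:
    """Return a list of human-readable problems; an empty list means OK."""
    entry = config.get("mcpServers", {}).get(name)
    if entry is None:
        return [f"no '{name}' entry under mcpServers"]
    problems = []
    if not entry.get("command"):
        problems.append("missing 'command'")
    for arg in entry.get("args", []):
        # Assumption: any argument ending in .py is the server script path.
        if arg.endswith(".py") and not is_absolute(arg):
            problems.append(f"script path is not absolute: {arg}")
    return problems


sample = json.loads("""
{"mcpServers": {"research-engineer":
    {"command": "python3.11", "args": ["src/server.py"]}}}
""")
print(check_server_entry(sample))
```

To check a real installation, load the configuration file with `json.load` and pass the resulting dictionary to `check_server_entry`.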

Sentence Transformers Model Download

Problem: First run takes a long time or shows download progress.

Solution: This is normal behavior. The sentence-transformers model (~90MB) is being downloaded on first use. The model is cached locally and subsequent runs will be much faster.

Module Not Found Errors

Problem:

ModuleNotFoundError: No module named 'mcp'

Solution:

cd server
pip3 install -r requirements.txt

# Or if using a virtual environment:
python3.11 -m venv venv
source venv/bin/activate  # Windows: venv\Scripts\activate
pip install -r requirements.txt
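To see which packages are missing before reinstalling, a short check like the one below can help. It is a sketch that only looks up two modules named elsewhere in this README (`mcp` and `chromadb`) without importing them. Run it with the same interpreter that Claude Desktop launches, and extend the list as needed.

```python
from importlib.util import find_spec

# Assumption: top-level import names of two dependencies named in this README.
REQUIRED_MODULES = ["mcp", "chromadb"]


def missing_modules(names: list[str]) -> list[str]:
    """Return the names that the current interpreter cannot locate."""
    return [name for name in names if find_spec(name) is None]


missing = missing_modules(REQUIRED_MODULES)
if missing:
    print("Missing:", ", ".join(missing), "- run: pip install -r requirements.txt")
else:
    print("All required modules are importable")
```

If a module is reported missing inside a virtual environment but present globally, the environment is probably not activated.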

Debugging Tips

Check Server Logs

tail -f logs/research_engineer.log

Test Server Import

cd server
python3.11 -c "from src.server import app; print('✓ Server imports successfully')"

Verify Python Version

python3.11 --version
# Should print Python 3.11.x (the project expects 3.11 or higher)

Check Environment Variables

cat .env

Example Output

Each research task creates a structured output:

research_runs/
└── 2026-02-06_multifamily-real-estate/
    ├── report.md              # Final comprehensive report
    ├── model.py               # Cash-flow model code
    ├── analysis.ipynb         # Jupyter notebook
    ├── charts/                # Generated visualizations
    │   ├── sensitivity.png
    │   └── cashflow.png
    ├── sources.json           # Data sources and citations
    └── evaluation.json        # Quality metrics and self-critique
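A quick way to confirm that a run finished is to check for the top-level files in the tree above. The sketch below assumes those three file names and uses a temporary directory as a stand-in for a real run. The function name `missing_outputs` is hypothetical.

```python
import tempfile
from pathlib import Path

# Assumption: the top-level file names shown in the example tree.
EXPECTED_FILES = ["report.md", "sources.json", "evaluation.json"]


def missing_outputs(run_dir: Path) -> list[str]:
    """Return the expected files that are absent from a run directory."""
    return [name for name in EXPECTED_FILES if not (run_dir / name).is_file()]


with tempfile.TemporaryDirectory() as tmp:
    run = Path(tmp) / "2026-02-06_multifamily-real-estate"
    run.mkdir()
    (run / "report.md").write_text("# Report\n")  # only the report exists
    demo_missing = missing_outputs(run)
    print(demo_missing)
```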

Development

# Run tests
pytest tests/

# Format code
black server/ agent/

# Type checking
mypy server/ agent/

Uninstallation

To completely remove the MCP Research Engineer from your system:

1. Remove from Claude Desktop

Edit your Claude Desktop configuration file:

- macOS: ~/Library/Application Support/Claude/claude_desktop_config.json
- Windows: %APPDATA%\Claude\claude_desktop_config.json
- Linux: ~/.config/Claude/claude_desktop_config.json

Remove the research-engineer entry from mcpServers:

{
  "mcpServers": {
    // Remove this entire block:
    // "research-engineer": { ... }
  }
}

Restart Claude Desktop.
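The edit to `mcpServers` can also be scripted. The sketch below removes the `research-engineer` entry from an in-memory configuration and leaves other servers untouched. The function name `remove_server` and the sample entries are hypothetical.

```python
import copy
import json


def remove_server(config: dict, name: str = "research-engineer") -> dict:
    """Return a copy of the config without the named mcpServers entry."""
    updated = copy.deepcopy(config)
    updated.get("mcpServers", {}).pop(name, None)
    return updated


before = {"mcpServers": {"research-engineer": {"command": "python3.11"},
                         "other-server": {"command": "node"}}}
after = remove_server(before)
print(json.dumps(after, indent=2))
```

To apply this to the real file, read it with `json.load`, pass the result through `remove_server`, and write it back with `json.dump`. Keeping a copy of the original file first is a sensible precaution.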

2. Deactivate Virtual Environment

If you have an active virtual environment:

deactivate

3. Remove Project Directory

# Navigate to parent directory
cd ..

# Remove the entire project
rm -rf ai-research-agent-mcp

⚠️ Warning: This permanently deletes all your research runs, your knowledge base, and your configuration. Back up any important data first (see step 4).

4. Optional: Back Up Important Data

Run these commands before step 3, while the project directory still exists:

# Create the backup directory if it does not exist
mkdir -p ~/backup

# Back up your research outputs
cp -r ai-research-agent-mcp/research_runs ~/backup/research_runs

# Back up your knowledge base
cp -r ai-research-agent-mcp/knowledge_base ~/backup/knowledge_base

# Back up your configuration
cp ai-research-agent-mcp/.env ~/backup/.env
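The same backup can be done from Python, which avoids errors when one of the items does not exist. This is a sketch only: the item names come from the commands above, the function name `backup_project` is hypothetical, and the demo uses a temporary directory rather than a real installation.

```python
import shutil
import tempfile
from pathlib import Path

# Items taken from the backup commands above.
ITEMS = ["research_runs", "knowledge_base", ".env"]


def backup_project(project: Path, dest: Path) -> list[str]:
    """Copy each existing item into dest and return the names copied."""
    dest.mkdir(parents=True, exist_ok=True)
    copied = []
    for name in ITEMS:
        source = project / name
        if source.is_dir():
            shutil.copytree(source, dest / name, dirs_exist_ok=True)
        elif source.is_file():
            shutil.copy2(source, dest / name)
        else:
            continue  # nothing to back up for this item
        copied.append(name)
    return copied


with tempfile.TemporaryDirectory() as tmp:
    project = Path(tmp) / "ai-research-agent-mcp"
    (project / "research_runs").mkdir(parents=True)
    (project / ".env").write_text("EXAMPLE=1\n")
    copied = backup_project(project, Path(tmp) / "backup")
    print(copied)
```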

5. Clean Up Python Packages (Optional)

If you want to remove the Python packages that were installed:

# If you used a virtual environment, just delete it
rm -rf ai-research-agent-mcp/server/venv

# If you installed globally (not recommended), uninstall packages:
pip uninstall -y mcp chromadb sentence-transformers duckduckgo-search trafilatura httpx beautifulsoup4 lxml pypdf python-docx RestrictedPython

Contributing

Contributions are welcome! Here's how you can help:

Reporting Issues

If you find a bug or have a feature request:

1. Check if the issue already exists in GitHub Issues
2. If not, create a new issue with:
- Clear description of the problem or feature
- Steps to reproduce (for bugs)
- Expected vs actual behavior
- Your environment (OS, Python version, etc.)

Pull Requests

1. Fork the repository
2. Create a feature branch: git checkout -b feature/your-feature-name
3. Make your changes
4. Test thoroughly
5. Commit with clear messages: git commit -m "Add feature: description"
6. Push to your fork: git push origin feature/your-feature-name
7. Open a Pull Request with a clear description

Development Setup

# Clone the repository (substitute your fork's URL if you plan to open a pull request)
git clone https://github.com/prabureddy/ai-research-agent-mcp.git
cd ai-research-agent-mcp

# Create virtual environment
cd server
python3.11 -m venv venv
source venv/bin/activate

# Install dependencies including dev tools
pip install -r requirements.txt
pip install pytest black mypy

# Run tests
pytest tests/

# Format code
black server/ agent/

# Type checking
mypy server/ agent/

Code Style

Follow PEP 8 guidelines
Use type hints where appropriate
Add docstrings to functions and classes
Write tests for new features
Keep commits atomic and well-described

Areas for Contribution

🐛 Bug fixes
✨ New tool implementations
📚 Documentation improvements
🧪 Test coverage
🎨 UI/UX improvements
🌐 Additional search providers
📊 New visualization types
🔒 Security enhancements

What's Next?

Try progressively more complex tasks:

Level 1: Simple Research

Research the benefits of meditation

Level 2: Research + Code

Research average home prices in major US cities and create a bar chart

Level 3: Comprehensive Analysis

Analyze whether solar panels are worth it for a home in California.
Include cost analysis, payback period calculation, and recommendations.

Level 4: Full Research Project

Deep dive: Should I invest in multifamily real estate in 2026?
- Research market conditions
- Build cash-flow model
- Run sensitivity analysis
- Create visualizations
- Write comprehensive report
- Self-evaluate the analysis

Getting Help

If you encounter issues:

Check the logs: tail -f logs/research_engineer.log
Verify environment variables: cat .env

Test Python imports:

python -c "import mcp; print('MCP OK')"
python -c "import chromadb; print('ChromaDB OK')"

Check Claude Desktop logs (Help → View Logs)
Review the Troubleshooting section above
Open an issue on GitHub if you need further assistance

Acknowledgments

Built with Model Context Protocol (MCP)
Powered by Claude by Anthropic
Uses ChromaDB for vector storage
Web scraping with Trafilatura

📄 License

MIT

Enjoy your AI Research Engineer! 🚀
