Mariadb Cloud Hybrid Rag Search
M

Mariadb Cloud Hybrid Rag Search

This is a demonstration project of a hybrid retrieval augmented generation (RAG) search system based on MariaDB Cloud. It integrates MariaDB vector search and Brave Search web search enhancement implemented through the FastMCP server, uses the 20 Newsgroups dataset as an example, and provides a plug-and-play hybrid search architecture.
2.5 points
7.4K

What is MariaDB Cloud Hybrid RAG-Ready Search?

This is a complete search solution that combines your own data (stored in MariaDB Cloud) with real-time web search results. It uses vector search technology to understand the semantic meaning of queries and obtains the latest web information through the Brave search API, providing comprehensive and accurate retrieval results for AI applications.

How to use this search service?

You can start the search service through a simple Python script, input a query question, and the system will automatically retrieve relevant information from your database and the Internet and return the results in a structured format. The whole process does not require complex configuration and is ready to use out of the box.

Use cases

Suitable for various applications that need to combine internal knowledge and external information, such as intelligent customer service, research assistants, product recommendation systems, and enterprise knowledge base enhancement. It is particularly suitable for RAG application scenarios that require real-time information.

Main features

Hybrid search capability
Simultaneously retrieve structured data in the local database and real-time information on the Internet, providing comprehensive search results
Semantic understanding search
Use vector search technology to understand the semantic meaning of queries, rather than just keyword matching
Real-time web enhancement
Obtain the latest web information through the Brave search API to ensure the timeliness of the results
RAG-ready architecture
Designed specifically for retrieval augmented generation applications and can be directly integrated into AI dialogue systems
Easy to deploy
Implemented in pure Python, without complex dependencies such as Node.js, and can be quickly deployed to any environment
Scalable design
Modular architecture, supporting custom data sources and search strategies
Advantages
One-stop solution: Combines database management and web search functions
Real-time: Web search results are kept up-to-date
Accuracy: Semantic search provides more relevant internal results
Cost-effective: Uses the free or low-cost Brave search API
Easy to integrate: Provides clear APIs and example code
Scalability: Supports custom data sources and search logic
Limitations
Dependence on external APIs: Requires a Brave search API key
Requires a MariaDB Cloud account: Database service requires registration
Network latency: External search may increase response time
Data volume limitation: Free APIs may have call limitations
Configuration complexity: Requires correct setting of multiple configuration files

How to use

Environment preparation
Install Python 3.10+ and necessary dependency packages
Get API keys
Register a free trial account for MariaDB Cloud and a free key for the Brave search API
Configure connection information
Set database connection information and API keys in the configuration file
Initialize the database
Create the database table structure and vector index
Load example data
Import the 20 Newsgroups dataset and generate vector embeddings
Start the search service
Run the main program to start using the hybrid search function

Usage examples

Technical research assistant
Researchers need to understand the latest developments and historical background in a certain technical field
Product information query
Users want to understand the technical specifications and user reviews of a certain product
Enterprise knowledge base enhancement
Employees need to query company internal documents and relevant industry information

Frequently Asked Questions

Do I need to pay to use this solution?
How to replace the example data with my business data?
What is the approximate search response time?
Does it support Chinese search?
How to integrate the results into my AI application?
What should I do if I encounter a connection error?

Related resources

MariaDB Cloud Free Trial
Register a free trial account for MariaDB Cloud
Brave Search API
Get a free Brave search API key
GitHub Repository
Complete source code and documentation
MariaDB Vector Search Documentation
Detailed documentation on MariaDB vector search function
FastMCP Documentation
Documentation for the FastMCP server framework
RAG Application Guide
Best practices for building RAG applications with MariaDB

Installation

Copy the following command to your Client for configuration
Note: Your key is sensitive information, do not share it with anyone.

Alternatives

A
Airweave
Airweave is an open - source context retrieval layer for AI agents and RAG systems. It connects and synchronizes data from various applications, tools, and databases, and provides relevant, real - time, multi - source contextual information to AI agents through a unified search interface.
Python
7.1K
5 points
B
Better Icons
An MCP server and CLI tool that provides search and retrieval of over 200,000 icons, supports more than 150 icon libraries, and helps AI assistants and developers quickly obtain and use icons.
TypeScript
6.9K
4.5 points
H
Haiku.rag
Haiku RAG is an intelligent retrieval - augmented generation system built on LanceDB, Pydantic AI, and Docling. It supports hybrid search, re - ranking, Q&A agents, multi - agent research processes, and provides local - first document processing and MCP server integration.
Python
9.3K
5 points
C
Claude Context
Claude Context is an MCP plugin that provides in - depth context of the entire codebase for AI programming assistants through semantic code search. It supports multiple embedding models and vector databases to achieve efficient code retrieval.
TypeScript
17.9K
5 points
A
Acemcp
Acemcp is an MCP server for codebase indexing and semantic search, supporting automatic incremental indexing, multi-encoding file processing, .gitignore integration, and a Web management interface, helping developers quickly search for and understand code context.
Python
18.0K
5 points
M
MCP
The Microsoft official MCP server provides search and access functions for the latest Microsoft technical documentation for AI assistants
15.0K
5 points
A
Annas MCP
The MCP server and CLI tool of Anna's Archive are used to search for and download documents on the platform and support access through an API key.
Go
13.0K
4.5 points
S
Search1api
The Search1API MCP Server is a server based on the Model Context Protocol (MCP), providing search and crawling functions, and supporting multiple search services and tools.
TypeScript
16.9K
4 points
G
Gitlab MCP Server
Certified
The GitLab MCP server is a project based on the Model Context Protocol that provides a comprehensive toolset for interacting with GitLab accounts, including code review, merge request management, CI/CD configuration, and other functions.
TypeScript
24.6K
4.3 points
M
Markdownify MCP
Markdownify is a multi-functional file conversion service that supports converting multiple formats such as PDFs, images, audio, and web page content into Markdown format.
TypeScript
35.5K
5 points
D
Duckduckgo MCP Server
Certified
The DuckDuckGo Search MCP Server provides web search and content scraping services for LLMs such as Claude.
Python
72.4K
4.3 points
N
Notion Api MCP
Certified
A Python-based MCP Server that provides advanced to-do list management and content organization functions through the Notion API, enabling seamless integration between AI models and Notion.
Python
20.5K
4.5 points
F
Figma Context MCP
Framelink Figma MCP Server is a server that provides access to Figma design data for AI programming tools (such as Cursor). By simplifying the Figma API response, it helps AI more accurately achieve one - click conversion from design to code.
TypeScript
65.6K
4.5 points
U
Unity
Certified
UnityMCP is a Unity editor plugin that implements the Model Context Protocol (MCP), providing seamless integration between Unity and AI assistants, including real - time state monitoring, remote command execution, and log functions.
C#
32.3K
5 points
G
Gmail MCP Server
A Gmail automatic authentication MCP server designed for Claude Desktop, supporting Gmail management through natural language interaction, including complete functions such as sending emails, label management, and batch operations.
TypeScript
22.1K
4.5 points
M
Minimax MCP Server
The MiniMax Model Context Protocol (MCP) is an official server that supports interaction with powerful text-to-speech, video/image generation APIs, and is suitable for various client tools such as Claude Desktop and Cursor.
Python
49.1K
4.8 points
AIBase
Zhiqi Future, Your AI Solution Think Tank
© 2026AIBase