Mariadb Cloud Hybrid Rag Search
This is a demonstration project of a hybrid retrieval augmented generation (RAG) search system based on MariaDB Cloud. It integrates MariaDB vector search and Brave Search web search enhancement implemented through the FastMCP server, uses the 20 Newsgroups dataset as an example, and provides a plug-and-play hybrid search architecture.
rating : 2.5 points
downloads : 7.4K
What is MariaDB Cloud Hybrid RAG-Ready Search?
This is a complete search solution that combines your own data (stored in MariaDB Cloud) with real-time web search results. It uses vector search technology to understand the semantic meaning of queries and obtains the latest web information through the Brave search API, providing comprehensive and accurate retrieval results for AI applications.How to use this search service?
You can start the search service through a simple Python script, input a query question, and the system will automatically retrieve relevant information from your database and the Internet and return the results in a structured format. The whole process does not require complex configuration and is ready to use out of the box.Use cases
Suitable for various applications that need to combine internal knowledge and external information, such as intelligent customer service, research assistants, product recommendation systems, and enterprise knowledge base enhancement. It is particularly suitable for RAG application scenarios that require real-time information.Main features
Hybrid search capability
Simultaneously retrieve structured data in the local database and real-time information on the Internet, providing comprehensive search results
Semantic understanding search
Use vector search technology to understand the semantic meaning of queries, rather than just keyword matching
Real-time web enhancement
Obtain the latest web information through the Brave search API to ensure the timeliness of the results
RAG-ready architecture
Designed specifically for retrieval augmented generation applications and can be directly integrated into AI dialogue systems
Easy to deploy
Implemented in pure Python, without complex dependencies such as Node.js, and can be quickly deployed to any environment
Scalable design
Modular architecture, supporting custom data sources and search strategies
Advantages
One-stop solution: Combines database management and web search functions
Real-time: Web search results are kept up-to-date
Accuracy: Semantic search provides more relevant internal results
Cost-effective: Uses the free or low-cost Brave search API
Easy to integrate: Provides clear APIs and example code
Scalability: Supports custom data sources and search logic
Limitations
Dependence on external APIs: Requires a Brave search API key
Requires a MariaDB Cloud account: Database service requires registration
Network latency: External search may increase response time
Data volume limitation: Free APIs may have call limitations
Configuration complexity: Requires correct setting of multiple configuration files
How to use
Environment preparation
Install Python 3.10+ and necessary dependency packages
Get API keys
Register a free trial account for MariaDB Cloud and a free key for the Brave search API
Configure connection information
Set database connection information and API keys in the configuration file
Initialize the database
Create the database table structure and vector index
Load example data
Import the 20 Newsgroups dataset and generate vector embeddings
Start the search service
Run the main program to start using the hybrid search function
Usage examples
Technical research assistant
Researchers need to understand the latest developments and historical background in a certain technical field
Product information query
Users want to understand the technical specifications and user reviews of a certain product
Enterprise knowledge base enhancement
Employees need to query company internal documents and relevant industry information
Frequently Asked Questions
Do I need to pay to use this solution?
How to replace the example data with my business data?
What is the approximate search response time?
Does it support Chinese search?
How to integrate the results into my AI application?
What should I do if I encounter a connection error?
Related resources
MariaDB Cloud Free Trial
Register a free trial account for MariaDB Cloud
Brave Search API
Get a free Brave search API key
GitHub Repository
Complete source code and documentation
MariaDB Vector Search Documentation
Detailed documentation on MariaDB vector search function
FastMCP Documentation
Documentation for the FastMCP server framework
RAG Application Guide
Best practices for building RAG applications with MariaDB

Gitlab MCP Server
Certified
The GitLab MCP server is a project based on the Model Context Protocol that provides a comprehensive toolset for interacting with GitLab accounts, including code review, merge request management, CI/CD configuration, and other functions.
TypeScript
24.6K
4.3 points

Markdownify MCP
Markdownify is a multi-functional file conversion service that supports converting multiple formats such as PDFs, images, audio, and web page content into Markdown format.
TypeScript
35.5K
5 points

Duckduckgo MCP Server
Certified
The DuckDuckGo Search MCP Server provides web search and content scraping services for LLMs such as Claude.
Python
72.4K
4.3 points

Notion Api MCP
Certified
A Python-based MCP Server that provides advanced to-do list management and content organization functions through the Notion API, enabling seamless integration between AI models and Notion.
Python
20.5K
4.5 points

Figma Context MCP
Framelink Figma MCP Server is a server that provides access to Figma design data for AI programming tools (such as Cursor). By simplifying the Figma API response, it helps AI more accurately achieve one - click conversion from design to code.
TypeScript
65.6K
4.5 points

Unity
Certified
UnityMCP is a Unity editor plugin that implements the Model Context Protocol (MCP), providing seamless integration between Unity and AI assistants, including real - time state monitoring, remote command execution, and log functions.
C#
32.3K
5 points

Gmail MCP Server
A Gmail automatic authentication MCP server designed for Claude Desktop, supporting Gmail management through natural language interaction, including complete functions such as sending emails, label management, and batch operations.
TypeScript
22.1K
4.5 points

Minimax MCP Server
The MiniMax Model Context Protocol (MCP) is an official server that supports interaction with powerful text-to-speech, video/image generation APIs, and is suitable for various client tools such as Claude Desktop and Cursor.
Python
49.1K
4.8 points



