MCP-RAG: Modular RAG Pipeline using MCP & GroundX

Created By
sujithadra year ago
MCP server Implantation for RAG (GroundX API)
Overview

what is MCP-RAG?

MCP-RAG is a modular, production-grade implementation of a Retrieval-Augmented Generation (RAG) system, designed to facilitate AI-driven applications with reusable components.

how to use MCP-RAG?

To use MCP-RAG, start the server and utilize the provided commands to ingest documents and perform searches. You can set up your environment by creating a .env file with your API keys and installing the necessary dependencies.

key features of MCP-RAG?

  • Modular Tool Design using MCP server interface
  • YAML-Based Prompt Templates with Jinja2 rendering
  • PDF File Ingestion into GroundX vector store
  • Real-Time Semantic Search via GroundX Search Tool
  • Plug-and-Play API Integration for new tools and services

use cases of MCP-RAG?

  1. Ingesting and processing PDF documents for information retrieval.
  2. Performing semantic searches to generate contextual responses.
  3. Integrating various tools and services for enhanced AI capabilities.

FAQ from MCP-RAG?

  • What is the purpose of MCP-RAG?

MCP-RAG is designed to provide a scalable and flexible framework for building AI-driven applications.

  • How do I set up the environment?

You need to create a .env file with your API keys and install the dependencies using the provided commands.

  • Can I integrate other tools with MCP-RAG?

Yes! MCP-RAG supports plug-and-play API integration for new tools and services.

Project Info
Created At
a year ago
Updated At
a year ago
Author Name
sujithadr
Star
0
Language
Python
License
MIT license

Recommend Servers

View All
Linkpulse

2 days ago
Tavily Mcp
@tavily-ai

JavaScript
a year ago
//beforeyouship — LLM Cost Modeling From Your Editor
@Indiegoing

Query realistic LLM cost models without leaving your editor. beforeyouship models the **true monthly cost** of an LLM app architecture — retries, prompt caching, batch discounts, infra overhead, and 3×/10× growth — across GPT-5.x, Claude, Gemini, DeepSeek, and more. Not a token calculator: a planning tool for the design phase, before you commit to a stack. **No API key needed to try it** — demo mode covers the six free-tier models. A Pro key from [beforeyouship.dev](https://beforeyouship.dev) unlocks the full 18-model catalog. ## What you can ask - "How much will a RAG chatbot cost at 10,000 requests/day?" - "Compare Claude Haiku vs Gemini Flash pricing for my workload" - "What's the cheapest model for a multi-step agent at scale?" - "Show me current per-token prices for Anthropic models" ## Tools ### `estimate_cost` Full cost model for an architecture at a given usage level. Returns Naive / Realistic / Worst Case monthly cost per model, 3×/10× growth scenarios, and an opinionated recommendation with reasoning. ### `get_model_prices` Current per-1M-token pricing — input, output, cached input, batch — with context windows and staleness metadata. ### `list_archetypes` Seven preset architecture patterns (simple chatbot, chatbot with history, RAG pipeline, multi-model router, coding assistant, document processor, multi-step agent) used as starting points for estimates. ## Setup **Claude Code:** ​```bash claude mcp add --transport http beforeyouship https://beforeyouship.dev/api/mcp ​``` **Cursor / other clients** — add a remote server: ​```json { "mcpServers": { "beforeyouship": { "type": "streamable-http", "url": "https://beforeyouship.dev/api/mcp" } } } ​``` Add an `Authorization: Bearer bys_...` header with a Pro key for the full catalog. ## Try it > Estimate the monthly cost of a RAG pipeline at 10,000 requests/day

a day ago