Mcp Local Rag

Created By
imagesa year ago
Overview

what is Mcp Local Rag?

Mcp Local Rag is a primitive RAG-like web search model context protocol (MCP) server that runs locally without the need for APIs. It allows users to submit queries to a language model and fetch real-time web information to enhance the model's responses.

how to use Mcp Local Rag?

To use Mcp Local Rag, you can either run it using Docker or Python with the uv framework. After setting it up, you can submit queries through a language model that triggers the MCP server to fetch relevant web data.

key features of Mcp Local Rag?

  • Local execution without API dependencies
  • Real-time web search capabilities
  • Context extraction from web results to enhance language model outputs

use cases of Mcp Local Rag?

  1. Enhancing language models with up-to-date web information.
  2. Performing live searches for recent events or data.
  3. Integrating with AI chatbots to provide accurate and current responses.

FAQ from Mcp Local Rag?

  • Can Mcp Local Rag be used without an internet connection?

No, it requires internet access to perform web searches.

  • Is Mcp Local Rag easy to set up?

Yes, it can be set up using Docker or Python with straightforward configuration steps.

  • What kind of queries can I submit?

You can submit any query that requires recent web information, such as news or updates on specific topics.

Server Config

{
  "mcpServers": {
    "mcp-local-rag": {
      "command": "uvx",
      "args": [
        "--python=3.10",
        "--from",
        "git+https://github.com/nkapila6/mcp-local-rag",
        "mcp-local-rag"
      ]
    }
  }
}
Project Info
Created At
a year ago
Updated At
a year ago
Author Name
images
Star
-
Language
-
License
-

Recommend Servers

View All
Mnemom

14 hours ago
//beforeyouship — LLM Cost Modeling From Your Editor
@Indiegoing

Query realistic LLM cost models without leaving your editor. beforeyouship models the **true monthly cost** of an LLM app architecture — retries, prompt caching, batch discounts, infra overhead, and 3×/10× growth — across GPT-5.x, Claude, Gemini, DeepSeek, and more. Not a token calculator: a planning tool for the design phase, before you commit to a stack. **No API key needed to try it** — demo mode covers the six free-tier models. A Pro key from [beforeyouship.dev](https://beforeyouship.dev) unlocks the full 18-model catalog. ## What you can ask - "How much will a RAG chatbot cost at 10,000 requests/day?" - "Compare Claude Haiku vs Gemini Flash pricing for my workload" - "What's the cheapest model for a multi-step agent at scale?" - "Show me current per-token prices for Anthropic models" ## Tools ### `estimate_cost` Full cost model for an architecture at a given usage level. Returns Naive / Realistic / Worst Case monthly cost per model, 3×/10× growth scenarios, and an opinionated recommendation with reasoning. ### `get_model_prices` Current per-1M-token pricing — input, output, cached input, batch — with context windows and staleness metadata. ### `list_archetypes` Seven preset architecture patterns (simple chatbot, chatbot with history, RAG pipeline, multi-model router, coding assistant, document processor, multi-step agent) used as starting points for estimates. ## Setup **Claude Code:** ​```bash claude mcp add --transport http beforeyouship https://beforeyouship.dev/api/mcp ​``` **Cursor / other clients** — add a remote server: ​```json { "mcpServers": { "beforeyouship": { "type": "streamable-http", "url": "https://beforeyouship.dev/api/mcp" } } } ​``` Add an `Authorization: Bearer bys_...` header with a Pro key for the full catalog. ## Try it > Estimate the monthly cost of a RAG pipeline at 10,000 requests/day

13 hours ago
Docwand

13 hours ago