Docsearch Mcp

Created By
PatrickKoss8 months ago
ripgrep for docs but via MCP
Overview

What is Docsearch MCP?

Docsearch MCP is a local-first document search and indexing system that provides hybrid semantic and keyword search across local files (including PDFs) and Confluence pages through the Model Context Protocol (MCP). It is designed to assist AI tools in accessing documentation, codebases, and research materials.

How to use Docsearch MCP?

To use Docsearch MCP, you can install it via npm or run it using Docker. After installation, you can index your documents and search through them using the command-line interface (CLI).

Key features of Docsearch MCP?

  • Hybrid Search: Combines full-text search with vector similarity for optimal results.
  • Multi-Source Indexing: Index local files and Confluence spaces.
  • PDF Support: Extract and search text from PDF documents.
  • Image Search: AI-powered image description and search.
  • Database Flexibility: Supports SQLite and PostgreSQL.
  • Real-time Updates: Automatic re-indexing with file watching.
  • Multiple Output Formats: Supports text, JSON, and YAML outputs.

Use cases of Docsearch MCP?

  1. Searching through code documentation.
  2. Indexing and searching academic papers.
  3. Assisting AI tools in retrieving relevant documentation.

FAQ from Docsearch MCP?

  • Can Docsearch MCP index all file types?

Yes, it supports various file types including code files, PDFs, and images.

  • Is Docsearch MCP free to use?

Yes, it is open-source and free to use.

  • How does the hybrid search work?

It combines traditional keyword search with semantic search using vector embeddings.

Server Config

{
  "mcpServers": {
    "docsearch": {
      "command": "npx",
      "args": [
        "docsearch-mcp",
        "start"
      ],
      "env": {
        "OPENAI_API_KEY": "your-openai-key",
        "EMBEDDINGS_PROVIDER": "openai",
        "FILE_ROOTS": ".,../other-project",
        "DB_PATH": "/path/to/your/index.db"
      }
    }
  }
}
Project Info
Created At
8 months ago
Updated At
7 months ago
Author Name
PatrickKoss
Star
-
Language
-
License
-

Recommend Servers

View All
Tavily Mcp
@tavily-ai

JavaScript
a year ago
Docwand

13 hours ago
//beforeyouship — LLM Cost Modeling From Your Editor
@Indiegoing

Query realistic LLM cost models without leaving your editor. beforeyouship models the **true monthly cost** of an LLM app architecture — retries, prompt caching, batch discounts, infra overhead, and 3×/10× growth — across GPT-5.x, Claude, Gemini, DeepSeek, and more. Not a token calculator: a planning tool for the design phase, before you commit to a stack. **No API key needed to try it** — demo mode covers the six free-tier models. A Pro key from [beforeyouship.dev](https://beforeyouship.dev) unlocks the full 18-model catalog. ## What you can ask - "How much will a RAG chatbot cost at 10,000 requests/day?" - "Compare Claude Haiku vs Gemini Flash pricing for my workload" - "What's the cheapest model for a multi-step agent at scale?" - "Show me current per-token prices for Anthropic models" ## Tools ### `estimate_cost` Full cost model for an architecture at a given usage level. Returns Naive / Realistic / Worst Case monthly cost per model, 3×/10× growth scenarios, and an opinionated recommendation with reasoning. ### `get_model_prices` Current per-1M-token pricing — input, output, cached input, batch — with context windows and staleness metadata. ### `list_archetypes` Seven preset architecture patterns (simple chatbot, chatbot with history, RAG pipeline, multi-model router, coding assistant, document processor, multi-step agent) used as starting points for estimates. ## Setup **Claude Code:** ​```bash claude mcp add --transport http beforeyouship https://beforeyouship.dev/api/mcp ​``` **Cursor / other clients** — add a remote server: ​```json { "mcpServers": { "beforeyouship": { "type": "streamable-http", "url": "https://beforeyouship.dev/api/mcp" } } } ​``` Add an `Authorization: Bearer bys_...` header with a Pro key for the full catalog. ## Try it > Estimate the monthly cost of a RAG pipeline at 10,000 requests/day

13 hours ago
Linkpulse

16 hours ago