Documentation Crawler & MCP Server

Created By

alizdavoodia year ago

This project provides a toolset to crawl websites wikis, tool/library documentions and generate Markdown documentation, and make that documentation searchable via a Model Context Protocol (MCP) server, designed for integration with tools like Cursor.

# crawler

# mcp

Overview Content Tools Comments

Overview

What is MCPDocSearch?

MCPDocSearch is a toolset designed to crawl websites, generate Markdown documentation, and make that documentation searchable via a Model Context Protocol (MCP) server, facilitating integration with tools like Cursor.

How to use MCPDocSearch?

To use MCPDocSearch, you first run the crawler_cli to crawl a website and generate a Markdown file. Then, you run the mcp_server to load and serve the documentation, allowing clients like Cursor to query the content.

Key features of MCPDocSearch?

Web Crawler (crawler_cli): Configurable crawling of websites with options for depth, URL patterns, and HTML cleaning.
MCP Server (mcp_server): Loads Markdown files, parses them into semantic chunks, and exposes tools for searching and retrieving documentation.
Cursor Integration: Designed for seamless operation with Cursor, allowing for easy querying of documentation.

Use cases of MCPDocSearch?

Crawling and documenting API references from various websites.
Creating searchable documentation for internal company resources.
Integrating with tools like Cursor for enhanced documentation accessibility.

FAQ from MCPDocSearch?

Can MCPDocSearch crawl any website?

Yes, as long as the website allows crawling and follows the robots.txt rules.

Is there a limit to the crawl depth?

Yes, the maximum crawl depth is configurable, typically between 1 and 5.

How do I integrate MCPDocSearch with Cursor?

You need to configure a .cursor/mcp.json file in the project root with the appropriate settings for the MCP server.

Project Info

Created At

a year ago

Updated At

a year ago

Author Name

alizdavoodi

Star

Language

Python

License

MIT license

Recommend Servers

View All

Devops Mcp — Safe Ai Devops & Linux Server Automation Over Ssh

@MHasnainJafri

devops-mcp lets AI assistants (Claude Desktop, Cursor, Windsurf) connect, scan, plan, and deploy on real Linux servers over SSH — without handing the model the keys to the kingdom. Reading is always allowed; anything that changes state on a production-like server is refused unless the user supplies a secret elevation token the model never sees. Three time-limited modes (SAFE / PROVISION / FULL) with auto-expiry, a production write-gate that demands backup confirmation for irrecoverable ops, shell-quoted arguments, prompt-injection-tagged output, and a full JSON-lines audit log. Self-hosted: clone and build the project locally — an online/hosted MCP is not safe for this security tooling. 32 tools, TypeScript, MIT.

2 days ago

Aws Kb Retrieval Server

@modelcontextprotocol

An MCP server implementation for retrieving information from the AWS Knowledge Base using the Bedrock Agent Runtime.

a year ago

D3vtools

@Igor Ilic

MCP server for 200+ developer utilities — discover and execute tools through a unified API.

21 hours ago

AgentQL MCP Server

@tinyfish-io

Model Context Protocol server that integrates AgentQL's data extraction capabilities.

JavaScript

a year ago

Wax Seal

@degenlegion-com

Cryptographic identity verification for AI agents — verify on-chain seals, validate Ed25519 signatures, and gate high-risk actions with human-signed approvals.

5 hours ago

AI Tool Directory MCP Server

@AI Tool Directory

Query 2,000+ AI tools from your agent — search, compare, find alternatives, and check whether a tool is still alive. Public, read-only, no API key.

5 hours ago

Time

@modelcontextprotocol

A Model Context Protocol server that provides time and timezone conversion capabilities. This server enables LLMs to get current time information and perform timezone conversions using IANA timezone names, with automatic system timezone detection.

5 months ago

Enigmata

@Enigmata

Enigmata MCP provides AI clients with secure access to personalized Enigmata astrological forecasts, including day, week, month, and thematic forecast contexts. Forecasts contain the percentage of favorability of any day and the probability of any event. It helps assistants retrieve structured guidance, compare forecast periods, search themes, check profile readiness, and complete required solar-location setup when needed. Tools get_mcp_capabilities - Returns metadata about the available Enigmata MCP tools and their capabilities. get_day_forecast - Returns an AI-friendly personalized day forecast. get_day_forecast_section - Returns a specific section of a day forecast, useful when a client needs smaller responses. get_week_forecast - Returns an AI-friendly personalized week forecast. get_month_forecast - Returns an AI-friendly personalized month forecast. get_theme_forecast - Returns a forecast for a selected theme, including favorability or probability context. get_theme_best_days - Returns the best days for a selected forecast theme. compare_theme_days - Compares two dates for a selected theme without returning the full theme calendar. get_theme_probability_trends - Returns the subscription-period probability trend for a selected theme. get_theme_catalog - Returns the user’s saved themes and popular available themes. search_themes - Searches forecast themes and returns ranked candidates for precise theme selection. compare_forecasts - Compares two day, week, month, or theme forecast contexts. get_profile_readiness - Returns the user’s profile and onboarding readiness for forecast access. get_solar_requirement - Checks whether a solar meeting city is required for the user’s forecast flow. search_cities - Searches cities that can be used as a solar meeting place. submit_solar_place - Submits the user’s solar meeting city when it is required to unlock or update forecasts.

2 days ago

Fixzi AI Validator

@rohitcodilya

Monitor AI output schemas and API contracts for breaking changes — validate LLM JSON responses against defined schemas on a schedule

2 days ago

Datahyena

@Akash Rajpurohit

Datahyena turns messy growth signals into one clean API. Get funding rounds, acquisitions, and executive moves as structured, deduplicated, enriched records — plus company and investor entities. Pull over REST, stream over webhooks, or connect it to your AI agent over MCP (works with Claude, Cursor, Codex). Free tier: 50 credits, no card.

12 hours ago

Outlit

@Outlit

Outlit gives agents real-time customer understanding to automate customer operations through a hosted remote MCP server.

2 days ago

EdgeOne Pages MCP

@TencentEdgeOne

An MCP service designed for deploying HTML content to EdgeOne Pages and obtaining an accessible public URL.

TypeScript

a year ago

Opencloudcosts Mcp

@x7even

Anchor AI FinOps to real, live cloud pricing. Multi-cloud MCP server for AWS, GCP & Azure — public list prices and enterprise negotiated rates (Reserved Instances, Savings Plans, CUDs, EDPs). No credentials needed to get started. Configuration keys optional for private contract pricing - see README https://github.com/x7even/cloudcostsmcp

an hour ago

Innovation Report Generator

@rd-innovation

PatSnap Innovation Report Generator MCP is a Model Context Protocol server that connects AI agents to PatSnap's Innovation Report Generator capabilities. Turns analytical results into deliverable reports and task-based outputs. It supports enterprise reports, patent value reports, and novelty-check reports for technical ideas, with download and similar-patent comparison support to help teams produce standardized materials more efficiently. It is suitable for client delivery, internal reporting, and pre-sales support.

4 hours ago

Com.virohanalife/site (or Slug Site)

@akshay

virohanalife site

11 hours ago

MCP Advisor

@istarwyh

MCP Advisor & Installation - Use the right MCP server for your needs

TypeScript

a year ago

Qiniu MCP Server

@Qiniu

基于七牛云产品构建的 Model Context Protocol (MCP) Server，支持用户在 AI 大模型客户端的上下文中通过该 MCP Server 来访问七牛云存储资源、利用 Dora 服务进行图片操作等。如果有什么需求欢迎在下方评论，您也可以在 github 仓库中提 issue。

Python

a year ago

Anzsco

@anzsco.com.au

Australian ANZSCO occupation reference with visa eligibility, state nominations, SkillSelect rounds, and tech-synonym search for skilled migration

2 days ago

abap-mcp — read & write ABAP from Claude & Cursor

@joaodelapace

An MCP server that connects Claude, Cursor or any MCP client to a SAP system over the standard ADT API. Read & search the ABAP repository, run syntax checks and classrun; opt-in write/patch/create/activate, implicit enhancements, customizing tables, dynpros, and the SAP Note Assistant. Pure Python standard library, so it starts fast even behind corporate antivirus. Read-only by default; writes require an explicit flag, a confirm, and a transport. Free / MIT.

9 minutes ago

AgentDocs

@AgentDocs

An agent-first office suite your AI reads and writes over MCP — Docs, Sheets, Slides, a Database, Drive, and Notion-style Pages. Sign in with Google; free to start.

2 days ago

Playwright Mcp

@microsoft

Playwright MCP server

TypeScript

a year ago

Stagenth Excel 数据可视化

Turn Excel/CSV into charts over MCP. excel_inspect previews sheets & columns; excel_to_chart renders bar/line/pie/scatter/radar etc. and returns a download URL. stagenth's 2nd MCP service, credit-based.

an hour ago

GitLab

@modelcontextprotocol

GitLab API, enabling project management

a year ago

Agentready Mcp

@AshutoshRaj97

Make any website queryable by AI agents — index any site, ask questions, get cited answers via RAG

2 hours ago

2 days ago

Configure a Specter game backend from chat — players, economy, progression, leaderboards, tournaments, battle passes, and real-time multiplayer — via your AI assistant. Browser sign-in; create/mutate tools opt-in.

a day ago

Obol — Metered Api Marketplace

@superbigroach

Pay-per-call API marketplace for AI agents. Discover and pay for metered services in USDC on Arc — no subscription, no account. Agents pay per call, sellers earn instantly.

15 hours ago

Cool Workflow

@coo1white

Cool Workflow keeps AI agent work in order. It saves signed JSON reports with citations and checks, and can run as an MCP server.

a day ago

19 hours ago

a day ago