Mineru Open Mcp

Created By
OpenDataLab2 months ago
Official MinerU MCP server for document parsing. Convert PDF, DOCX, PPTX, images, and HTML to high-quality Markdown. Supports 109-language OCR, table/formula extraction, and both token-free Flash mode (20 pages / 10MB) and Precision mode (600 pages / 200MB). Works with Claude Desktop, Cursor, and Windsurf.
Overview

MinerU Open MCP (Official)

Official MCP server for MinerU document parsing API.

What it does

  • Parse PDF, DOCX, PPTX, images, and HTML into Markdown
  • OCR support for 109 languages
  • Extract tables and formulas (Precision mode)
  • Works in Claude Desktop, Cursor, Windsurf, and other MCP clients

Modes

  • Flash mode (no token): free, markdown-only, up to 20 pages / 10MB per file
  • Precision mode (token required): up to 600 pages / 200MB, richer parsing and optional extra formats

Tools

  • parse_documents — parse local files or URLs to Markdown
  • get_ocr_languages — list OCR languages
  • clean_logs — clean old log files (when ENABLE_LOG=true)

Quick setup

{
  "mcpServers": {
    "mineru": {
      "command": "uvx",
      "args": ["mineru-open-mcp"],
      "env": {
        "MINERU_API_TOKEN": "your_token_here"
      }
    }
  }
}

Server Config

{
  "mcpServers": {
    "mineru": {
      "command": "uvx",
      "args": [
        "mineru-open-mcp"
      ],
      "env": {
        "MINERU_API_TOKEN": "your token"
      }
    }
  }
}
Project Info
Created At
2 months ago
Updated At
2 months ago
Author Name
OpenDataLab
Star
-
Language
-
License
-
Category

Recommend Servers

View All
Bring your real authenticated browser session to AI coding agents. Local-first MCP server + Chrome MV3 extension. No cloud. No telemetry.
@Cubenest

peek records the user's actual logged-in browser (DOM via rrweb, console events, network metadata, optional response bodies via opt-in Deep capture) through a Chrome MV3 extension. The extension ships events through a native-messaging stdio bridge to a local MCP server (peek-mcp), which persists them to a SQLite database at ~/.peek/sessions.db. AI coding agents (Claude Code, Cursor, Cline, Windsurf) read sessions from the database via 10 MCP tools: Tool What it does list_recent_sessions List recently recorded sessions (id, origin, ts, event count). get_session_summary LLM-readable narrative summary of a session. get_session_console_errors Console errors recorded in a session. get_session_network_errors Failed/notable network requests in a session. get_user_action_before_error Last N user actions before a console error. generate_playwright_repro Generate a runnable Playwright test from a session. get_dom_snapshot Reconstruct the DOM at a given timestamp. query_dom_history Timeline of attribute/text changes for a selector. request_authorization Side-panel consent for write actions (Level 3). execute_action Dispatch a UI action (gated by permission level + destructive blocklist). Why local-first matters Every other "browser session for AI" tool ships to a vendor cloud. peek's SQLite + extension live on the user's machine — no remote endpoints, no telemetry. The privacy policy (docs/peek/PRIVACY_POLICY.md) is the source of truth. Install # 1. Add the MCP server to Claude Code claude mcp add peek -- npx -y @peekdev/mcp # 2. Install the Chrome extension from the Chrome Web Store # (link added once the CWS listing is approved)

19 hours ago
Crevio

a day ago