TextIn OCR MCP

Created By
intsig-textina year ago
TextIn.com 官方OCR MCP Server, 支持快速识别文档中的ocr文字,转换文档为markdown,智能识别文档中的关键信息。
Overview

What is TextIn?

TextIn is a server that supports any MCP protocol client, is a tool for extracting text and performing OCR on documents, including document text recognition, ID recognition, and invoice recognition. It also supports converting documents into Markdown format.

How to use TextIn MCP?

To use TextIn MCP, configure it in a compatible client like Cursor by copying your APP ID and APP SECRET and setting up the server command as specified in the documentation.

Use cases of TextIn MCP?

  1. Text recognition from images, Word documents, and PDF files.
  2. Convert images, PDFs, and Word documents to Markdown.
  3. Automatically and intelligently extract key information from documents.

Tools

  • recognition_text

    • Text recognition from images, Word documents, and PDF files.
    • Inputs:
      • path (string, required): file path or a URL (HTTP/HTTPS) pointing to a document
    • Return: Text of the document.
    • Supports conversion for:
      • PDF
      • Image (Jpeg, Jpg, Png, Bmp)
  • doc_to_markdown

    • Convert images, PDFs, and Word documents to Markdown.
    • Inputs:
      • path (string, required): file path or a URL (HTTP/HTTPS) pointing to a document
    • Return: Markdown of the document.
    • Supports conversion for:
      • PDF
      • Microsoft Office Documents (Word, Excel)
      • Image (Jpeg, Jpg, Png, Bmp)
  • general_information_extration

    • Automatically identify and extract information from documents, or identify and extract user-specified information.
    • Inputs:
      • path (string, required): file path or a URL (HTTP/HTTPS) pointing to a document
      • key (string[], optional): The non-tabular text information that the user wants to identify, input format is an array of strings.
      • table_header (string[], optional): The table information that the user wants to identify, input format is an array of strings.
    • Return: The key information JSON.
    • Supports conversion for:
      • PDF
      • Microsoft Office Documents (Word, Excel)
      • Image (Jpeg, Jpg, Png, Bmp)

When the input is a URL, it does not support handling access to protected resources.

FAQ from TextIn MCP?

  • How to register a TextIn account?

    Click here to register for a free account.

  • Is there a limit to the number of requests I can make?

    The usage limits depend on the APP ID and the specific service being used. Please refer to the TextIn documentation for details.

  • How do I obtain an API key?

    Get TextIn APP_KEY and APP_SECRET by following the instructions here.

Server Config

{
  "mcpServers": {
    "textin-ocr": {
      "command": "npx",
      "args": [
        "-y",
        "@intsig/server-textin"
      ],
      "env": {
        "APP_ID": "<YOUR_APP_ID>",
        "APP_SECRET": "<YOUR_APP_SECRET>",
        "MCP_SERVER_REQUEST_TIMEOUT": "600000"
      },
      "timeout": 600
    }
  }
}
Project Info
Created At
a year ago
Updated At
a year ago
Author Name
intsig-textin
Star
-
Language
-
License
-
Category

Recommend Servers

View All
Bring your real authenticated browser session to AI coding agents. Local-first MCP server + Chrome MV3 extension. No cloud. No telemetry.
@Cubenest

peek records the user's actual logged-in browser (DOM via rrweb, console events, network metadata, optional response bodies via opt-in Deep capture) through a Chrome MV3 extension. The extension ships events through a native-messaging stdio bridge to a local MCP server (peek-mcp), which persists them to a SQLite database at ~/.peek/sessions.db. AI coding agents (Claude Code, Cursor, Cline, Windsurf) read sessions from the database via 10 MCP tools: Tool What it does list_recent_sessions List recently recorded sessions (id, origin, ts, event count). get_session_summary LLM-readable narrative summary of a session. get_session_console_errors Console errors recorded in a session. get_session_network_errors Failed/notable network requests in a session. get_user_action_before_error Last N user actions before a console error. generate_playwright_repro Generate a runnable Playwright test from a session. get_dom_snapshot Reconstruct the DOM at a given timestamp. query_dom_history Timeline of attribute/text changes for a selector. request_authorization Side-panel consent for write actions (Level 3). execute_action Dispatch a UI action (gated by permission level + destructive blocklist). Why local-first matters Every other "browser session for AI" tool ships to a vendor cloud. peek's SQLite + extension live on the user's machine — no remote endpoints, no telemetry. The privacy policy (docs/peek/PRIVACY_POLICY.md) is the source of truth. Install # 1. Add the MCP server to Claude Code claude mcp add peek -- npx -y @peekdev/mcp # 2. Install the Chrome extension from the Chrome Web Store # (link added once the CWS listing is approved)

a day ago