EasyOCR MCP tool

Created By
WindoCa year ago
A Model Context Protocol (MCP) server that provides OCR capabilities using the EasyOCR library.
Overview

EasyOCR MCP Overview

EasyOCR MCP is a Model Context Protocol (MCP) server that exposes Optical Character Recognition (OCR) capabilities using the EasyOCR library. It allows you to extract text from images via three flexible tools: processing images from base64 strings, local files, or URLs.


Key Features

  • Multiple Input Methods: OCR images from base64, file paths, or URLs.
  • Multi-language Support: Configure supported languages via the EASYOCR_LANGUAGES environment variable.
  • Flexible Output: Choose between text-only or detailed results (with bounding boxes and confidence scores).
  • Performance Optimized: Caches OCR readers for faster repeated requests.
  • Native EasyOCR Output: Returns results in EasyOCR’s original format.

Available Tools

  1. ocr_image_base64: OCR from a base64-encoded image string.
  2. ocr_image_file: OCR from a local image file.
  3. ocr_image_url: OCR from an image URL.

Each tool supports parameters for output detail, paragraph detection, and text merging thresholds.


Usage Example

# Example output with detail=1:
[
    ([[189, 75], [469, 75], [469, 165], [189, 165]], '愚园路', 0.37),
    ([[86, 80], [134, 80], [134, 128], [86, 128]], '西', 0.40)
]

# Example output with detail=0:
['愚园路', '西', '东', '315', '309', 'Yuyuan Rd.', 'W', 'E']

Configuration

  • Languages: Set EASYOCR_LANGUAGES (e.g., en,ch_tra,ja) in your environment or MCP config.
  • Installation: Requires Python, EasyOCR, and PyTorch (with optional GPU support).
  • Running: Start the server with uv run easyocr-mcp.py or via your MCP orchestrator.

Supported Languages

Supports 80+ languages, including English (en), Chinese (ch_sim, ch_tra), Japanese (ja), Korean (ko), French (fr), German (de), Spanish (es), and

Server Config

{
  "mcpServers": {
    "easyocr-mcp": {
      "command": "uv",
      "args": [
        "--directory",
        "/path/to/your/project/easyocr-mcp",
        "run",
        "easyocr-mcp.py"
      ],
      "env": {
        "EASYOCR_LANGUAGES": "en,ch_tra,ja"
      }
    }
  }
}
Project Info
Created At
a year ago
Updated At
a year ago
Author Name
WindoC
Star
-
Language
-
License
-
Category
Tags

Recommend Servers

View All
//beforeyouship — LLM Cost Modeling From Your Editor
@Indiegoing

Query realistic LLM cost models without leaving your editor. beforeyouship models the **true monthly cost** of an LLM app architecture — retries, prompt caching, batch discounts, infra overhead, and 3×/10× growth — across GPT-5.x, Claude, Gemini, DeepSeek, and more. Not a token calculator: a planning tool for the design phase, before you commit to a stack. **No API key needed to try it** — demo mode covers the six free-tier models. A Pro key from [beforeyouship.dev](https://beforeyouship.dev) unlocks the full 18-model catalog. ## What you can ask - "How much will a RAG chatbot cost at 10,000 requests/day?" - "Compare Claude Haiku vs Gemini Flash pricing for my workload" - "What's the cheapest model for a multi-step agent at scale?" - "Show me current per-token prices for Anthropic models" ## Tools ### `estimate_cost` Full cost model for an architecture at a given usage level. Returns Naive / Realistic / Worst Case monthly cost per model, 3×/10× growth scenarios, and an opinionated recommendation with reasoning. ### `get_model_prices` Current per-1M-token pricing — input, output, cached input, batch — with context windows and staleness metadata. ### `list_archetypes` Seven preset architecture patterns (simple chatbot, chatbot with history, RAG pipeline, multi-model router, coding assistant, document processor, multi-step agent) used as starting points for estimates. ## Setup **Claude Code:** ​```bash claude mcp add --transport http beforeyouship https://beforeyouship.dev/api/mcp ​``` **Cursor / other clients** — add a remote server: ​```json { "mcpServers": { "beforeyouship": { "type": "streamable-http", "url": "https://beforeyouship.dev/api/mcp" } } } ​``` Add an `Authorization: Bearer bys_...` header with a Pro key for the full catalog. ## Try it > Estimate the monthly cost of a RAG pipeline at 10,000 requests/day

16 hours ago