Pdf Reader

Created By
sunderbharath8510 months ago
An MCP server that can extract text from PDFs and images. It supports: * Native text extraction from PDFs (embedded text) * OCR for scanned PDFs and images using Tesseract
Overview

pdf-reader-mcp

An MCP server that can extract text from PDFs and images. It supports:

  • Native text extraction from PDFs (embedded text)
  • OCR for scanned PDFs and images using Tesseract

Requirements

  • Node.js >= 18.18
  • For best OCR on PDFs, install Poppler (for pdftoppm) and Tesseract (CLI):

macOS

brew install poppler tesseract

Windows

  1. Tesseract: Download and install from GitHub releases or use chocolatey:
    choco install tesseract
    
  2. Poppler: Download from poppler-windows and add to PATH, or use chocolatey:
    choco install poppler
    

Linux (Ubuntu/Debian)

sudo apt-get install poppler-utils tesseract-ocr

Note: Poppler is optional; without it we fall back to slower whole-file OCR. Tesseract CLI is preferred for OCR; if it is not installed, the server falls back to tesseract.js (which downloads language data on first run).

Install

npm install

Build

npm run build

Run (stdio)

npm start

This server uses stdio transport per MCP. Configure your MCP client with the command node dist/index.js in this folder.

Example MCP client configuration (JSON):

{
	"mcpServers": {
		"pdf-reader-mcp": {
			"command": "node",
			"args": ["dist/index.js"],
			"env": { }
		}
	}
}

Tool: extract_text_from_pdf

Inputs (one of):

  • path: Path to a local PDF or image file
  • base64: Base64-encoded content (optionally with data URL prefix)

Options:

  • ocr (boolean, default false): Force using OCR. When false, the server tries native extraction then falls back to OCR if needed.

Output: The extracted text as a single text item.

Server Config

{
  "mcpServers": {
    "pdf-reader-mcp": {
      "command": "node",
      "args": [
        "dist/index.js"
      ],
      "env": {}
    }
  }
}
Project Info
Created At
10 months ago
Updated At
10 months ago
Author Name
sunderbharath85
Star
-
Language
-
License
-
Category
Tags

Recommend Servers

View All
Mnemom

15 hours ago
//beforeyouship — LLM Cost Modeling From Your Editor
@Indiegoing

Query realistic LLM cost models without leaving your editor. beforeyouship models the **true monthly cost** of an LLM app architecture — retries, prompt caching, batch discounts, infra overhead, and 3×/10× growth — across GPT-5.x, Claude, Gemini, DeepSeek, and more. Not a token calculator: a planning tool for the design phase, before you commit to a stack. **No API key needed to try it** — demo mode covers the six free-tier models. A Pro key from [beforeyouship.dev](https://beforeyouship.dev) unlocks the full 18-model catalog. ## What you can ask - "How much will a RAG chatbot cost at 10,000 requests/day?" - "Compare Claude Haiku vs Gemini Flash pricing for my workload" - "What's the cheapest model for a multi-step agent at scale?" - "Show me current per-token prices for Anthropic models" ## Tools ### `estimate_cost` Full cost model for an architecture at a given usage level. Returns Naive / Realistic / Worst Case monthly cost per model, 3×/10× growth scenarios, and an opinionated recommendation with reasoning. ### `get_model_prices` Current per-1M-token pricing — input, output, cached input, batch — with context windows and staleness metadata. ### `list_archetypes` Seven preset architecture patterns (simple chatbot, chatbot with history, RAG pipeline, multi-model router, coding assistant, document processor, multi-step agent) used as starting points for estimates. ## Setup **Claude Code:** ​```bash claude mcp add --transport http beforeyouship https://beforeyouship.dev/api/mcp ​``` **Cursor / other clients** — add a remote server: ​```json { "mcpServers": { "beforeyouship": { "type": "streamable-http", "url": "https://beforeyouship.dev/api/mcp" } } } ​``` Add an `Authorization: Bearer bys_...` header with a Pro key for the full catalog. ## Try it > Estimate the monthly cost of a RAG pipeline at 10,000 requests/day

13 hours ago
Docwand

13 hours ago