Simple Document Processing MCP Server

Created By
cablatea year ago
MCP server that provides doc forge capabilities
Overview

What is MCP Doc Forge?

MCP Doc Forge is a powerful Model Context Protocol (MCP) server that provides comprehensive document processing capabilities, allowing users to read, convert, and manipulate various document formats.

How to use MCP Doc Forge?

To use MCP Doc Forge, you can install it via Smithery or manually using npm. After installation, you can run it through the command line interface (CLI) or integrate it with Dive Desktop by adding it as an MCP server.

Key features of MCP Doc Forge?

  • Document Reader: Supports reading DOCX, PDF, TXT, HTML, and CSV formats.
  • Document Conversion: Allows conversion between DOCX to HTML/PDF, HTML to TXT/Markdown, and PDF manipulation (merge, split).
  • Text Processing: Includes multi-encoding transfer support, text formatting, cleaning, comparison, and splitting.
  • HTML Processing: Offers HTML cleaning, formatting, resource extraction, and structure-preserving conversion.

Use cases of MCP Doc Forge?

  1. Converting documents for web publishing.
  2. Extracting text and resources from various document formats.
  3. Automating document processing workflows in applications.

FAQ from MCP Doc Forge?

  • What document formats does MCP Doc Forge support?

MCP Doc Forge supports DOCX, PDF, TXT, HTML, and CSV formats.

  • Is MCP Doc Forge free to use?

Yes! MCP Doc Forge is open-source and free to use under the MIT license.

  • How can I contribute to MCP Doc Forge?

You can contribute by reporting issues, suggesting features, or submitting pull requests on GitHub.

Project Info
Created At
a year ago
Updated At
a year ago
Author Name
cablate
Star
11
Language
TypeScript
License
MIT license

Recommend Servers

View All
Wpnews

18 hours ago
Linkpulse

2 days ago
//beforeyouship — LLM Cost Modeling From Your Editor
@Indiegoing

Query realistic LLM cost models without leaving your editor. beforeyouship models the **true monthly cost** of an LLM app architecture — retries, prompt caching, batch discounts, infra overhead, and 3×/10× growth — across GPT-5.x, Claude, Gemini, DeepSeek, and more. Not a token calculator: a planning tool for the design phase, before you commit to a stack. **No API key needed to try it** — demo mode covers the six free-tier models. A Pro key from [beforeyouship.dev](https://beforeyouship.dev) unlocks the full 18-model catalog. ## What you can ask - "How much will a RAG chatbot cost at 10,000 requests/day?" - "Compare Claude Haiku vs Gemini Flash pricing for my workload" - "What's the cheapest model for a multi-step agent at scale?" - "Show me current per-token prices for Anthropic models" ## Tools ### `estimate_cost` Full cost model for an architecture at a given usage level. Returns Naive / Realistic / Worst Case monthly cost per model, 3×/10× growth scenarios, and an opinionated recommendation with reasoning. ### `get_model_prices` Current per-1M-token pricing — input, output, cached input, batch — with context windows and staleness metadata. ### `list_archetypes` Seven preset architecture patterns (simple chatbot, chatbot with history, RAG pipeline, multi-model router, coding assistant, document processor, multi-step agent) used as starting points for estimates. ## Setup **Claude Code:** ​```bash claude mcp add --transport http beforeyouship https://beforeyouship.dev/api/mcp ​``` **Cursor / other clients** — add a remote server: ​```json { "mcpServers": { "beforeyouship": { "type": "streamable-http", "url": "https://beforeyouship.dev/api/mcp" } } } ​``` Add an `Authorization: Bearer bys_...` header with a Pro key for the full catalog. ## Try it > Estimate the monthly cost of a RAG pipeline at 10,000 requests/day

a day ago
Shippo
@Shippo

2 days ago
Orkestr

19 hours ago