mcp-ollama

Created by ~alter, 3 months ago

MCP server wrapping local Ollama for offload from API-priced orchestrators. Nine stdio tools: generation, summarisation, analysis, drafting, code tasks, diff-driven tasks, transforms, model management.

MCP server wrapping local Ollama models for offload from API-priced orchestrators.

Exposes nine tools that pass work to a local model (text generation, summarisation, code tasks, mechanical transforms, commit/PR/changelog drafting). The orchestrator decides what to route locally; this server does the routing.

Transport: stdio
Runtime: Node 18+
Default model: hermes3:8b (override via OLLAMA_MODEL)
Ollama host: http://localhost:11434 (override via OLLAMA_HOST)
Ships no model weights, no cloud call-outs, no telemetry. Every request stays on the host where Ollama is running.
License: Apache-2.0

Why

Token-priced orchestrators (Claude Code, Cursor, the Anthropic API, Cline, Aider) charge for every classification, every docstring, every commit message. Most of that work doesn't need a frontier model. Routed to Ollama on the same machine, the same work incurs no per-token cost and, for short tasks on adequate hardware, can come back faster because it skips the network round-trip. mcp-ollama is the routing surface.

The orchestrating model decides what to route where. This server is plumbing — it does not try to be clever about task classification. Pick the right tool, pass the text, get a result back.

Install

From source

git clone https://github.com/true-alter/mcp-ollama.git
cd mcp-ollama
npm install
npm run build

You also need a running Ollama instance with at least one model pulled:

# Default — 8B, fast, good for classifications and short generations
ollama pull hermes3:8b

# Optional — code-specialised, heavier, better for local_code tasks
ollama pull qwen2.5-coder:32b

Docker

docker build -t mcp-ollama .
docker run -i --rm \
  -e OLLAMA_HOST=http://host.docker.internal:11434 \
  -e OLLAMA_MODEL=hermes3:8b \
  mcp-ollama

The supplied Dockerfile points at host.docker.internal:11434 so the container reaches Ollama on the host.

Run (stdio)

node dist/index.js

Stdio servers are launched by the MCP client (Claude Code, Cursor, etc.) — running it directly is only useful for debugging.

Configure Claude Code

claude mcp add --transport stdio ollama -- node /absolute/path/to/mcp-ollama/dist/index.js

Or in ~/.claude/settings.json:

{
  "mcpServers": {
    "ollama": {
      "transport": "stdio",
      "command": "node",
      "args": ["/absolute/path/to/mcp-ollama/dist/index.js"],
      "env": {
        "OLLAMA_HOST": "http://localhost:11434",
        "OLLAMA_MODEL": "hermes3:8b"
      }
    }
  }
}

Tools

Tool	Purpose
`local_generate`	General-purpose generation with system + user prompt
`local_summarize`	Summarise a blob of text
`local_analyze`	Analyse text against a specific question
`local_draft`	Draft content in a given style
`local_code`	Code tasks: docstring / test / explain / review / types / refactor-suggest
`local_diff`	Diff-driven tasks: commit-message / pr-description / changelog / summary / impact
`local_transform`	Mechanical code transformations
`local_models`	List models available on the local Ollama host
`local_pull`	Pull a model onto the local Ollama host

Full tool schemas are exposed over MCP introspection — any MCP-aware client will enumerate them automatically.
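
For debugging over stdio by hand, it can help to see what the introspection and invocation messages look like. The following is a minimal sketch of the JSON-RPC 2.0 requests MCP uses for `tools/list` and `tools/call`, framed as newline-delimited JSON for the stdio transport. A real client sends an `initialize` handshake first. The `text` argument name is an assumption for illustration; the authoritative argument names are whatever `tools/list` reports.

```typescript
// JSON-RPC 2.0 request shape used by MCP for tool discovery and invocation.
interface JsonRpcRequest {
  jsonrpc: "2.0";
  id: number;
  method: string;
  params?: Record<string, unknown>;
}

// Ask the server to enumerate its tools and their input schemas.
function listToolsRequest(id: number): JsonRpcRequest {
  return { jsonrpc: "2.0", id, method: "tools/list" };
}

// Invoke one tool by name with a JSON arguments object.
function callToolRequest(
  id: number,
  name: string,
  args: Record<string, unknown>,
): JsonRpcRequest {
  return { jsonrpc: "2.0", id, method: "tools/call", params: { name, arguments: args } };
}

// Stdio transport: one JSON message per line, with no embedded newlines.
function frame(request: JsonRpcRequest): string {
  return JSON.stringify(request) + "\n";
}

// Hypothetical call: `text` is an assumed argument name, `model` is the
// per-call override described in this README.
const exampleCall = callToolRequest(2, "local_summarize", {
  text: "Long release notes to condense",
  model: "hermes3:8b",
});
```

Writing `frame(listToolsRequest(1))` to the server's stdin after initialization should produce a response listing the nine tools.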

Environment variables

Variable	Default	Purpose
`OLLAMA_HOST`	`http://localhost:11434`	Ollama HTTP endpoint
`OLLAMA_MODEL`	`hermes3:8b`	Default model when a tool call omits `model`

Any tool call may override model explicitly — the env default only applies when unset. local_code tends to work better with a code-specialised model passed per-call, while local_summarize and local_draft are fine on the default.
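
The precedence described above can be sketched as two small functions. This is an illustration of the documented behaviour, not the server's actual source; the function names and the `Env` type are hypothetical.

```typescript
// Environment passed in explicitly so the logic is easy to test.
type Env = Record<string, string | undefined>;

const DEFAULT_HOST = "http://localhost:11434";
const DEFAULT_MODEL = "hermes3:8b";

// Resolve the Ollama endpoint: the env value if set and non-empty, else the default.
function resolveHost(env: Env): string {
  return env["OLLAMA_HOST"] || DEFAULT_HOST;
}

// Precedence: explicit per-call model, then OLLAMA_MODEL, then the built-in default.
function resolveModel(callModel: string | undefined, env: Env): string {
  return callModel || env["OLLAMA_MODEL"] || DEFAULT_MODEL;
}
```

In a Node process this would be called as `resolveModel(args.model, process.env)`, where `args` stands for the tool call's arguments.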

Model selection guidance

Workload	Recommended model	Rationale
Classification, one-liners, tags	`hermes3:8b`	Fastest round-trip, cheap to run
Commit messages, changelogs, summaries	`qwen2.5:14b-instruct`	Higher quality, still comfortable on a 16 GB GPU
Code review, docstrings, tests	`qwen2.5-coder:32b`	Code-specialised
Fallback / unknown model	whatever `local_models` returns	Inspect first, then route

Use local_models at session start if you're unsure what's available on a host.
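
On the orchestrator side, "inspect first, then route" might look like the sketch below. It assumes the result of `local_models` has already been parsed into a plain array of model names (the tool's exact output format is not specified here), and falling back to the first available model is one possible policy, not something the server enforces.

```typescript
// Pick the first candidate that is present on the host. If none is present,
// fall back to whatever the host has; undefined means the host has no models.
function pickModel(candidates: string[], available: string[]): string | undefined {
  for (const name of candidates) {
    if (available.indexOf(name) !== -1) return name;
  }
  return available.length > 0 ? available[0] : undefined;
}

// Code review prefers the code-specialised model, then the default.
const onHost = ["hermes3:8b", "qwen2.5-coder:32b"];
const forReview = pickModel(["qwen2.5-coder:32b", "hermes3:8b"], onHost);
```

Passing the chosen name as `model` in the tool call keeps the decision with the orchestrator, which matches the division of labour described earlier.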

Troubleshooting

Ollama error 404 when calling a tool. The model isn't pulled. Run ollama pull <name> or call local_pull from the client.

fetch failed / connection refused. Ollama isn't running, or OLLAMA_HOST points somewhere wrong. Verify with curl $OLLAMA_HOST/api/tags. Inside a container, localhost is the container itself. Use host.docker.internal on macOS/Windows; on Linux, either add --add-host=host.docker.internal:host-gateway to docker run or use the bridge IP.
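
The same `/api/tags` endpoint answers both questions above: whether Ollama is reachable and whether a model is pulled. A minimal sketch of the second check, operating on the parsed response body, follows. The interface covers only the fields used here, and the tag handling is meant for simple library names such as `hermes3:8b`.

```typescript
// Subset of the body returned by Ollama's GET /api/tags.
interface TagsResponse {
  models: { name: string }[];
}

// Ollama treats a model name without a tag as ":latest".
function withTag(model: string): string {
  return model.indexOf(":") !== -1 ? model : model + ":latest";
}

// True if the requested model appears in the host's list of pulled models.
function isModelPulled(tags: TagsResponse, model: string): boolean {
  const wanted = withTag(model);
  return tags.models.some((m) => withTag(m.name) === wanted);
}
```

With Node 18+ the body can be obtained via the built-in `fetch` against `OLLAMA_HOST + "/api/tags"`; a rejected `fetch` corresponds to the connection-refused case, and a `false` result here corresponds to the 404 case.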

Tool calls feel slow. The first call to a cold model pays the cost of loading it into memory; subsequent calls while the model stays loaded are much faster. If the model is larger than available VRAM, Ollama runs some or all of it on the CPU. Check the PROCESSOR column of ollama ps to confirm.
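
One way to hide the cold-start cost is to preload the model before the first tool call. Ollama's HTTP API loads a model when `/api/generate` receives a request with no prompt, and `keep_alive` controls how long it stays in memory afterwards. The sketch below only builds that request body; it is something an operator could send directly to Ollama, not a feature of mcp-ollama, and the helper name is hypothetical.

```typescript
// Body for POST /api/generate that loads a model without generating text.
interface WarmupBody {
  model: string;
  keep_alive: string;
}

// keepAlive is a duration string such as "10m"; Ollama's own default is 5 minutes.
function warmupBody(model: string, keepAlive: string = "10m"): WarmupBody {
  return { model, keep_alive: keepAlive };
}

const warm = JSON.stringify(warmupBody("hermes3:8b"));
```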

Empty or truncated output. max_tokens defaults to 2048 per tool. For long generations, pass max_tokens explicitly in the tool call.
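
For reference, Ollama's own name for the output limit is `options.num_predict`. The sketch below shows how a `max_tokens` value could map onto a non-streaming `/api/generate` body, and how truncation can be detected from the response. Whether mcp-ollama uses `/api/generate` or `/api/chat` internally is an assumption here, and `done_reason` is only present on reasonably recent Ollama versions.

```typescript
// Non-streaming request body for Ollama's POST /api/generate.
interface GenerateBody {
  model: string;
  prompt: string;
  system?: string;
  stream: false;
  options: { num_predict: number };
}

// Subset of the non-streaming response.
interface GenerateResult {
  response: string;
  done: boolean;
  done_reason?: string;
}

const DEFAULT_MAX_TOKENS = 2048;

// Map a tool-level max_tokens onto Ollama's num_predict option.
function buildGenerateBody(
  model: string,
  prompt: string,
  maxTokens: number = DEFAULT_MAX_TOKENS,
  system?: string,
): GenerateBody {
  const body: GenerateBody = {
    model,
    prompt,
    stream: false,
    options: { num_predict: maxTokens },
  };
  if (system !== undefined) body.system = system;
  return body;
}

// "length" means generation stopped at the token limit rather than naturally.
function wasTruncated(result: GenerateResult): boolean {
  return result.done_reason === "length";
}
```

If `wasTruncated` returns true for a long generation, retrying with a larger `max_tokens` is the obvious next step.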

Security posture

mcp-ollama makes no network call of its own beyond the configured OLLAMA_HOST. It ships no telemetry, no analytics, no auto-update pinger. Tool inputs are forwarded to Ollama's HTTP API verbatim and the response is relayed back; the server itself is stateless between calls.

If you run Ollama on localhost (the default) the entire loop stays on the host. If you point OLLAMA_HOST at a remote endpoint, treat that endpoint's security posture as authoritative — a typo sending prompts to a third-party host is trivially possible.
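
A cheap guard against the typo scenario is to check, before launching the server, whether the configured host is actually a loopback address. The sketch below is a hypothetical pre-flight check an operator could add to a wrapper script; mcp-ollama itself is not stated to perform it.

```typescript
// True when the URL points at the local machine (localhost, 127.0.0.0/8, or ::1).
function isLoopbackHost(ollamaHost: string): boolean {
  // new URL throws a TypeError on strings it cannot parse at all.
  const hostname = new URL(ollamaHost).hostname;
  return (
    hostname === "localhost" ||
    hostname === "[::1]" || // WHATWG URL keeps the brackets on IPv6 hostnames
    /^127\.\d{1,3}\.\d{1,3}\.\d{1,3}$/.test(hostname)
  );
}
```

A wrapper could log a warning when this returns false, so that prompts leaving the host are a deliberate choice. Note that `host.docker.internal` is reported as non-loopback, which is accurate from inside the container.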

To report a security issue, see SECURITY.md.

Contributing

Bug reports and small patches welcome — see CONTRIBUTING.md. Larger design changes: please open an issue first so we can talk about scope before you invest time.

Part of ALTER

mcp-ollama is maintained by ALTER as part of the identity infrastructure for the AI economy. The ALTER identity MCP server is hosted at mcp.truealter.com — see @truealter/sdk for the TypeScript client.

