Peekaboo MCP – lightning-fast macOS screenshots for AI agents

Created By

steipetea year ago

## What Peekaboo Can Do Peekaboo provides three main tools that give AI agents visual capabilities: - **`image`** - Capture screenshots of screens or specific applications - **`analyze`** - Ask AI questions about captured images using vision models - **`list`** - Enumerate available screens and windows for targeted captures Each tool is designed to be powerful and flexible. The most powerful feature is visual question answering - agents can ask questions about screenshots like "What do you see in this window?" or "Is the submit button visible?" and get accurate answers. This saves context space since asking specific questions is much more efficient than returning raw image data. Peekaboo supports both cloud and local vision models, letting you choose between accuracy and privacy.

# screenshots

# analyze

Overview Content Tools Comments

Overview

My MCP Ecosystem

Peekaboo is part of a growing collection of MCP servers I'm building:

claude-code-mcp - Integrates Claude Code into Cursor for task offloading
macos-automator-mcp - Run AppleScript and JXA on macOS
Terminator - External terminal so agents don't get stuck on long-running commands

Each serves a specific purpose in building autonomous AI workflows.

Technical Architecture

Peekaboo combines TypeScript and Swift for the best of both worlds. TypeScript provides excellent MCP support and easy distribution via npm, while Swift enables direct access to Apple's ScreenCaptureKit for capturing windows without focus changes.

My initial AppleScript prototype had a fatal flaw: it required focus changes to capture windows. The Swift rewrite uses ScreenCaptureKit to access the window manager directly - no focus changes, no user disruption.

The system uses a Swift CLI that communicates with a Node.js MCP server, supporting both local models and cloud providers with automatic fallback. Built with Swift 6 and the new Swift Testing framework (now that I have experience with it!), Peekaboo delivers fast, non-intrusive screenshot capture with intelligent window matching.

For detailed testing instructions using the MCP Inspector, see the Peekaboo README.

The Vision: Autonomous Agent Debugging

Peekaboo is like one puzzle piece in a larger set of MCPs I'm building to help agents stay in the loop. The goal is simple: if an agent can answer questions by itself, you don't have to intervene and it can simply continue and debug itself. This is the holy grail for building applications with CI - you want to do everything so the agent can loop and work until what you want is done.

When your build fails, when your UI doesn't look right, when something breaks - instead of stopping and asking you "what do you see?", the agent can take a screenshot, analyze it, and continue fixing the problem autonomously. That's the power of giving agents their eyes.

👻 Peekaboo MCP is available now - ⭐ the repo if this saves you a debug session!

Try in Playground

Server Config

{
  "mcpServers": {
    "peekaboo": {
      "command": "npx",
      "args": [
        "-y",
        "@steipete/peekaboo-mcp"
      ],
      "env": {
        "PEEKABOO_AI_PROVIDERS": "ollama/llava:latest"
      }
    }
  }
}

Project Info

Created At

a year ago

Updated At

a year ago

Author Name

steipete

Star

Language

License

Recommend Servers

View All

Serper MCP Server

@garymengcom

A Serper MCP Server

Python

a year ago

Brave Search

@modelcontextprotocol

Web and local search using Brave's Search API

a year ago

Sentry

@modelcontextprotocol

Retrieving and analyzing issues from Sentry.io

a year ago

Costwright Mcp

@hernaninverso

Static worst-case token-budget analysis of LLM-agent workflows (LangGraph / CrewAI / OpenAI-Agents) without running the code, plus signed budget certificates.

10 hours ago

PostgreSQL

@modelcontextprotocol

Read-only database access with schema inspection

a year ago

Yandex Direct Manager

@marketscore

ИИ-коннектор для Яндекс Директа: подключите Claude или ChatGPT к рекламному кабинету и управляйте кампаниями прямо в диалоге — статистика, поиск неэффективных расходов, ставки, бюджеты, минус-слова, аудит. AI/MCP connector for Yandex Direct: connect Claude or ChatGPT to your ad account and manage campaigns from chat — analytics, wasted-spend detection, bids, budgets, negative keywords, audit.

8 hours ago

Tavily Mcp

@tavily-ai

JavaScript

a year ago

EverArt

@modelcontextprotocol

AI image generation using various models

a year ago

Fetch

@test

Web content fetching and conversion for efficient LLM usage

9 months ago

MCP Server for Milvus

@zilliztech

The Milvus MCP server enables AI applications to interact with Milvus vector databases using natural language commands. It allows AI models to perform vector searches, manage collections, and retrieve data without writing custom database queries. This integration facilitates seamless access to vector data, enhancing the capabilities of AI tools like Claude Desktop and Cursor.

a year ago

GBOX Android MCP

@babelcloud

GBOX provides environments for AI Agents to operate computer and mobile devices. Mobile Scenario: Your agents can use GBOX to develop/test android apps, or run apps on the Android to complete various tasks(mobile automation). Desktop Scenario: Your agents can use GBOX to operate desktop apps such as browser, terminal, VSCode, etc(desktop automation). MCP: You can also plug GBOX MCP to any Agent you like, such as Cursor, Claude Code. These agents will instantly get the ability to operate computer and mobile devices.

10 months ago

Framelink Figma MCP Server

@GLips

MCP server to provide Figma layout information to AI coding agents like Cursor

TypeScript

a year ago

Mcp Server Chatsum

@chatmcp

summarize chat message

typescript

a year ago

Bucket Feature Flags MCP Server

@bucketco

Flag features directly from chat in your code editor, including VS Code, Cursor, Windsurf, Claude Code—any IDE with MCP support.

a year ago

Flowcast

@Orkhan Jafarov

Turn your AI coding agent into a producer of interactive, narrated walkthroughs — code, whiteboard, and 3D casts, each a single self-contained HTML file that opens in any browser. Runs locally over npx.

3 hours ago

Google Maps

@modelcontextprotocol

Location services, directions, and place details

a year ago

Github

@modelcontextprotocol

Repository management, file operations, and GitHub API integration

a year ago

Apollo.io

@apolloio

Apollo connects your sales data and outreach tools directly to any MCP client. Ask it to find leads, look up a company, write and send emails, or check how a campaign is performing. All from a conversation, without opening another tab. Here's what you can do: - Find the right people: Search Apollo's database of 240M+ verified contacts by any criteria: job title, industry, company size, recent funding, time in role, and more. - Get the full picture: Pull detailed contact and company info including phone numbers, buying signals, and tech stack, so you always have what you need before reaching out. - Build sequences: Write and send personalized emails, create sequences, and enroll contacts right from the conversation. Apollo keeps track of who's already been contacted so nothing gets duplicated. - See what's working: Check email opens, replies, and call activity. Use what you learn to improve the next campaign automatically. Apollo is free to try. Connect once and start prospecting from any MCP client.

6 hours ago

Stocklake - Stock & Market Intelligence

@Michael

Stock data, AI market intelligence, and portfolio tools for 1000+ stocks. Tools include: prices + fundamentals, RSI/MACD/Bollinger/SMA/EMA indicators, AI sentiment analysis, insider trading activity, institutional holdings, earnings calendar, sector intelligence (LEADING/STRONG/NEUTRAL/WEAK/LAGGING), market outlook (BULLISH/NEUTRAL/BEARISH), stock news with AI flag scores, discovery ideas, screener, and more. Free tier: 200 calls/day, 10 non-AI tools. Pro tier: 5,000 calls/day, all 20 tools including AI summaries, sentiment, and market intelligence. Get your API key at https://stocklake.dev/register (instant, no credit card for free tier).

9 hours ago

QuantumScan PQC Scanner

@gaiabio12-design

Post-quantum cryptography (PQC) security scanner for blockchain and AI agents. Detects quantum-vulnerable algorithms (ECDSA, RSA, DH) and provides NIST FIPS 203/204/205 migration paths. Free, open-source, privacy-first. Scan any GitHub repository or smart contract via MCP tools: scan_repository, check_pqc_risk, scan_contract. Used by autonomous AI agents for automated security audits.

7 hours ago

MiniMax MCP

@MiniMax-AI

Official MiniMax Model Context Protocol (MCP) server that enables interaction with powerful Text to Speech, image generation and video generation APIs.

Python

a year ago

Raster

@Raster

Browse, search, upload, tag, transfer, and delete images in your Raster libraries over MCP. Raster is a photo manager for modern teams — connect over a remote Streamable HTTP endpoint with OAuth 2.1 sign-in, scoped to the libraries your account can see.

11 hours ago

Zfuzz

@Zfuzz-dev

Real security scanners for AI coding agents: SAST (441 rules), secret detection (419+ patterns), dependency CVEs, MCP/skill vetting, MITRE ATT&CK. Real scanners, not the model guessing. Rust, Apache-2.0, free.

17 hours ago

Bitcoin Monetary Data

@Team23gm

Honest monetary data for AI agents. Compare purchasing power erosion over time, reveal the gap between CPI and real M2 money supply growth, and show Bitcoin vs dollar performance. No spin. Live SSE endpoint at http://80.91.65.91:8082/sse

a day ago

Test

@modelcontextprotocol

test

7 months ago

MCP Advisor

@istarwyh

MCP Advisor & Installation - Use the right MCP server for your needs

TypeScript

a year ago

Slack

@modelcontextprotocol

Channel management and messaging capabilities

a year ago

Delegum

@meskeIA

Servidor MCP con 42 herramientas de fiscalidad, derecho laboral y finanzas de España: IRPF, autónomos, nóminas, despidos, herencias, pensiones e hipotecas. Cálculos con normativa española. Sin registro, sin API key.

a day ago

Aws Kb Retrieval Server

@modelcontextprotocol

An MCP server implementation for retrieving information from the AWS Knowledge Base using the Bedrock Agent Runtime.

a year ago

Version Pill

@versionpill

2 days ago