PyMCPAutoGUI 🖱️⌨️🖼️ - GUI Automation via MCP

Created By

kitfactory, a year ago

GUI manipulation MCP server

Overview

What is PyMCPAutoGUI?

PyMCPAutoGUI is a GUI automation tool that allows AI agents to interact with desktop applications by controlling the mouse and keyboard, enabling them to perform tasks just like a human user.

How do you use PyMCPAutoGUI?

To use PyMCPAutoGUI, install it in a virtual environment, start the MCP server, and connect an MCP-compatible client such as Cursor to it.
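The client side of that setup can be sketched as a small script that writes out a server entry. This is a minimal sketch under stated assumptions: it assumes the client reads an `mcpServers`-style JSON entry and launches the server as a subprocess, and the module path `pymcpautogui.server` is a hypothetical placeholder, not a command confirmed by this page. Check the project README for the actual launch command and transport.

```python
import json
import sys


def build_client_config(server_name: str, command: str, args: list[str]) -> dict:
    """Return an `mcpServers`-style entry for a client that launches a local server."""
    return {"mcpServers": {server_name: {"command": command, "args": args}}}


# Hypothetical launch command: the real module or script name is not stated here.
config = build_client_config(
    "pymcpautogui",
    sys.executable,  # interpreter of the currently active virtual environment
    ["-m", "pymcpautogui.server"],  # assumed module path, verify against the README
)

print(json.dumps(config, indent=2))
```

Using `sys.executable` keeps the entry pointed at the virtual environment the script runs in, so the client starts the server with the same interpreter that has the package installed.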

What are the key features of PyMCPAutoGUI?

Direct interaction with desktop applications by AI agents.
Simple integration with MCP-compatible clients.
Comprehensive control over GUI elements using PyAutoGUI and PyGetWindow.
Tools for taking screenshots and locating images on the screen.
Window management capabilities including resizing and state control.
User interaction through alert and prompt boxes.
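To make the mouse, keyboard, and screenshot features above concrete, here is a rough sketch of the kind of PyAutoGUI calls such tools build on. It is not PyMCPAutoGUI's own code: the helper names `click_and_type` and `capture_screen` and the `RecordingBackend` class are hypothetical, introduced only for illustration. The helpers take the GUI backend as a parameter, so the demo runs against a recording stand-in and never moves the real mouse.

```python
class RecordingBackend:
    """Hypothetical stand-in for the pyautogui module: records calls, touches nothing."""

    def __init__(self):
        self.calls = []

    def __getattr__(self, name):
        # Any unknown attribute becomes a function that logs its own invocation.
        def record(*args, **kwargs):
            self.calls.append((name, args, kwargs))

        return record


def click_and_type(gui, x, y, text):
    """Move to (x, y), click, then type text, using PyAutoGUI-style calls."""
    gui.moveTo(x, y, duration=0.2)
    gui.click()
    gui.write(text, interval=0.05)


def capture_screen(gui, path):
    """Save a screenshot of the whole screen to path."""
    gui.screenshot(path)


# Dry run against the recorder. Pass the real `pyautogui` module instead
# (after `import pyautogui`) to drive the actual desktop.
demo = RecordingBackend()
click_and_type(demo, 200, 300, "hello")
capture_screen(demo, "screen.png")
print([name for name, _, _ in demo.calls])
```

Passing the backend in explicitly is a design choice for the sketch: it keeps the example runnable on machines without a display and makes the sequence of GUI actions easy to inspect.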

What are the use cases of PyMCPAutoGUI?

Automating repetitive GUI tasks.
Testing desktop applications.
Building AI assistants that can carry out multi-step desktop workflows.

PyMCPAutoGUI FAQ

What operating systems does PyMCPAutoGUI support?

It supports Windows, macOS, and Linux.

Is it easy to integrate with existing projects?

Yes! It is designed for simple integration with MCP-compatible clients like Cursor.

What Python version is required?

Python 3.11 or higher is required.

Project Info

Created At

a year ago

Updated At

a year ago

Author Name

kitfactory

Language

Python

License

MIT license
