omniparser-autogui-mcp

Created By

NON906, a year ago

Automatic operation of the on-screen GUI.

Overview

What is omniparser-autogui-mcp?

Omniparser-autogui-mcp is a tool that automates the operation of on-screen graphical user interfaces (GUIs) by analyzing the screen using the OmniParser framework.

How to use omniparser-autogui-mcp?

To use omniparser-autogui-mcp, clone the repository, install the necessary models, and configure the settings in the 'claude_desktop_config.json' file to specify the command and environment variables.
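As a rough sketch of the configuration step, the snippet below builds an entry of the kind 'claude_desktop_config.json' expects under its "mcpServers" key (a command, its arguments, and environment variables). The server name, the `uv` launcher, the path, and the environment variable shown here are placeholder assumptions for illustration, not values taken from the project; consult the repository's README for the real ones.

```python
import json


def add_mcp_server(config, name, command, args, env):
    """Return a copy of `config` with one server entry added under "mcpServers"."""
    servers = dict(config.get("mcpServers", {}))
    servers[name] = {"command": command, "args": list(args), "env": dict(env)}
    return {**config, "mcpServers": servers}


# All values below are hypothetical placeholders, not the project's documented settings.
existing_config = {}
config = add_mcp_server(
    existing_config,
    name="omniparser_autogui_mcp",              # assumed server name
    command="uv",                               # assumed launcher
    args=[
        "--directory",
        "C:\\path\\to\\omniparser-autogui-mcp",  # placeholder clone location
        "run",
        "omniparser-autogui-mcp",
    ],
    env={"EXAMPLE_SETTING": "value"},           # placeholder environment variable
)

# Print the JSON that would go into claude_desktop_config.json.
print(json.dumps(config, indent=2))
```

The helper copies the existing configuration rather than mutating it, so other servers already listed under "mcpServers" would be preserved when the result is written back to the file.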

Key features of omniparser-autogui-mcp

Automatic GUI operation based on screen analysis.
Compatibility with Windows.
Customizable settings for different environments and use cases.

Use cases of omniparser-autogui-mcp

Automating repetitive tasks in software applications.
Enhancing accessibility for users with disabilities.
Streamlining workflows by automating GUI interactions.

FAQ about omniparser-autogui-mcp

Is omniparser-autogui-mcp compatible with other operating systems?

It is currently confirmed to work on Windows; other operating systems may require additional configuration.

What is the license for omniparser-autogui-mcp?

It is released under the MIT license, except for submodules and packages, which may carry different licenses.

Project Info

Created At

a year ago

Updated At

a year ago

Author Name

NON906

Language

Python

License

MIT license
