omniparser-autogui-mcp

Created By
NON906a year ago
Automatic operation of on-screen GUI.
Overview

What is omniparser-autogui-mcp?

Omniparser-autogui-mcp is a tool that automates the operation of on-screen graphical user interfaces (GUIs) by analyzing the screen using the OmniParser framework.

How to use omniparser-autogui-mcp?

To use omniparser-autogui-mcp, clone the repository, install the necessary models, and configure the settings in the 'claude_desktop_config.json' file to specify the command and environment variables.

Key features of omniparser-autogui-mcp?

  • Automatic GUI operation based on screen analysis.
  • Compatibility with Windows.
  • Customizable settings for different environments and use cases.

Use cases of omniparser-autogui-mcp?

  1. Automating repetitive tasks in software applications.
  2. Enhancing accessibility for users with disabilities.
  3. Streamlining workflows by automating GUI interactions.

FAQ from omniparser-autogui-mcp?

  • Is omniparser-autogui-mcp compatible with other operating systems?

Currently, it is confirmed to work on Windows, but other systems may require additional configurations.

  • What is the license for omniparser-autogui-mcp?

It is under the MIT license, excluding submodules and packages which may have different licenses.

Project Info
Created At
a year ago
Updated At
a year ago
Author Name
NON906
Star
40
Language
Python
License
MIT license
Tags

Recommend Servers

View All
//beforeyouship — LLM Cost Modeling From Your Editor
@Indiegoing

Query realistic LLM cost models without leaving your editor. beforeyouship models the **true monthly cost** of an LLM app architecture — retries, prompt caching, batch discounts, infra overhead, and 3×/10× growth — across GPT-5.x, Claude, Gemini, DeepSeek, and more. Not a token calculator: a planning tool for the design phase, before you commit to a stack. **No API key needed to try it** — demo mode covers the six free-tier models. A Pro key from [beforeyouship.dev](https://beforeyouship.dev) unlocks the full 18-model catalog. ## What you can ask - "How much will a RAG chatbot cost at 10,000 requests/day?" - "Compare Claude Haiku vs Gemini Flash pricing for my workload" - "What's the cheapest model for a multi-step agent at scale?" - "Show me current per-token prices for Anthropic models" ## Tools ### `estimate_cost` Full cost model for an architecture at a given usage level. Returns Naive / Realistic / Worst Case monthly cost per model, 3×/10× growth scenarios, and an opinionated recommendation with reasoning. ### `get_model_prices` Current per-1M-token pricing — input, output, cached input, batch — with context windows and staleness metadata. ### `list_archetypes` Seven preset architecture patterns (simple chatbot, chatbot with history, RAG pipeline, multi-model router, coding assistant, document processor, multi-step agent) used as starting points for estimates. ## Setup **Claude Code:** ​```bash claude mcp add --transport http beforeyouship https://beforeyouship.dev/api/mcp ​``` **Cursor / other clients** — add a remote server: ​```json { "mcpServers": { "beforeyouship": { "type": "streamable-http", "url": "https://beforeyouship.dev/api/mcp" } } } ​``` Add an `Authorization: Bearer bys_...` header with a Pro key for the full catalog. ## Try it > Estimate the monthly cost of a RAG pipeline at 10,000 requests/day

19 hours ago