UniversalBench: run code, web, databases, and any LLM from your AI, safely

Created By
UniversalBench12 hours ago
An MCP execution platform that gives any AI real infrastructure to act. Through three tools it can run code and shell commands, install packages, run tasks in parallel, search the web, make HTTP calls, read write and query databases, call any LLM, commit to GitHub, take screenshots, handle files, and keep persistent memory across sessions, all with safety checks before results reach the model. AI never ships broken code, never burns your wallet, and cannot reach your internal network. Up to 96.5 percent fewer tokens than doing the work in chat. Works with Claude, ChatGPT, Gemini, and any MCP-compatible AI.
Overview

UniversalBench

AI has the intelligence. UniversalBench gives it the infrastructure to act.

Connect one URL to Claude, ChatGPT, Gemini, or any MCP-compatible AI, and behind it your AI can run code, search the web, read and write databases, call any LLM, and commit to GitHub, all before results reach the model. That filtering is what cuts up to 96.5 percent of tokens versus doing the work in chat.

Three guarantees, all live in code:

  • AI never ships broken code. Every code push is validated and confirmed to load before commit.

  • AI never burns your wallet. Every LLM call is cost-estimated before it runs, and calls over your ceiling are rejected.

  • AI cannot reach your internal network. Every request is checked against an internal-network blocklist.

What your AI can do through three tools (ub_read, ub_write, ub_ai):

  • Run code and shell commands in an isolated sandbox

  • Install packages and run tasks in parallel

  • Search the web and make safe HTTP calls

  • Read, write, and query databases with your own credentials

  • Call any major LLM with cost caps on by default

  • Read from and commit to GitHub, with every push validated first

  • Take screenshots and handle file delivery and conversion

  • Persist memory and state across sessions

  • Store credentials in an encrypted vault that unlocks the right tools automatically

Connect: https://universalbench-mcp.penantiaglobal.workers.dev/u/{api_key}

Free tier: 1,000 calls per month, no card required, then pay as you go.

Docs: https://docs.universalbench.dev

Server Config

{
  "mcpServers": {
    "universalbench": {
      "url": "https://universalbench-mcp.penantiaglobal.workers.dev/u/{api_key}"
    }
  }
}
Project Info
Created At
12 hours ago
Updated At
12 hours ago
Author Name
UniversalBench
Star
-
Language
-
License
-
Category

Recommend Servers

View All
Trainzilla Mcp

8 hours ago
Shippo
@Shippo

2 days ago
//beforeyouship — LLM Cost Modeling From Your Editor
@Indiegoing

Query realistic LLM cost models without leaving your editor. beforeyouship models the **true monthly cost** of an LLM app architecture — retries, prompt caching, batch discounts, infra overhead, and 3×/10× growth — across GPT-5.x, Claude, Gemini, DeepSeek, and more. Not a token calculator: a planning tool for the design phase, before you commit to a stack. **No API key needed to try it** — demo mode covers the six free-tier models. A Pro key from [beforeyouship.dev](https://beforeyouship.dev) unlocks the full 18-model catalog. ## What you can ask - "How much will a RAG chatbot cost at 10,000 requests/day?" - "Compare Claude Haiku vs Gemini Flash pricing for my workload" - "What's the cheapest model for a multi-step agent at scale?" - "Show me current per-token prices for Anthropic models" ## Tools ### `estimate_cost` Full cost model for an architecture at a given usage level. Returns Naive / Realistic / Worst Case monthly cost per model, 3×/10× growth scenarios, and an opinionated recommendation with reasoning. ### `get_model_prices` Current per-1M-token pricing — input, output, cached input, batch — with context windows and staleness metadata. ### `list_archetypes` Seven preset architecture patterns (simple chatbot, chatbot with history, RAG pipeline, multi-model router, coding assistant, document processor, multi-step agent) used as starting points for estimates. ## Setup **Claude Code:** ​```bash claude mcp add --transport http beforeyouship https://beforeyouship.dev/api/mcp ​``` **Cursor / other clients** — add a remote server: ​```json { "mcpServers": { "beforeyouship": { "type": "streamable-http", "url": "https://beforeyouship.dev/api/mcp" } } } ​``` Add an `Authorization: Bearer bys_...` header with a Pro key for the full catalog. ## Try it > Estimate the monthly cost of a RAG pipeline at 10,000 requests/day

a day ago