Forge - GPU Kernel Optimization

Created By
RightNow-AI4 months ago
Turn slow PyTorch into fast CUDA/Triton kernels. 32 parallel swarm agents optimize your code on real datacenter GPUs (B200, H200, H100, A100) with up to 14x speedup over torch.compile.
Overview

Forge MCP Server

Swarm agents that turn slow PyTorch into fast CUDA/Triton kernels, from any AI coding agent.

What it does

  • Optimize existing kernels - Submit PyTorch code, get back an optimized Triton/CUDA kernel
  • Generate new kernels - Describe an operation, get a production-ready optimized kernel
  • 32 parallel swarm agents - Coder+Judge pairs compete to find the fastest kernel
  • Real GPU benchmarking - Every kernel is tested on datacenter hardware (B200, H200, H100, A100, L40S, T4)
  • Up to 14x faster than torch.compile(mode='max-autotune')
  • One-click auth - Browser-based OAuth, no API keys needed

Quick Start

claude mcp add forge-mcp -- npx -y @rightnow/forge-mcp-server

Tools
┌────────────────┬───────────────────────────────────────────────┐
│      Tool      │                  Description                  │
├────────────────┼───────────────────────────────────────────────┤
│ forge_auth     │ Authenticate with Forge via browser           │
├────────────────┼───────────────────────────────────────────────┤
│ forge_optimize │ Optimize PyTorch code into fast GPU kernels   │
├────────────────┼───────────────────────────────────────────────┤
│ forge_generate │ Generate optimized kernels from a description │
├────────────────┼───────────────────────────────────────────────┤
│ forge_credits  │ Check credit balance                          │
├────────────────┼───────────────────────────────────────────────┤
│ forge_status   │ Check job status                              │
├────────────────┼───────────────────────────────────────────────┤
│ forge_cancel   │ Cancel a running job                          │
├────────────────┼───────────────────────────────────────────────┤
│ forge_sessions │ List past optimization sessions               │
└────────────────┴───────────────────────────────────────────────┘


Pricing

Pay-as-you-go. $15/credit, 25% off at 10+. Free trial included - optimize 1 kernel, no credit card required.

https://rightnowai.co - https://github.com/RightNow-AI/forge-mcp-server - https://www.npmjs.com/package/@rightnow/forge-mcp-server

Server Config

{
  "mcpServers": {
    "forge": {
      "command": "npx",
      "args": [
        "-y",
        "@rightnow/forge-mcp-server"
      ]
    }
  }
}
Project Info
Created At
4 months ago
Updated At
4 months ago
Author Name
RightNow-AI
Star
-
Language
-
License
-
Category

Recommend Servers

View All
Tavily Mcp
@tavily-ai

JavaScript
a year ago
AI Work Market — USDC settlement rails for AI labor on Base Mainnet)
@Dario (DME)

AI Work Market is a USDC escrow protocol on Base Mainnet, designed for autonomous AI agents to find work, post jobs, and settle payments without humans in the loop. This MCP server exposes 10 tools: **Escrow lifecycle** - `create_intent_quote` — get calldata + gas estimate for funding a new escrow intent - `submit_proof_quote` — get calldata for the seller to submit a proof URI - `release_funds_quote` — get calldata for the buyer to release payment (or claim/refund) **x402 single-call binding** - `x402_consume` — replaces the 5-step x402 flow with one HMAC-signed POST that returns a delivery URL **Onboarding & discovery** - `agent_onboard` — generate a signed agent card with marketplace attestation - `agent_search` — tf-idf search over the live agent catalog - `agent_reputation` — server-side reputation from on-chain Released/Refunded/Disputed events **Live state** - `system_status` — live on-chain state (nextIntentId, accumulatedFees, contract balance, owner) - `escrow_rules` — contract semantics, lifecycle, call guides, failure modes - `events_subscribe` — SSE stream of new on-chain intent events All endpoints are serverless (Vercel) and return their schema on GET. No browser, no wallet UI required for an agent to integrate. The protocol takes a 1% commission on every settlement; the rest goes to the seller. The full AgentCard is at `/.well-known/agent-card.json` (A2A-compatible). The OpenAPI 3.0.3 spec is at `/.well-known/openapi.json` with `components.securitySchemes` (none, hmacX402). `robots.txt` allows GPTBot, ClaudeBot, anthropic-ai, PerplexityBot, Google-Extended, Applebot-Extended, CCBot, Amazonbot.

5 hours ago