Forge - GPU Kernel Optimization

Created By

RightNow-AI4 months ago

Turn slow PyTorch into fast CUDA/Triton kernels. 32 parallel swarm agents optimize your code on real datacenter GPUs (B200, H200, H100, A100) with up to 14x speedup over torch.compile.

# gpu

# cuda

Overview Content Tools Comments

Overview

Forge MCP Server

Swarm agents that turn slow PyTorch into fast CUDA/Triton kernels, from any AI coding agent.

What it does

Optimize existing kernels - Submit PyTorch code, get back an optimized Triton/CUDA kernel
Generate new kernels - Describe an operation, get a production-ready optimized kernel
32 parallel swarm agents - Coder+Judge pairs compete to find the fastest kernel
Real GPU benchmarking - Every kernel is tested on datacenter hardware (B200, H200, H100, A100, L40S, T4)
Up to 14x faster than torch.compile(mode='max-autotune')
One-click auth - Browser-based OAuth, no API keys needed

Quick Start

claude mcp add forge-mcp -- npx -y @rightnow/forge-mcp-server

Tools
┌────────────────┬───────────────────────────────────────────────┐
│      Tool      │                  Description                  │
├────────────────┼───────────────────────────────────────────────┤
│ forge_auth     │ Authenticate with Forge via browser           │
├────────────────┼───────────────────────────────────────────────┤
│ forge_optimize │ Optimize PyTorch code into fast GPU kernels   │
├────────────────┼───────────────────────────────────────────────┤
│ forge_generate │ Generate optimized kernels from a description │
├────────────────┼───────────────────────────────────────────────┤
│ forge_credits  │ Check credit balance                          │
├────────────────┼───────────────────────────────────────────────┤
│ forge_status   │ Check job status                              │
├────────────────┼───────────────────────────────────────────────┤
│ forge_cancel   │ Cancel a running job                          │
├────────────────┼───────────────────────────────────────────────┤
│ forge_sessions │ List past optimization sessions               │
└────────────────┴───────────────────────────────────────────────┘


Pricing

Pay-as-you-go. $15/credit, 25% off at 10+. Free trial included - optimize 1 kernel, no credit card required.

https://rightnowai.co - https://github.com/RightNow-AI/forge-mcp-server - https://www.npmjs.com/package/@rightnow/forge-mcp-server

Try in Playground

Server Config

{
  "mcpServers": {
    "forge": {
      "command": "npx",
      "args": [
        "-y",
        "@rightnow/forge-mcp-server"
      ]
    }
  }
}

Project Info

Created At

4 months ago

Updated At

4 months ago

Author Name

RightNow-AI

Star

Language

License

Recommend Servers

View All

Aiimagemultistyle

@codecraftm

A Model Context Protocol (MCP) server for image generation and manipulation using fal.ai's Stable Diffusion model.

a year ago

Almega

@almega-ai

Give your AI agents a wallet they can't abuse. Almega is an MCP server that puts a control layer in front of every payment: per-agent spending limits, allow-listed categories, 1-click human approval on sensitive transactions, and a full audit ledger. Two backends ship in one file — `memory` (zero-config, 30-second demo) and `stripe` (real Stripe Issuing test-mode virtual cards, no real money). 7 tools, stdio transport, Python 3.10+, MIT.

2 hours ago

Perplexity Ask MCP Server

@ppl-ai

A Model Context Protocol Server connector for Perplexity API, to enable web search without leaving the MCP ecosystem.

JavaScript

a year ago

Convika - LP ops

@Tomoya

Your AI can ship landing pages now. Convika is a landing page ops platform built for MCP clients. Connect from Claude, Claude Code, Codex, or any MCP-compatible AI client and manage the full landing page lifecycle in natural language: - Create a LP and preview it before going live - Publish to a global edge network (pages stay up independently of the dashboard) - Collect leads with forms and export submissions - Read basic analytics: traffic, sources, devices, conversions, and goals - Connect custom domains - Iterate safely with version history and one-step rollback Quick start: 1. Sign up free at https://app.convika.com/signup 2. Claude Desktop: Settings → Connectors → Add custom connector → https://mcp.convika.com Claude Code: claude mcp add --transport http convika https://mcp.convika.com 3. Ask your AI: "Create a landing page for my product and show me the preview." Auth: OAuth 2.1 — sign in with your Convika account when the client prompts you.

a day ago

Mailtrap Email Sending MCP

@Mailtrap

An MCP server that provides a tool for sending transactional emails via Mailtrap

a year ago

Ironclaw Bitcoin Blockchain Api

Bitcoin blockchain MCP server providing 17 tools: BTC price/info, mempool fees, transaction details, address lookups, blockchain debug, whale alerts, SEC insider trading tracker, web scraping/summarization, systems theory, game theory, capital flows analysis, and Reddit API (hot/search/trending). SSE endpoint.

a day ago

Indian Food Nutrition Mcp - Log Indian meals with your AI using accurate data. India's official IFCT 2017 nutrition tables + USDA (8,335 foods), by text or photo. Local-first, open source.

@krishnabhat

One-line description: Log Indian meals with your AI using accurate data. India's official IFCT 2017 nutrition tables + USDA (8,335 foods), by text or photo. Local-first, open source. Long description: An MCP server that gives Claude (and soon ChatGPT) accurate Indian food data. Most calorie databases are US-centric and wrong for home-cooked Indian food. This wraps India's official Food Composition Tables (IFCT 2017, National Institute of Nutrition) plus USDA. Log by talking ("2 rotis and a katori of dal") or by photo; the model identifies the food, the database supplies the numbers (no LLM guessing), and your history feeds back so the AI can coach you against what you actually ate. Local SQLite, no account, no telemetry. AGPL-3.0. Tools: search_food, log_meal, get_day, get_history, edit_entry, delete_entry, fetch_image

2 hours ago

Socialclaw

@ndesv21

Social media scheduling MCP for AI agents posting to X, LinkedIn, Instagram, Facebook Pages, TikTok, Discord, Telegram, YouTube, Reddit, WordPress, and Pinterest.

9 hours ago

PostgreSQL

@modelcontextprotocol

Read-only database access with schema inspection

a year ago

EverArt

@modelcontextprotocol

AI image generation using various models

a year ago

Auditspark

@brianontech-bot

AI-powered website audit for Claude. Analyze any URL across SEO, performance, accessibility, UX, content quality, and 10+ other categories — get a scored report in under 2 minutes. Free tier included (3 audits/day).

a day ago

//beforeyouship — LLM Cost Modeling From Your Editor

@Indiegoing

Query realistic LLM cost models without leaving your editor. beforeyouship models the **true monthly cost** of an LLM app architecture — retries, prompt caching, batch discounts, infra overhead, and 3×/10× growth — across GPT-5.x, Claude, Gemini, DeepSeek, and more. Not a token calculator: a planning tool for the design phase, before you commit to a stack. **No API key needed to try it** — demo mode covers the six free-tier models. A Pro key from [beforeyouship.dev](https://beforeyouship.dev) unlocks the full 18-model catalog. ## What you can ask - "How much will a RAG chatbot cost at 10,000 requests/day?" - "Compare Claude Haiku vs Gemini Flash pricing for my workload" - "What's the cheapest model for a multi-step agent at scale?" - "Show me current per-token prices for Anthropic models" ## Tools ### `estimate_cost` Full cost model for an architecture at a given usage level. Returns Naive / Realistic / Worst Case monthly cost per model, 3×/10× growth scenarios, and an opinionated recommendation with reasoning. ### `get_model_prices` Current per-1M-token pricing — input, output, cached input, batch — with context windows and staleness metadata. ### `list_archetypes` Seven preset architecture patterns (simple chatbot, chatbot with history, RAG pipeline, multi-model router, coding assistant, document processor, multi-step agent) used as starting points for estimates. ## Setup **Claude Code:** ```bash claude mcp add --transport http beforeyouship https://beforeyouship.dev/api/mcp ``` **Cursor / other clients** — add a remote server: ```json { "mcpServers": { "beforeyouship": { "type": "streamable-http", "url": "https://beforeyouship.dev/api/mcp" } } } ``` Add an `Authorization: Bearer bys_...` header with a Pro key for the full catalog. ## Try it > Estimate the monthly cost of a RAG pipeline at 10,000 requests/day

7 hours ago

EdgeOne Pages MCP

@TencentEdgeOne

An MCP service designed for deploying HTML content to EdgeOne Pages and obtaining an accessible public URL.

TypeScript

a year ago

Google Maps

@modelcontextprotocol

Location services, directions, and place details

a year ago

CYBERDYNE — Let your AI agent hire and pay a verified human

@Cyberdyne-OS

CYBERDYNE is an MCP gateway that lets an AI agent post real-world tasks it can't do alone (voice, observation, judgment) and pay verified humans in USDC via a non-custodial x402 auth-capture escrow on Base — budget frozen at deploy, released first-come-first-served. Humans verify their X identity before submitting. Open source (MIT); self-onboard with: npx -y cyberdyne-mcp onboard

7 hours ago

Agentline

@Sameer

AgentLine is the telephony layer for AI agents. It gives your agent a real phone number making outbound calls, receiving inbound calls, and handling SMS all through a single API. No telecom infrastructure, no WebSocket wrangling, no separate STT/TTS providers to configure.

7 hours ago

MCP Advisor

@istarwyh

MCP Advisor & Installation - Use the right MCP server for your needs

TypeScript

a year ago

Shippo

@Shippo

15 hours ago

Giveradar Mcp Server

Remote MCP server exposing 8.7M+ registered charities across 60+ countries, sourced from official government registries (IRS, Charity Commission, ACNC, DSD, RNA, and 60+ more). Read-only, no key required to start.

19 hours ago

2d Games Assets Generator

@crony-io

An MCP (Model Context Protocol) server that generates advanced mock 2D PNG assets for games prototypes — directly from any MCP-compatible AI client such as Claude Desktop. This MCP is engine-agnostic and works with any game engine that supports PNG import: Godot Unity Unreal Engine GameMaker Construct RPG Maker And many more... Create placeholder sprites, UI elements, health bars, spritesheets, and more with full support for gradients, patterns, transparency, text rotation, and auto-scaling — all without opening an image editor. Each generated PNG embeds rich JSON metadata (dimensions, color, shape, description) directly in its EXIF data, so AI models without vision can still understand what an asset contains.

7 hours ago

Gp Intel

@gparientee

Verified European private equity ownership data: who owns a company, PE firm portfolios, exits by year. 21,000+ companies, 900+ GPs, hand-checked, source link on every response. No auth.

2 hours ago

Tavily Mcp

@tavily-ai

JavaScript

a year ago

Rightblogger

@RightBlogger

RightBlogger MCP gives any AI agent direct access to SEO keyword research, Google Search Console performance, and your WordPress/Ghost/Webflow CMS — research keywords, read posts, and pull GSC data straight from Claude, Cursor, or any MCP client.

a day ago

Slack

@modelcontextprotocol

Channel management and messaging capabilities

a year ago

Matchbox

@Matchbox (Co-fe GmbH)

Describe a real-world problem in plain language and Matchbox finds products built to solve it - with reasoning, honest caveats, what each product won't cover, and a frank 'no strong match' when nothing fits. The catalog (~12,000 products) focuses on early-stage and lesser-known products that search engines and LLM training data usually miss. Never sponsored; payment never affects ranking. Tools: find_products_for_problem, search_catalog, get_product. No auth required.

9 hours ago

Figma Mcp Express

@sunhome243

figma-mcp-express connects AI agents directly to Figma via a local plugin bridge. No Figma token required. No quota. No per-seat billing. Ships 70 discrete tools for reading design context, creating and mutating nodes, importing library components and variables, managing styles and tokens, running batch operations in a single round-trip, and exporting frames. Designed for agent-driven design automation workflows in Claude, Cursor, Codex, and other MCP-compatible clients.

20 hours ago

Sequential Thinking

@modelcontextprotocol

An MCP server implementation that provides a tool for dynamic and reflective problem-solving through a structured thinking process.

a year ago

StudiePoint AI

@StudiePoint

Scholarship discovery and academic guidance for African postgraduate students. Search 172 full-ride + 61 full-tuition scholarships across 32 countries. Convert African GPA (13 grading systems, 54 countries). Check visa access. Get personalised matches. Estimate study costs. 6 tools. No auth required.

a day ago

Mnemom

8 hours ago

Liveauth Mcp Server

@dulzuradev

LiveAuth MCP Server gives AI agents cryptographic proof-of-work + Lightning Network authentication. Agents solve a PoW challenge, get a signed JWT, and use it to call paid MCP tools. Each tool call is metered in sats and recorded as a signed revenue event with a verifiable receipt. Built on L402 (the Lightning-Native HTTP 402 protocol from Lightning Labs). Compatible with x402 (Cloudflare/Coinbase). Non-custodial — no KYC, no account, no email. Pay per call in sats over Lightning. Install: `npx -y @liveauth-labs/mcp-server` Docs: https://docs.liveauth.app/mcp-liveauth-gate Source: https://github.com/dulzuradev/liveauth-mcp

2 hours ago