Artificial Analysis

Created By
David Hariri4 months ago
I built this to give my AI assistants the latest information on model performance benchmarks provided by https://artificialanalysis.ai to make informed decisions on model choice.
Overview

Artificial Analysis MCP Server

An MCP (Model Context Protocol) server that provides LLM model pricing, speed metrics, and benchmark scores from Artificial Analysis.

Features

  • Get real-time pricing for 300+ LLM models (input/output/blended rates)
  • Compare speed metrics (tokens/sec, time to first token)
  • Access benchmark scores (Intelligence Index, Coding Index, MMLU-Pro, GPQA, and more)
  • Filter by provider (OpenAI, Anthropic, Google, etc.)
  • Sort by any metric

Installation

Claude Code

claude mcp add artificial-analysis -e AA_API_KEY=your-key -- npx -y artificial-analysis-mcp

Or install from GitHub:

claude /mcp add https://github.com/davidhariri/artificial-analysis-mcp

Manual Configuration

Add to your Claude settings (~/.claude/settings.json):

{
  "mcpServers": {
    "artificial-analysis": {
      "command": "npx",
      "args": ["-y", "artificial-analysis-mcp"],
      "env": {
        "AA_API_KEY": "your-api-key"
      }
    }
  }
}

Configuration

Environment VariableRequiredDescription
AA_API_KEYYesYour Artificial Analysis API key

Get your API key at artificialanalysis.ai.

Tools

list_models

List all available LLM models with optional filtering and sorting.

Parameters:

NameTypeRequiredDescription
creatorstringNoFilter by model creator (e.g., "OpenAI", "Anthropic")
sort_bystringNoSort field (see below)
sort_orderstringNo"asc" or "desc" (default: "desc")
limitnumberNoMaximum results to return

Sort fields: price_input, price_output, price_blended, speed, ttft, intelligence_index, coding_index, math_index, mmlu_pro, gpqa, release_date

Example usage:

  • "List the top 5 fastest models"
  • "Show me Anthropic models sorted by price"
  • "What are the cheapest models with high intelligence scores?"

get_model

Get detailed information about a specific model.

Parameters:

NameTypeRequiredDescription
modelstringYesModel name or slug (e.g., "gpt-4o", "claude-4-5-sonnet")

Returns: Complete model details including pricing, speed metrics, and all benchmark scores.

Example usage:

  • "Get pricing for GPT-4o"
  • "What are Claude 4.5 Sonnet's benchmark scores?"

Model Data

Each model includes:

  • Pricing: Input/output/blended rates per 1M tokens (USD)
  • Speed: Output tokens per second, time to first token
  • Benchmarks: Intelligence Index, Coding Index, Math Index, MMLU-Pro, GPQA, LiveCodeBench, and more

Development

# Install dependencies
npm install

# Build
npm run build

# Run locally
AA_API_KEY=your-key node dist/index.js

License

MIT

Server Config

{
  "mcpServers": {
    "artificial-analysis": {
      "command": "npx",
      "args": [
        "-y",
        "artificial-analysis-mcp"
      ],
      "env": {
        "AA_API_KEY": "your-api-key"
      }
    }
  }
}
Project Info
Created At
4 months ago
Updated At
4 months ago
Author Name
David Hariri
Star
-
Language
-
License
-
Category

Recommend Servers

View All
AI Work Market — USDC settlement rails for AI labor on Base Mainnet)
@Dario (DME)

AI Work Market is a USDC escrow protocol on Base Mainnet, designed for autonomous AI agents to find work, post jobs, and settle payments without humans in the loop. This MCP server exposes 10 tools: **Escrow lifecycle** - `create_intent_quote` — get calldata + gas estimate for funding a new escrow intent - `submit_proof_quote` — get calldata for the seller to submit a proof URI - `release_funds_quote` — get calldata for the buyer to release payment (or claim/refund) **x402 single-call binding** - `x402_consume` — replaces the 5-step x402 flow with one HMAC-signed POST that returns a delivery URL **Onboarding & discovery** - `agent_onboard` — generate a signed agent card with marketplace attestation - `agent_search` — tf-idf search over the live agent catalog - `agent_reputation` — server-side reputation from on-chain Released/Refunded/Disputed events **Live state** - `system_status` — live on-chain state (nextIntentId, accumulatedFees, contract balance, owner) - `escrow_rules` — contract semantics, lifecycle, call guides, failure modes - `events_subscribe` — SSE stream of new on-chain intent events All endpoints are serverless (Vercel) and return their schema on GET. No browser, no wallet UI required for an agent to integrate. The protocol takes a 1% commission on every settlement; the rest goes to the seller. The full AgentCard is at `/.well-known/agent-card.json` (A2A-compatible). The OpenAPI 3.0.3 spec is at `/.well-known/openapi.json` with `components.securitySchemes` (none, hmacX402). `robots.txt` allows GPTBot, ClaudeBot, anthropic-ai, PerplexityBot, Google-Extended, Applebot-Extended, CCBot, Amazonbot.

8 hours ago