Inferbench

Created By
JoniMartin272 hours ago
InferBench's MCP server lets coding agents run, serve and benchmark local LLMs (text + image, llama.cpp + Stable Diffusion) on your own hardware on demand. Measures real tokens/sec, picks the optimal quant for your GPU, and exposes a 124-model catalog. Local-first, no cloud required.

Server Config

{
  "mcpServers": {
    "inferbench": {
      "command": "C:\\Users\\<user>\\AppData\\Local\\Programs\\InferBench\\resources\\sidecar\\inferbench-backend.exe",
      "args": [
        "--mcp"
      ]
    }
  }
}
Project Info
Created At
2 hours ago
Updated At
2 hours ago
Author Name
JoniMartin27
Star
-
Language
-
License
-
Category

Recommend Servers

View All
PDFGate
@pdfgate

2 days ago
Zoviz Mcp

7 hours ago