🦊 MCPBench: A Benchmark for Evaluating MCP Servers

Created By

modelscopea year ago

The evaluation benchmark on MCP servers

Overview

what is MCPBench?

MCPBench is an evaluation framework designed for assessing the performance of MCP Servers, specifically focusing on Web Search and Database Query tasks. It evaluates various servers like Brave Search and DuckDuckGo based on task completion accuracy, latency, and token consumption.

how to use MCPBench?

To use MCPBench, install the required dependencies, configure your LLM key and endpoint, launch the MCP server with the appropriate configuration, and run evaluations for Web Search or Database Query tasks.

key features of MCPBench?

Supports evaluation of multiple MCP Servers
Measures task completion accuracy, latency, and token consumption
Compatible with local and remote MCP Servers
Provides datasets for evaluation

use cases of MCPBench?

Evaluating the performance of different web search engines.
Comparing database query efficiency across various MCP Servers.
Analyzing the impact of different configurations on server performance.

FAQ from MCPBench?

What types of servers can be evaluated with MCPBench?

MCPBench can evaluate both Web Search and Database Query servers.

Is there a specific Python version required?

Yes, MCPBench requires Python version >= 3.11.

Where can I find the evaluation report?

The evaluation report is available in the project repository.

Project Info

Created At

a year ago

Updated At

a year ago

Author Name

modelscope

Star

Language

Python

License

Apache-2.0 license

Recommend Servers

View All

Framesail AI

@framesail

Official remote MCP server for Framesail AI. Create long-form (faceless YouTube) videos end to end from any MCP client: script, locked character references, storyboard, voiceover, and final video editing — with characters and style held consistent across every shot. Making long-form AI video today means 8+ tabs stitched by hand — an LLM for the script, a voice model, an image model, a video model — with characters drifting between tools and style resetting at every export. Framesail replaces the patchwork: the whole pipeline runs in one place and manages your video's context end to end. Six stages: Style (paste images, videos, or YouTube links and Framesail reverse-engineers the look, voice, and direction), Script (write it yourself or generate it in your narrative style), Reference images (auto-generated for every character, place, and prop), Voiceover (one narrator or many characters, with word-level timing), Storyboard (planned scene by scene), and Editor (captions, music, SFX, then export). No black box: you control every prompt, asset, model, and setting.

11 days ago

Neon MCP Server

@neondatabase-labs

MCP server for interacting with Neon Management API and databases

TypeScript

a year ago

Baidu Map

@baidu-maps

百度地图核心API现已全面兼容MCP协议，是国内首家兼容MCP协议的地图服务商。

a year ago

Slack

@modelcontextprotocol

Channel management and messaging capabilities

a year ago

Bucket Feature Flags MCP Server

@bucketco

Flag features directly from chat in your code editor, including VS Code, Cursor, Windsurf, Claude Code—any IDE with MCP support.

a year ago

EdgeOne Pages MCP

@TencentEdgeOne

An MCP service designed for deploying HTML content to EdgeOne Pages and obtaining an accessible public URL.

TypeScript

a year ago

Jina AI MCP Tools

@PsychArch

A Model Context Protocol (MCP) server that integrates with Jina AI Search Foundation APIs.

JavaScript

a year ago

GBOX Android MCP

@babelcloud

GBOX provides environments for AI Agents to operate computer and mobile devices. Mobile Scenario: Your agents can use GBOX to develop/test android apps, or run apps on the Android to complete various tasks(mobile automation). Desktop Scenario: Your agents can use GBOX to operate desktop apps such as browser, terminal, VSCode, etc(desktop automation). MCP: You can also plug GBOX MCP to any Agent you like, such as Cursor, Claude Code. These agents will instantly get the ability to operate computer and mobile devices.

a year ago

11 days ago

A Model Context Protocol (MCP) server for image generation and manipulation using fal.ai's Stable Diffusion model.

a year ago

MCP Server for Milvus

@zilliztech

The Milvus MCP server enables AI applications to interact with Milvus vector databases using natural language commands. It allows AI models to perform vector searches, manage collections, and retrieve data without writing custom database queries. This integration facilitates seamless access to vector data, enhancing the capabilities of AI tools like Claude Desktop and Cursor.

a year ago

Playwright Mcp

@microsoft

Playwright MCP server

TypeScript

a year ago

Amap Maps

@amap

高德地图官方 MCP Server

a year ago

Puppeteer

@modelcontextprotocol

Browser automation and web scraping

a year ago

Aws Kb Retrieval Server

@modelcontextprotocol

An MCP server implementation for retrieving information from the AWS Knowledge Base using the Bedrock Agent Runtime.

a year ago

Framelink Figma MCP Server

@GLips

MCP server to provide Figma layout information to AI coding agents like Cursor

TypeScript

a year ago

Google Maps

@modelcontextprotocol

Location services, directions, and place details

a year ago

Test

@modelcontextprotocol

test

7 months ago

Firecrawl Mcp Server

@mendableai

Official Firecrawl MCP Server - Adds powerful web scraping to Cursor, Claude and any other LLM clients.

JavaScript

a year ago

Filesystem

@modelcontextprotocol

3 months ago

Serper MCP Server

@garymengcom

A Serper MCP Server

Python

a year ago

Perplexity Ask MCP Server

@ppl-ai

A Model Context Protocol Server connector for Perplexity API, to enable web search without leaving the MCP ecosystem.

JavaScript

a year ago

Ivory Coast Payments Mcp

@junter1989k-ai

11 days ago

Kazakhstan Payments Mcp

@junter1989k-ai

11 days ago

Time

@modelcontextprotocol

A Model Context Protocol server that provides time and timezone conversion capabilities. This server enables LLMs to get current time information and perform timezone conversions using IANA timezone names, with automatic system timezone detection.

5 months ago

Memory

@modelcontextprotocol

a year ago

11 days ago

Create a remote sandbox that can execute code/run commands/upload and download files. 创建远程沙盒，可以执行代码/运行命令/上传下载文件

a year ago

Qiniu MCP Server

@Qiniu

基于七牛云产品构建的 Model Context Protocol (MCP) Server，支持用户在 AI 大模型客户端的上下文中通过该 MCP Server 来访问七牛云存储资源、利用 Dora 服务进行图片操作等。如果有什么需求欢迎在下方评论，您也可以在 github 仓库中提 issue。

Python

a year ago

Search1API

One API for Search, Crawling, and Sitemaps

a year ago