MCP YOLOE: Zero-Shot Object Detection & Segmentation

Created By

rjn32s4 months ago

Provide your AI agents with "eyes." This server enables open-vocabulary object detection and instance segmentation using naturally phrased text prompts (e.g., "detect the laptop next to the coffee").

# Vision

# AI

Overview Content Tools Comments

Overview

MCP-YOLO

MCP-YOLO is a powerful Model Context Protocol server that grants AI agents advanced computer vision capabilities. Unlike traditional YOLO models that only detect a fixed list of objects, this server uses Zero-Shot Learning to detect and segment anything you describe.

Key Features

Zero-Shot Detection: Detect arbitrary objects using natural language prompts.
Precision Segmentation: Get exact polygon masks for every detected object.
Flexible Inputs: Works with local file paths, remote image URLs, and Base64 strings.
Agent-First: Designed specifically for integration with Claude, IDEs, and autonomous workspace agents.

Example Usage

Ask your agent to:

"Find the 'vintage typewriter' in this image and give me its exact coordinates."

Performance

Uses the state-of-the-art YOLOE26-L architecture, providing a perfect balance of high precision (55.0 mAP) and rapid inference (~6.2ms on T4 GPUs).

Try in Playground

Server Config

{
  "mcpServers": {
    "mcp-yolo": {
      "command": "uvx",
      "args": [
        "mcp-yolo"
      ]
    }
  }
}

Project Info

Created At

4 months ago

Updated At

4 months ago

Author Name

rjn32s

Star

Language

License

Recommend Servers

View All

Mailtrap Email Sending MCP

@Mailtrap

An MCP server that provides a tool for sending transactional emails via Mailtrap

a year ago

Bucket Feature Flags MCP Server

@bucketco

Flag features directly from chat in your code editor, including VS Code, Cursor, Windsurf, Claude Code—any IDE with MCP support.

a year ago

Smart Match

@Wallstrdev

AI-powered job matching and application tracker. Analyze job listings against your resume, get a match score (0-100), identify skill gaps, generate cover letters, and track your application pipeline.

a day ago

Docwand

14 hours ago

Vectoralix Healthy Food

@vectoralix

Healthy Food MCP is a public MCP server that gives AI agents structured access to healthy recipe content. It lets Claude, Cursor, Gemini, and other MCP-compatible clients browse recipes by calorie category, diet group, meal type, macros, and keywords. The server includes tools for listing recipe categories, exploring diet and meal groups, browsing available recipe files, fetching full structured recipes, and searching the recipe collection. It is designed as a practical example of how MCP can be used beyond developer tools — turning curated food knowledge into something AI agents can query, understand, and use in real conversations. Healthy Food MCP is powered by Vectoralix and includes connection metadata, plugin manifests, reusable skills, documentation, assets, and validation scripts for easier installation and testing.

a day ago

SeedBase — Synthetic Test Data

@Marcel Gläser

Generate realistic, FK-consistent test data for your databases from your AI assistant. List projects, get schema DDL, generate datasets as SQL.

a day ago

Filesystem

Secure file operations with configurable access controls

a year ago

Fetch

@test

Web content fetching and conversion for efficient LLM usage

8 months ago

Firecrawl Mcp Server

@mendableai

Official Firecrawl MCP Server - Adds powerful web scraping to Cursor, Claude and any other LLM clients.

JavaScript

a year ago

Primerfp Scout

@PrimeRFP

Remote govcon intelligence MCP for US federal + SLED contracting: semantic opportunity search, USASpending awards, recompete pipeline, congressional policy intel, GAO protests, capture/teaming, and 32 read-only tools. Streamable HTTP at https://mcp.primerfp.com/mcp.

a day ago

Mcp Server Chatsum

@chatmcp

summarize chat message

typescript

a year ago

Sequential Thinking

@modelcontextprotocol

An MCP server implementation that provides a tool for dynamic and reflective problem-solving through a structured thinking process.

a year ago

LLMtoMD

@Gabriel Jacob

LLMtoMD turns your docs into clean, AI-ready Markdown and serves them to Cursor, Claude Code, and any MCP client so your coding agent retrieves your spec instead of forgetting it.

2 days ago

EdgeOne Pages MCP

@TencentEdgeOne

An MCP service designed for deploying HTML content to EdgeOne Pages and obtaining an accessible public URL.

TypeScript

a year ago

Howtocook Mcp

@worryzyy

基于Anduin2017 / HowToCook （程序员在家做饭指南）的mcp server，帮你推荐菜谱、规划膳食，解决“今天吃什么“的世纪难题； Based on Anduin2017/HowToCook (Programmer's Guide to Cooking at Home), MCP Server helps you recommend recipes, plan meals, and solve the century old problem of "what to eat today"

a year ago

302_browser_use_mcp

@302ai

Automatically create a remote browser to complete your specified tasks, developed based on Browser Use + Sandbox. 自动创建一个远程浏览器，完成你指定的任务，基于Browser Use + Sandbox开发。

a year ago

Convika - LP ops

@Tomoya

Your AI can ship landing pages now. Convika is a landing page ops platform built for MCP clients. Connect from Claude, Claude Code, Codex, or any MCP-compatible AI client and manage the full landing page lifecycle in natural language: - Create a LP and preview it before going live - Publish to a global edge network (pages stay up independently of the dashboard) - Collect leads with forms and export submissions - Read basic analytics: traffic, sources, devices, conversions, and goals - Connect custom domains - Iterate safely with version history and one-step rollback Quick start: 1. Sign up free at https://app.convika.com/signup 2. Claude Desktop: Settings → Connectors → Add custom connector → https://mcp.convika.com Claude Code: claude mcp add --transport http convika https://mcp.convika.com 3. Ask your AI: "Create a landing page for my product and show me the preview." Auth: OAuth 2.1 — sign in with your Convika account when the client prompts you.

a day ago

Linkpulse

@Joost Boer

Know what every affiliate link actually earns, and fix what's bleeding revenue. See revenue per article, catch dead links before they cost you, and ask it anything in plain English. Works on any site.

17 hours ago

HourLedger — Work Hours & Overtime Calculator

@wudongjie

Calculate work hours, overtime, and gross pay with tested rulesets for US federal, California, Alaska, Colorado, and Nevada law. Handles overnight shifts, rounding policies, and workweek start. Local, no API key, no data leaves your machine.

20 hours ago

Figma Mcp Express

@sunhome243

figma-mcp-express connects AI agents directly to Figma via a local plugin bridge. No Figma token required. No quota. No per-seat billing. Ships 70 discrete tools for reading design context, creating and mutating nodes, importing library components and variables, managing styles and tokens, running batch operations in a single round-trip, and exporting frames. Designed for agent-driven design automation workflows in Claude, Cursor, Codex, and other MCP-compatible clients.

a day ago

Mnemom

@Mnemom

Trust ratings for AI agents and websites. Look up an agent's reputation, scan a site's AI-trust-readiness, and verify signed scorecards in-band — reads are zero-auth. From Mnemom, the trust layer for the agent internet.

9 hours ago

MiniMax MCP

@MiniMax-AI

Official MiniMax Model Context Protocol (MCP) server that enables interaction with powerful Text to Speech, image generation and video generation APIs.

Python

a year ago

Jina AI MCP Tools

@PsychArch

A Model Context Protocol (MCP) server that integrates with Jina AI Search Foundation APIs.

JavaScript

a year ago

Erabi

@HMAKT99

ERABI is the open, cryptographically auditable intent exchange for AI agents: register an identity in one command, discover providers ranked by reputation (never by payment), fire intents, and build verifiable reputation and earnings from dual-signed outcomes on a public hash-chained ledger. Zero-config — `npx -y erabi-mcp` joins the live public network with no accounts, no API keys. Six tools: register, discover, intent, report_outcome, my_reputation, my_earnings. Live explorer: https://erabi-explorer.vercel.app

a day ago

Latlng

@latlng-work

Official LatLng MCP server for geocoding, reverse geocoding, places search, nearby POI lookup, and place categories. Powered by the LatLng API and OpenStreetMap data.

4 hours ago

MCP for Indexa Capital

@InvIngeniero

Check your portfolio, cash transactions, movements, payed fees, growth history and more

a day ago

Sentry

@modelcontextprotocol

Retrieving and analyzing issues from Sentry.io

a year ago

Linkedai

@DatTheMaster

LinkedIn for AI agents. Agents register structured profiles, list projects, evaluate fit via FitReports, and propose connections — all through a hosted MCP server. 27 tools, zero install. Handlers (humans) approve connections. Built on Cloudflare Workers + KV.

7 hours ago

CYBERDYNE — the engagement marketplace for the agent economy, native to the Bankr ecosystem

@Cyberdyne-OS

Engagement marketplace on Base, native to the Bankr ecosystem: AI agents and communities fund quests (follows, reposts, replies, quotes, original posts); verified-X humans complete them and are paid per approved action from a non-custodial x402 escrow — in USDC, BNKR, or any Bankr-launched token.

4 hours ago

Fonteum Mcp Server

@Fonteum

Hosted MCP server for source-provenanced US federal healthcare provider data — NPPES, CMS PECOS, Care Compare, OIG LEIE, Open Payments. Every field returns with its exact federal source, snapshot date, and SHA-256 attestation. Public data only; no PHI. Install: npx -y @fonteum/mcp

2 hours ago