Claude Desktop Real-time Audio MCP Server (Python Implementation)

Created By

joelfuller2016a year ago

Python-based Model Context Protocol (MCP) server for real-time microphone input to Claude Desktop on Windows. FastMCP + sounddevice + multiple STT engines for sub-500ms latency voice conversations.

Overview Content Tools Comments

Overview

What is Claude Desktop Real-time Audio MCP Server?

Claude Desktop Real-time Audio MCP Server is a Python-based server that facilitates real-time microphone input for Claude Desktop on Windows, enabling fast voice conversations with low latency.

How to use Claude Desktop Real-time Audio MCP Server?

To use the server, clone the repository, set up a virtual environment, install dependencies, configure your audio settings and STT engines, and run the server.

Key features of Claude Desktop Real-time Audio MCP Server?

Real-time audio capture with sub-500ms latency.
Supports multiple speech-to-text engines including OpenAI Whisper, Azure Speech, and Google Speech-to-Text.
Easy configuration through JSON/YAML files and environment variables.
Comprehensive logging and performance monitoring.
Async architecture for non-blocking operations.

Use cases of Claude Desktop Real-time Audio MCP Server?

Enabling voice-driven interactions with Claude Desktop.
Real-time transcription of spoken language into text.
Voice activity detection for improved audio processing.

FAQ from Claude Desktop Real-time Audio MCP Server?

What platforms does it support?

It supports Windows 10/11 and requires Python 3.8 or higher.
Is it free to use?

Yes, it is open-source and available under the MIT License.
How can I contribute?

Contributions are welcome, especially in areas like additional STT engines and cross-platform support.

Project Info

Created At

a year ago

Updated At

a year ago

Author Name

joelfuller2016

Star

0

Language

Python

License

MIT license

Category

developer-tools

Tags

Homepage

https://github.com/joelfuller2016/claude-desktop-realtime-audio-mcp-python

Recommend Servers

Switzerland Payments Mcp

@junter1989k-ai

13 days ago

302_browser_use_mcp

Automatically create a remote browser to complete your specified tasks, developed based on Browser Use + Sandbox. 自动创建一个远程浏览器，完成你指定的任务，基于Browser Use + Sandbox开发。

a year ago

EdgeOne Pages MCP

@TencentEdgeOne

An MCP service designed for deploying HTML content to EdgeOne Pages and obtaining an accessible public URL.

TypeScript

a year ago

Aws Kb Retrieval Server

@modelcontextprotocol

An MCP server implementation for retrieving information from the AWS Knowledge Base using the Bedrock Agent Runtime.

a year ago

Nepal Payments Mcp

@junter1989k-ai

12 days ago

Costa Rica Payments Mcp

@junter1989k-ai

12 days ago

Tanzania Payments Mcp

@junter1989k-ai

12 days ago

Persistent Adaptive Planning Intelligence - structured loop engineering for AI coding assistants, with memory that persists across sessions, tools, and teammates.

13 days ago

Aiimagemultistyle

A Model Context Protocol (MCP) server for image generation and manipulation using fal.ai's Stable Diffusion model.

a year ago

Neon MCP Server

@neondatabase-labs

MCP server for interacting with Neon Management API and databases

TypeScript

a year ago

Bulgaria Payments Mcp

@junter1989k-ai

12 days ago

JavaScript

a year ago

Qiniu MCP Server

基于七牛云产品构建的 Model Context Protocol (MCP) Server，支持用户在 AI 大模型客户端的上下文中通过该 MCP Server 来访问七牛云存储资源、利用 Dora 服务进行图片操作等。如果有什么需求欢迎在下方评论，您也可以在 github 仓库中提 issue。

Python

a year ago

Algeria Payments Mcp

@junter1989k-ai

12 days ago

VC follow intelligence for AI agents. Track what top investors follow on X — detect new follows, convergence signals, and trending companies before they're announced.

7 days ago

Framelink Figma MCP Server

MCP server to provide Figma layout information to AI coding agents like Cursor

TypeScript

a year ago

Bangladesh Payments Mcp

@junter1989k-ai

13 days ago

Web content fetching and conversion for efficient LLM usage

9 months ago

@Henning Witzel-Acikgöz

Host the HTML or Markdown pages your AI generates and share each as a link with comments, versioning, and access control. Create, update, and organize pages and read reviewer comments over MCP.

12 days ago

I Ching Hexagram Mcp

@Network-Ideas LLC

Public, read-only MCP over the complete 64-hexagram I Ching corpus — hexagrams, trigrams, changing lines, and reading context from an original English translation. Deterministic lookup only; no casting or divination.

13 days ago

Jordan Payments Mcp

@junter1989k-ai

12 days ago

France Payments Mcp

@junter1989k-ai

13 days ago

Belgium Payments Mcp

@junter1989k-ai

13 days ago

Latvia Payments Mcp

@junter1989k-ai

12 days ago

Ivory Coast Payments Mcp

@junter1989k-ai

12 days ago

@modelcontextprotocol

Read-only database access with schema inspection

a year ago

Croatia Payments Mcp

@junter1989k-ai

12 days ago

Norway Payments Mcp

@junter1989k-ai

13 days ago

ContextBridge (CB) is a local-first retrieval layer that sits between your codebase and your AI coding agent. Instead of pasting raw source files into context or letting the AI guess which files matter, CB indexes your codebase's real structure (via Graphify) and returns a compact, ranked result — owner file, related files, key symbols, and dependency chains — grounded in your actual code. A typical response is ~4–8 KB instead of the 100+ KB of raw source an AI would otherwise need to read to answer the same question — roughly a 96% cut in input tokens sent to your cloud AI, with zero hallucinated file paths or method names.

13 days ago

Luxembourg Payments Mcp

@junter1989k-ai

12 days ago