Gemini Nanobanana MCP

Created By
Junhan2, 8 months ago
Generate images from text with Claude! Simply type "Draw a cute cat" and get instant AI-generated images.
Overview

What is Gemini Nanobanana MCP?

Gemini Nanobanana MCP is a Model Context Protocol (MCP) server that generates images from text prompts inside Claude conversations, using Google's Gemini 2.5 Flash Image model.

How to use Gemini Nanobanana MCP?

To use Gemini Nanobanana MCP, obtain an API key from Google AI Studio and add the server to your Claude client's configuration. You can then generate images by typing prompts such as "Draw a cute cat".

Key features of Gemini Nanobanana MCP

  • Text-to-image generation from natural language prompts.
  • Image editing capabilities with natural language instructions.
  • Image composition and style transfer features.
  • Automatic saving of generated images to a specified directory.

Use cases of Gemini Nanobanana MCP

  1. Creating unique images based on user-defined descriptions.
  2. Editing existing images by applying filters or backgrounds.
  3. Combining multiple images into a single creative output.
  4. Applying artistic styles from one image to another.

FAQ for Gemini Nanobanana MCP

  • What is required to use this project?

You need a Google API key and a compatible Claude client to run the server.

  • Is there a cost associated with using Gemini Nanobanana MCP?

The server itself is free to use, but Google may charge you for Gemini API usage.

  • What formats are supported for image generation?

The server supports PNG, JPEG, WebP, and GIF formats.
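
As an illustration of the format list above, here is a minimal Python sketch that checks whether a file name matches one of the listed formats. The helper name `is_supported_image` and the mapping of formats to file extensions (including both `.jpg` and `.jpeg` for JPEG) are assumptions made for this example; they are not taken from the server's code.

```python
from pathlib import Path

# Assumed extension mapping for the formats named in the FAQ
# (PNG, JPEG, WebP, GIF). The server's own checks may differ.
SUPPORTED_EXTENSIONS = {".png", ".jpg", ".jpeg", ".webp", ".gif"}


def is_supported_image(path: str) -> bool:
    """Return True if the file extension matches one of the listed formats."""
    return Path(path).suffix.lower() in SUPPORTED_EXTENSIONS
```

A check like this could run before asking Claude to edit or combine local images, so that unsupported files are caught early.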

Server Config

{
  "mcpServers": {
    "gemini-nanobanana-mcp": {
      "command": "npx",
      "args": [
        "gemini-nanobanana-mcp@latest"
      ],
      "env": {
        "GEMINI_API_KEY": "your-api-key",
        "AUTO_SAVE": "true",
        "DEFAULT_SAVE_DIR": "~/Pictures/AI-Images",
        "LOG_LEVEL": "debug"
      }
    }
  }
}
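
The configuration above can also be generated from a script, which avoids pasting the API key into a file by hand. The following is a minimal Python sketch under stated assumptions: `build_server_config` is a hypothetical helper written for this example, and the environment variable names and values are copied from the Server Config shown above. Where the resulting JSON should be written depends on your Claude client and is not covered here.

```python
import json


def build_server_config(api_key: str,
                        save_dir: str = "~/Pictures/AI-Images") -> dict:
    """Build an mcpServers entry mirroring the Server Config above.

    Hypothetical helper: only the key and save directory are parameters;
    every other value is copied from the listing as-is.
    """
    if not api_key or api_key == "your-api-key":
        raise ValueError("Provide a real API key from Google AI Studio")
    return {
        "mcpServers": {
            "gemini-nanobanana-mcp": {
                "command": "npx",
                "args": ["gemini-nanobanana-mcp@latest"],
                "env": {
                    "GEMINI_API_KEY": api_key,
                    "AUTO_SAVE": "true",
                    "DEFAULT_SAVE_DIR": save_dir,
                    "LOG_LEVEL": "debug",
                },
            }
        }
    }


def render_server_config(api_key: str) -> str:
    """Serialize the config as indented JSON, ready to paste into a client."""
    return json.dumps(build_server_config(api_key), indent=2)
```

Calling `render_server_config(os.environ["GEMINI_API_KEY"])` (after `import os`) would produce the JSON with the key read from the environment rather than hard-coded.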

Project Info

Created At: 8 months ago
Updated At: 7 months ago
Author Name: Junhan2
