AgentHubAgentHub

multimodal

MCP ServerMCP Registry官方收录

io.github.rsmdt/multimodal · v1.3.1

Multi-provider media generation — images, video, audio, and transcription via a unified interface

概览

multimodal 是一个MCP Server,收录自 官方 MCP Registry。支持 stdio 传输。本页提供 Cursor、Claude Code 等客户端的安装配置片段。

安装

选择你的平台查看安装方式

{
  "mcpServers": {
    "multimodal": {
      "command": "npx",
      "args": [
        "-y",
        "@r16t/multimodal-mcp"
      ]
    }
  }
}

环境变量

OPENAI_API_KEY可选secret

OpenAI API key for image, video, audio generation and transcription

XAI_API_KEY可选secret

xAI API key for image and video generation

GEMINI_API_KEY可选secret

Google Gemini API key for image, video, and audio generation

ELEVENLABS_API_KEY可选secret

ElevenLabs API key for audio generation and transcription

BFL_API_KEY可选secret

BFL API key for FLUX image generation and editing

MEDIA_OUTPUT_DIR可选

Directory for saved media files (defaults to cwd)

相关资源

统一 Manifest

{
  "id": "io.github.rsmdt/multimodal",
  "type": "mcp-server",
  "version": "1.3.1",
  "displayName": "multimodal",
  "description": "Multi-provider media generation — images, video, audio, and transcription via a unified interface",
  "repository": {
    "url": "https://github.com/rsmdt/multimodal-mcp",
    "source": "github"
  },
  "distribution": {
    "packages": [
      {
        "registryType": "npm",
        "identifier": "@r16t/multimodal-mcp",
        "version": "1.3.1",
        "transport": "stdio",
        "environmentVariables": [
          {
            "name": "OPENAI_API_KEY",
            "description": "OpenAI API key for image, video, audio generation and transcription",
            "isSecret": true
          },
          {
            "name": "XAI_API_KEY",
            "description": "xAI API key for image and video generation",
            "isSecret": true
          },
          {
            "name": "GEMINI_API_KEY",
            "description": "Google Gemini API key for image, video, and audio generation",
            "isSecret": true
          },
          {
            "name": "ELEVENLABS_API_KEY",
            "description": "ElevenLabs API key for audio generation and transcription",
            "isSecret": true
          },
          {
            "name": "BFL_API_KEY",
            "description": "BFL API key for FLUX image generation and editing",
            "isSecret": true
          },
          {
            "name": "MEDIA_OUTPUT_DIR",
            "description": "Directory for saved media files (defaults to cwd)"
          }
        ]
      }
    ],
    "remotes": []
  },
  "dependencies": [],
  "installTargets": [
    "claude-code",
    "claude-desktop",
    "cursor",
    "vscode"
  ],
  "keywords": [],
  "provenance": {
    "origin": "official-mcp-registry",
    "originalId": "io.github.rsmdt/multimodal",
    "originalUrl": "https://registry.modelcontextprotocol.io/v0.1/servers/io.github.rsmdt%2Fmultimodal/versions/latest",
    "isOfficial": true,
    "status": "active"
  }
}
multimodal — MCP Server 安装与配置 · AgentHub