kiro-discord-bot

Media MCP

mcp-media-server is an optional MCP server for media generation. It is not required for normal bot operation.

Providers

The server registers providers from available API keys:

| API key | Enabled capabilities |
| --- | --- |
| GEMINI_API_KEY | Image generation, video generation, music generation, text to speech. |
| OPENAI_API_KEY | Image generation and text to speech. |

If no provider API key is set, the server exits at startup.
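
The registration rule can be illustrated with a minimal Python sketch. It is not the server's actual code: the function names `enabled_capabilities` and `require_provider`, the short capability labels, and the exit message are assumptions made for illustration. Only the two key names and their capability lists come from the table above.

```python
import sys

# Capability sets copied from the provider table; the short labels are illustrative.
PROVIDER_CAPABILITIES = {
    "GEMINI_API_KEY": {"image", "video", "music", "tts"},
    "OPENAI_API_KEY": {"image", "tts"},
}


def enabled_capabilities(env):
    """Map each provider key that has a non-empty value to its capabilities."""
    return {
        key: caps
        for key, caps in PROVIDER_CAPABILITIES.items()
        if env.get(key)
    }


def require_provider(env):
    """Return the enabled providers, or exit when no provider key is set."""
    providers = enabled_capabilities(env)
    if not providers:
        # Hypothetical message; the real server's wording is not documented here.
        sys.exit("mcp-media-server: no provider API key set")
    return providers
```

Calling `require_provider(os.environ)` before starting the server is one way to check a deployment's configuration up front. Treating an empty value as unset is also an assumption of this sketch.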

Optional defaults:

| Variable | Purpose |
| --- | --- |
| MEDIA_DEFAULT_IMAGE_MODEL | Override the default image model. |
| MEDIA_DEFAULT_TTS_MODEL | Override the default text-to-speech model. |
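
As a rough illustration of how such overrides usually behave, the Python sketch below resolves a model name. The precedence order (explicit request, then environment override, then built-in default) is an assumption, `resolve_model` is a hypothetical helper, and the built-in names are placeholders because the real defaults are not stated here.

```python
# Placeholder built-in defaults; the server's real default model names are not documented here.
BUILTIN_DEFAULTS = {
    "image": "<builtin-image-model>",
    "tts": "<builtin-tts-model>",
}

# Override variables from the table above.
OVERRIDE_VARS = {
    "image": "MEDIA_DEFAULT_IMAGE_MODEL",
    "tts": "MEDIA_DEFAULT_TTS_MODEL",
}


def resolve_model(kind, env, requested=None):
    """Pick a model: explicit request, then env override, then built-in default."""
    if requested:
        return requested
    # An unset or empty override falls through to the built-in default.
    return env.get(OVERRIDE_VARS[kind]) or BUILTIN_DEFAULTS[kind]
```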

Tools

| Tool | Purpose |
| --- | --- |
| generate_image | Generate an image from prompt, with optional model, size, and aspect_ratio. |
| edit_image | Edit an existing local image using image_path and prompt. |
| generate_video | Generate a video from prompt, optionally conditioned on image_path. |
| generate_music | Generate music from prompt, with optional duration_sec. |
| text_to_speech | Generate speech from text, with optional model and voice. |
| list_models | List available models, optionally filtered by image, video, music, or tts. |
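
MCP clients invoke tools with a JSON-RPC 2.0 `tools/call` request. The Python sketch below builds such a request for the tools listed above and rejects obviously malformed arguments. The argument names come from the table, but which arguments are required is an assumption, `build_tool_call` is a hypothetical helper, and `list_models` is left out because its filter argument name is not given.

```python
# Argument names from the tool table; the required/optional split is an assumption.
TOOL_ARGS = {
    "generate_image": {"required": {"prompt"}, "optional": {"model", "size", "aspect_ratio"}},
    "edit_image": {"required": {"image_path", "prompt"}, "optional": set()},
    "generate_video": {"required": {"prompt"}, "optional": {"image_path"}},
    "generate_music": {"required": {"prompt"}, "optional": {"duration_sec"}},
    "text_to_speech": {"required": {"text"}, "optional": {"model", "voice"}},
}


def build_tool_call(request_id, tool, arguments):
    """Build an MCP tools/call request after a basic argument check."""
    spec = TOOL_ARGS[tool]
    given = set(arguments)
    missing = spec["required"] - given
    unknown = given - spec["required"] - spec["optional"]
    if missing or unknown:
        raise ValueError(
            f"{tool}: missing={sorted(missing)} unknown={sorted(unknown)}"
        )
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": tool, "arguments": dict(arguments)},
    }
```

For example, `build_tool_call(1, "generate_image", {"prompt": "a red fox", "aspect_ratio": "16:9"})` yields a request whose `params.name` is `generate_image`. The accepted values for `size` and `aspect_ratio` depend on the provider and are not checked here.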

Generated artifacts are returned as local file paths to the agent. Use a channel policy that matches how much media generation cost and egress you want to allow.

Operational Notes

Media tools can spend external provider quota and may produce files that are not suitable for every Discord channel. Keep the media server disabled by default and enable it per channel through /mcp manage when the channel has a clear use case.