multimedia processing
30 AI tools in the multimedia processing category
transloadit/node-sdk
Agent-native media processing via Transloadit's 86+ Robots, supporting video encoding, image manipulation, document conversion, OCR, and speech transcription. Hosted or self-hosted via npx.
AIDC-AI/Pixelle-MCP
🐍 📇 🏠 🎥 🔊 🖼️ - An omnimodal AIGC framework that converts ComfyUI workflows into MCP tools with zero code, enabling full-modal support for Text, Image, Sound, and Video generation with a Chainlit-based web interface.
hamflx/imagen3-mcp
📇 🏠 🪟 🍎 🐧 - A powerful image generation tool using Google's Imagen 3.0 API through MCP. Generate high-quality images from text prompts with advanced photography, artistic, and photorealistic controls.
jyjune/mcp_vms
🐍 🏠 🪟 - A Model Context Protocol (MCP) server designed to connect to a CCTV recording program (VMS) to retrieve recorded and live video streams. It also provides tools to control the VMS software, such as showing live or playback dialogs for specific channels at specified times.
c-rick/jimeng-mcp
A TypeScript-based MCP server integrating Volcengine's AI image generation service, offering tools for creating images with customizable parameters and direct URL returns.
zjf2671/hh-mcp-comfyui
Facilitates image generation through natural language commands by interfacing with a local ComfyUI instance via MCP.
Flyworks-AI/lipsync-mcp
Facilitates fast and free lipsync video creation for digital avatars using the Flyworks API.
bads1de/youtube-mp3-mcp
Facilitates the extraction of high-quality MP3 audio from YouTube URLs with seamless Claude Desktop integration.
SkyworkAI/Mureka-mcp
Facilitates the creation of lyrics, songs, and background music through an MCP server, enabling seamless integration with platforms like Claude Desktop and OpenAI Agents.
kdr/mcp-draw
Facilitates AI-driven image generation from text prompts via a standardized interface.
4kk11/mcp-gpt-image
Generates and edits images using the OpenAI API, providing scalable previews and Docker integration.
falahgs/mcp-3d-style-cartoon-gen-server
A server that combines 3D-style cartoon image generation with secure file system operations, leveraging Google's Gemini AI and MCP SDK.
echozyr2001/ali-flux-mcp
Facilitates image generation and management using Alibaba Cloud's DashScope API, with task tracking and local storage capabilities.
mario-andreschak/mcp_video_recognition
Facilitates image, audio, and video recognition using Google's Gemini AI.
aimino/imagemagic-mcp
Enhances images with binarization, color adjustment, and resizing using ImageMagick via MCP.
joshmouch/mcp-image-generator
Facilitates image generation, editing, and variation creation using OpenAI's DALL-E API.
HYPERVAPOR/mcp-image-processor
High-performance image processing server offering format conversion, resizing, and optimization capabilities.
SealinGp/mcp-video-extraction
Facilitates text extraction from videos and audio files across multiple platforms using OpenAI's Whisper model.
MalluBeast69/gemini-img-gen-MCP
Generates images using Google's Gemini model via a dedicated MCP server.
omergocmen/json2video-mcp-server
Facilitates video creation and status monitoring through the json2video API, enabling seamless integration with LLMs and automation agents.