media
23AI tools in the media category
Video Research Mcp
Galbaz1
Give Claude Code 41 research & video tools with one command. Video analysis, deep research, content extraction, explainer video creation, and Weaviate vector search — powered by Gemini 3.1 Pro.
...moreLilbee
tobocop2
Local knowledge base for documents and code. Index PDFs, Office docs, spreadsheets, images, and code — search or ask questions standalone, or plug into AI agents via MCP. Fully local with Ollama + LanceDB + tree-sitter.
...moreAstrbot Plugin Office Assistant
Clhikari
这是一个为 AstrBot 设计的 Office 助手插件。它赋予大语言模型(LLM)直接操作文件的能力,支持读取并分析多种格式文件,以及生成 Office 文档和office互转pdf的功能
CrewAI Agents PDF RAG
sushant1827
PDF-RAG is a collaborative crew of AI agents that autonomously RAG & summarizes given PDF file.
Hypr Video Skills
superyhee
A Claude Code Skill that turns natural language into rendered MP4 videos — animations, transitions, captions, 50+ effects, all from your terminal via npx.
...moreMR Video
ziqipang
MR. Video: MapReduce is the Principle for Long Video Understanding
Vap Media Skill
RenSeiji27
🎨 Generate AI-powered images, videos, and music effortlessly with VAP Media Skill for Claude Code and Codex CLI. Enjoy free daily access or full features.
...moreClaude Image Gen
guinacio
AI-powered image generation using Google Gemini, integrated with Claude Code via Skills or Claude.ai via MCP (Model Context Protocol).
...moreImage Fetcher
sacredvoid
Fetch relevant, high-quality, free-to-use images from the web. Context-aware search from Unsplash, Pexels, Pixabay. Works with any AI coding tool.
...moreBRAD Video
Jpickard1
Retrieval Augmented Generation for youtube videos with a BRAD agent
Agentic News Generator
florianbuetow
Generate a custom newspaper with an AI agent based on your favorite YouTube channels.
World To Image
mhson-kyle
WORLD-TO-IMAGE: GROUNDING TEXT-TO-IMAGE GENERATION WITH AGENT-DRIVEN WORLD KNOWLEDGE
Vapagent Vap Media Skill
elestirelbilinc-sketch
AI image, video, and music generation skill for Claude Code. Flux, Veo 3.1, Suno V5.
ViewRAG
David-Lolly
图文并茂的 PDF RAG 系统:支持版式感知分块、图表深度理解与精准视觉溯源。 Multimodal PDF RAG: Features layout-aware chunking, visual chart understanding, and precise inline image citations.
...moreShort Video Maker
gyoridavid
Creates short videos for TikTok, Instagram Reels, and YouTube Shorts using the Model Context Protocol (MCP) and a REST API.
...moreImage Gen Mcp
lansespirit
An MCP server that integrates with gpt-image-1 & Gemini imagen4 model for text-to-image generation services
Ebook Mcp
onebirdrocks
A MCP server that supports mainstream eBook formats including EPUB, PDF and more. Simplify your eBook user experience with LLM.
...morePdf Rag Mcp Server
hyson666
PDF RAG server for cursor.
HXAudioPlayer
huhx0015
HX Audio Player: A custom audio wrapper library for Android 2.3 and above. Originally designed as an audio library for games, HX Audio Player is an easy-to-use, alternative approach to implementing music and sound playback into Android applications.
...moreComfyUI FFMPEGA
AEmotionStudio
Intelligent FFMPEG agent node for ComfyUI - transforms natural language video editing prompts into automated video transformations
...more