awesome-mcp-servers-tensorblock/docs/multimedia-processing.md at main · howard-eridani/awesome-mcp-servers-tensorblock

🖼️ Multimedia Processing

Servers focused on generating or manipulating images, processing video, audio transcription, text-to-speech, or document conversion.

AIDC-AI/Pixelle-MCP: An omnimodal AIGC framework that seamlessly converts ComfyUI workflows into MCP tools with zero code, enabling full-modal support for Text, Image, Sound, and Video generation with Chainlit-based web interface.
aimino/imagemagic-mcp: Enhance images with binarization, color adjustment, and resizing using ImageMagick via the MCP protocol.
SkyworkAI/Mureka-mcp: Facilitates the creation of lyrics, songs, and background music through an MCP server, enabling seamless integration with platforms like Claude Desktop and OpenAI Agents.
joshmouch/mcp-image-generator: Facilitates image generation, editing, and variation creation using OpenAI's DALL-E API.
omergocmen/json2video-mcp-server: Facilitates video creation and status monitoring through the json2video API, enabling seamless integration with LLMs and automation agents.
echozyr2001/ali-flux-mcp: Facilitates image generation and management using Alibaba Cloud's DashScope API, with task tracking and local storage capabilities.
c-rick/jimeng-mcp: A TypeScript-based MCP server integrating Volcengine's AI image generation service, offering tools for creating images with customizable parameters and direct URL returns.
hamflx/imagen3-mcp: Harnesses Google's Imagen 3.0 for photorealistic image generation via MCP, requiring a Google Gemini API key.
SealinGp/mcp-video-extraction: Facilitates text extraction from videos and audio files across multiple platforms using OpenAI's Whisper model.
kdr/mcp-draw: Facilitates AI-driven image generation from text prompts via a standardized interface.
4kk11/mcp-gpt-image: Generates and edits images using OpenAI API, providing scalable previews and Docker integration.
antvis/mcp-server-chart: Facilitates the creation of diverse visual charts using AntV through a TypeScript-based MCP server.
HYPERVAPOR/mcp-image-processor: High-performance image processing server offering format conversion, resizing, and optimization capabilities.
MalluBeast69/gemini-img-gen-MCP: Generate images using Google's Gemini model via a dedicated MCP server.
Bigchx/mcp_3d_relief: Transform 2D images into detailed 3D relief models in STL format for 3D printing or rendering.
falahgs/mcp-3d-style-cartoon-gen-server: A server that combines 3D-style cartoon image generation with secure file system operations, leveraging Google's Gemini AI and MCP SDK.
zjf2671/hh-mcp-comfyui: Facilitates image generation through natural language commands by interfacing with a local ComfyUI instance via the MCP protocol.
mario-andreschak/mcp_video_recognition: Facilitates image, audio, and video recognition using Google's Gemini AI.
Flyworks-AI/lipsync-mcp: Facilitates fast and free lipsync video creation for digital avatars using the Flyworks API.
bads1de/youtube-mp3-mcp: Facilitates the extraction of high-quality MP3 audio from YouTube URLs with seamless Claude Desktop integration.
jyjune/mcp_vms: Facilitates integration with CCTV systems by providing tools to access and control video streams and PTZ cameras.
nguyendinhsinh361/elevenlabs-mcp: Facilitates interaction with ElevenLabs' Text to Speech and audio processing APIs, enabling MCP clients to generate speech, clone voices, and transcribe audio.
tjh19971228/mcp_video_analysis: Facilitates video content analysis and mind map generation using the Model Context Protocol.
intsig-textin/textin-mcp: TextIn MCP Server facilitates text extraction and OCR on documents, supporting recognition and conversion to Markdown format.
PixVerseAI/PixVerse-MCP: Enables seamless interaction with PixVerse's AI video generation models through applications supporting the Model Context Protocol.
falahgs/mcp-minimax-music-server: Facilitates AI-driven music and audio content creation through the MiniMax Music API, seamlessly integrating with Claude Desktop.
morim3/mcp_adobe_premiere: Facilitates LLM control over Adobe Premiere Pro via an MCP server and UXP plugin integration.
savethepolarbears/google-photos-mcp: Facilitates AI assistants' access to Google Photos, enabling photo search and retrieval by content, date, and location.
luebken/playlist-mcp: Provides access to YouTube playlist transcripts via an experimental MCP server, with initial support for KubeCon London 2025 content.
sandst1/mcp-server-midi: Facilitates the transmission of MIDI sequences from an LLM to any MIDI-compatible software, enabling seamless integration with digital audio workstations and hardware synthesizers.
falahgs/image-gen3-google-mcp-server: Harness Google's Imagen 3.0 model via the Gemini API for high-quality image generation, seamlessly integrating with Claude Desktop and other MCP-compatible hosts.
mang0cola/watermark-mcp-server: Enhance images by adding customizable text or image watermarks using a simple MCP server.
falahgs/flux-imagegen-mcp-server: A specialized server for generating and manipulating images using Pollinations AI, compliant with the Model Context Protocol.
nikmaniatis/Pd-MCP-Server: Facilitates dynamic interaction between Claude AI and Pure Data for real-time music creation through natural language commands.
ZHANGYA0/mcp-server-iqiyi: Facilitates access to iQiyi's latest and trending video APIs using the FastMCP framework.
MichaelYangjson/mcp-ghibli-video: A TypeScript-based server offering AI-driven image and video generation with Ghibli-style animations.
undertaker86001/mcp-process-pdf: A robust MCP server for processing PDF documents with features like text extraction, image optimization, and intelligent classification using deep learning.
Synohara/supercollider-mcp: Facilitates the execution of SuperCollider synths using supercolliderjs through an MCP server.
slot181/sd-image-gen-mcp: Provides text-to-image generation using Stable Diffusion WebUI API with optional Cloudflare ImgBed integration.
led-ray/mcp-voicevox: Facilitates AI agents in reading aloud text using the VOICEVOX engine with customizable voice parameters and dialogue formats.
coderjun/shaka-packager-mcp-server: Integrates Shaka Packager with Claude AI for video transcoding, packaging, and analysis, enhancing media processing capabilities.
rjn32s/mcp-ocr: A robust OCR server leveraging MCP to extract text from images using Tesseract, supporting various input types and languages.
An-3/mcp-audacity: Facilitates remote control of Audacity through MCP endpoints using named pipes and the mod-script-pipe interface.
Anna-Pinewood/mcp-pallete: Generates color palettes from images using an MCP server with IMAGGA integration.
shuntagami/elevenlabs-scribe-transcriber-ts: A TypeScript tool for transcribing audio and video files using the ElevenLabs Scribe model, with MCP server capabilities for integration with clients like Claude Desktop.
mcai/podcast-tts-mcp: Generate multilingual podcast conversations with alternating male and female voices using Microsoft Edge's TTS technology.
HANON-games/midi-control-mcp: Facilitates MIDI message transmission to output devices via a TypeScript-based MCP server.
grentank/mcp-server-gigachat-image-generation: Facilitates image generation through a Dockerized MCP server with Gigachat integration.
mjpstj12/abletonOsc_mcp: Facilitates communication between LLMs and Ableton Live using OSC for music production and session manipulation.
xhiroga/blender-mcp-senpai: Facilitates integration with Blender by indexing documents, searching with DuckDB-VSS, and utilizing RAG through MCP.
slot181/gemini-image-generator-mcp: Generate and transform high-quality images from text prompts using Google's Gemini model, with features like intelligent filename generation and optional image uploading.
andyanalog/mcp-ffmpeg-livestream-aws: A toolkit for FFmpeg video processing and live streaming with AWS integration.
mystique920/together-flux-mcp: Facilitates image generation using the Together AI API with customizable parameters for model, dimensions, and diffusion steps.
302ai/302_image_mcp: Facilitates image processing tasks through an MCP server, integrating seamlessly with Claude Desktop.
karoterra/aviutl-mcp: Facilitates the control of AviUtl through MCP, enabling seamless integration with various MCP hosts like Claude Desktop.
slot181/tts-mcp: A command-line tool and MCP server for generating high-quality text-to-speech using the OpenAI TTS API, supporting multiple voice options and output formats.
Yoshino-Yukitaro/imagemagick_mcp_server: Facilitates image processing tasks using ImageMagick through an MCP server setup.
yhwancha/mcp-spotify: A TypeScript starter template for building MCP servers with a type-safe development environment and sample tool implementation.
kachiO/mlx-whisper-mcp: Facilitates audio transcription and YouTube video transcription using MLX Whisper on Apple Silicon Macs.
instructa/mcp-youtube-music: Facilitates searching and playing YouTube Music tracks through AI assistants like Cursor or Claude Desktop.
shivammaurya042/mcp-render: Effortlessly manage your Render.com account with this MCP server, enabling seamless deployment management and environment variable handling through your preferred MCP client.
hanqizheng/media_kit_mcp_server: Facilitates the parsing of Media Kit PDF files for key content extraction in the advertising and marketing sector.
djbriane/plex-mcp: Integrates with Plex Media Server API to manage movies and playlists using a Python-based MCP server.
GoatWang/YTTranscipterMultilingualMCP: Transcribe YouTube videos into multiple languages using a multilingual MCP server.
tubone24/midi-mcp-server: Enables AI models to generate MIDI files from text-based music data, facilitating programmatic musical composition through a standardized interface.
grafikogr/freepik-mcp-server: Generates images from text descriptions using Freepik's Flux AI service, integrated with Claude Desktop.
diivi/aseprite-mcp: Facilitates interaction with the Aseprite API through an MCP server, enabling automated graphic manipulations.
Henry-The-Yang-101/spotify_mcp: Connects to the Spotify Web API to provide playback and search functionalities to AI clients via MCP.
palvindersander/youtube-transcript-mcp: Fetches YouTube video transcripts and metadata for seamless integration with Claude, including fact-checking capabilities.
minbang930/Youtube-Vision-MCP: Utilizes the Google Gemini Vision API to analyze and interact with YouTube videos, offering features like summarization, Q&A, and key moment extraction.
fl0w1nd/grok2-image-mcp-server: Facilitates image generation using the Grok-2 model via the MCP protocol for chat assistants.
x007xyz/image-process-mcp-server: Facilitates image manipulation tasks such as resizing, format conversion, cropping, and rotation using the Sharp library.
demon24ru/video-transcribe-mcp: Integrates with optivus to provide video transcription capabilities for LLMs, supporting platforms like YouTube, Facebook, and Tiktok.
demon24ru/fish-speech-mcp: Facilitates text-to-speech synthesis for LLMs with FishSpeech integration.
R-lz/mcp-video-digest: Facilitates video transcription and summarization from platforms like YouTube and Bilibili using multiple transcription services.
longbowzz/svg2png_mcp: Facilitates SVG to PNG conversion using CairoSVG or Inkscape, integrated with MCP protocol for seamless client interaction.
MCPJam/mcpjam-spotify: Facilitates seamless integration with Spotify through an MCP server developed by the MCPJam team.
madpharmy/dalle: Facilitates image generation, editing, and variation creation using OpenAI's DALL-E API.
richbai90/spotify-mcp: Integrates with Spotify API to manage and create playlists through Claude.
wheattoast11/mcp-video-gen: Facilitates video and image generation using RunwayML and Luma AI APIs, with capabilities to enhance prompts via OpenRouter LLMs.
IA-Entertainment-git-organization/youtube-video-summarize: Summarizes YouTube videos by extracting and analyzing transcripts using various algorithms, including OpenAI's API for high-quality summaries.
nabid-pf/youtube-video-summarizer-mcp: Enables Claude to fetch and summarize YouTube videos by extracting titles, descriptions, and transcripts.
jlevy/deep-transcribe: Facilitates deep transcription of video and audio content with speaker identification and annotations, operable as an MCP server.
TomokiIshimine/svg-render-mcp: Transforms SVG strings into PNG images with customizable dimensions and background colors.
wanga90/blender-mcp: BlenderMCP enables Claude AI to interact with Blender for enhanced 3D modeling and scene manipulation through the Model Context Protocol.
vidau-ai/asr_mcp_server: Provides ASR capabilities using the whisper engine and exposes TTS functionality for seamless speech synthesis integration.
abhiemj/manim-mcp-server: Facilitates the execution of Manim animation scripts and returns the generated video through an MCP server interface.
tkoba1974/mcp-kroki: Transforms various diagram formats into images using Kroki.io, supporting formats like Mermaid and PlantUML.
Nazruden/mcp-openvision: Leverage OpenRouter vision models to enable AI-driven image analysis within the MCP ecosystem.
thuhoai27/mcp-image-reader: ImageReader converts local image files to Base64-encoded data for AI model analysis.
13rac1/videocapture-mcp: Enables AI assistants to control webcams and capture images using OpenCV.
infinitimeless/radiofrance-podcast-explorer-mcp: Facilitates AI-driven exploration and access to Radio France's podcasts and audio content through a Model Control Protocol server.
frankdeno/flux-image-generator-mcp: Generate images from text prompts using the FLUX model with customizable settings and batch processing capabilities.
GaoCan702/mcp-gererate-image: Facilitates AI-driven image generation requests from Cursor using Cloudflare's Flux AI model, with local or temporary storage options.
video-creator/ffmpeg-mcp: Facilitates local video search, editing, and playback using FFmpeg commands through an MCP server interface.
sanxfxteam/gemini-mcp-server: Facilitates image generation using Google's Gemini 2 API through an MCP server interface.
champierre/image-mcp-server: Analyzes image content from URLs or local paths using the GPT-4o-mini model, providing high-precision recognition and description.
Ichigo3766/image-gen-mcp: Facilitates text-to-image generation using Stable Diffusion WebUI API, enabling creative visual outputs from textual prompts.
Ichigo3766/audio-transcriber-mcp: Facilitates audio transcription using OpenAI's Whisper API, enabling seamless integration with MCP environments.
rakeshgangwar/tmdb-mcp-server: Facilitates AI-driven movie searches and information retrieval via The Movie Database API using the MCP interface.
rocksun/media-mcp-server: Facilitates multimedia metadata management through a RESTful API using the FastMCP framework.
yasar38/BLENDER-MCP-CURSOR-: Facilitates AI-driven 3D modeling in Blender through seamless integration with Cursor using the Model Context Protocol.
sinco-lab/mcp-youtube-transcript: Facilitates the extraction and processing of YouTube video transcripts for content analysis.
blacktop/mcp-say: Facilitates text-to-speech capabilities using macOS 'say' command and ElevenLabs API for integration with MCP protocol-based applications.
tijs/py-sound-mcp: Enhances coding environments with audio feedback for events using the Model Context Protocol, compatible with Cursor and other IDEs.
Johnnyhtw/audio2txt_mcp: Transforms meeting audio recordings into text using Whisper, integrating with CLINE for AI-generated summaries.
abhishekjairath/sonic-pi-mcp: Facilitates AI-driven musical creation by enabling AI assistants to control Sonic Pi through OSC messages.
crazyrabbitLTC/mcp-vibecoder: Facilitates structured LLM-based coding workflows with feature clarification, task tracking, and document management.
ejfox/vulpes-spotify-mcp: Facilitates AI assistants in interacting with Spotify for track search and playback.
surferdot/mcp-svg-converter: Transforms SVG code into high-quality PNG and JPG images with customizable options, ensuring secure file handling and integration with Claude Desktop.
vinayak-mehta/mcp-sonic-pi: Connects MCP clients with Sonic Pi to create music using English commands.
IA-Programming/mcp-images: Enterprise-grade image processing server offering tools for fetching and processing images from various sources, ideal for AI applications and data pipelines.
royshil/obs-mcp: Control OBS Studio through an MCP server using the OBS WebSocket protocol for scene management, source control, and streaming operations.
yoavniran/cloudinary-mcp-server: Facilitates AI-driven interactions with Cloudinary by exposing its Upload & Admin API methods as callable tools.
bjkemp/mcp-midi: Facilitates MIDI device interaction through natural language commands, enabling control over synthesizers, note playback, and instrument changes.
BigUncle/Fast-Whisper-MCP-Server: High-performance speech recognition server leveraging Faster Whisper for efficient audio transcription.
flipsai/davinci-resolve-mcp: Facilitates AI-driven video editing and color grading in DaVinci Resolve through Claude AI integration using the Model Context Protocol.
hammeiam/koroko-speech-mcp: Provides high-quality text-to-speech capabilities using the Kokoro TTS model with customizable voice and speed options.
JavaProgrammerLB/unsplash-mcp-server: Facilitates image searches on Unsplash using a Java-based MCP server.
Tooflex/davinci-resolve-mcp: Facilitates AI-driven control over DaVinci Resolve Studio for advanced editing and media management.
obre10off/spotify-mcp: Control Spotify playback using natural language commands through an MCP client.
ahujasid/ableton-mcp: Connects Ableton Live to Claude AI for interactive music production and session control via the Model Context Protocol.
marcelmarais/spotify-mcp-server: A lightweight server enabling AI assistants to control Spotify playback and manage playlists.
rocksun/gemini-image-mcp-server: Provides AI-driven image generation and editing services using the MCP protocol with Google Gemini technology.
okooo5km/unsplash-mcp-server-swift: Swift-based server enabling LLMs to search and retrieve photos from Unsplash with advanced filtering options.
samuelgursky/davinci-resolve-mcp: Connects AI coding assistants to DaVinci Resolve, enabling natural language control and querying.
mfleurival/FFmpeg: Facilitates video and audio processing through FFmpeg with capabilities like video trimming, frame extraction, and audio segmentation.
Rupeebw/mcp-image-reader: A TypeScript-based MCP server that facilitates the creation, management, and summarization of text notes using URIs and metadata.
Garblesnarff/gemini-mcp-server: Facilitates image generation on Claude Desktop using Google's Gemini AI models.
eetumartola/houdini-mcp: Facilitates seamless interaction between Claude AI and Houdini for enhanced 3D modeling and scene manipulation.
elliotxx/favicon-mcp-server: Transforms SVG images into ICO and PNG favicon formats for web applications, supporting seamless integration with LLM-powered tools.
Ut13158/blender-mcp: BlenderMCP enables seamless interaction between Blender and Claude AI for enhanced 3D modeling and scene manipulation through the Model Context Protocol.
hugohow/mcp-music-analysis: Leverage librosa and Whisper to analyze music audio through an MCP server, integrating seamlessly with LLMs for enhanced audio insights.
xue160709/yt-mcp-server: A Model Context Protocol server utilizing the mcp-framework for tool development and integration with Claude Desktop.
MCERQUA/freepik-mcp: Facilitates interaction with Freepik's API for accessing stock photos and generating images using Mystic AI.
zaptrem/music-mcp: Facilitates music generation through text prompts using Sonauto's API, integrated with Claude Desktop.
SIAM-TheLegend/sound-mcp: A lightweight MCP server that remotely triggers customizable notification sounds via AI agents using npx.
androidStern/ableton-vibe: Facilitates MIDI track creation in Ableton Live through MCP server integration.
GMKR/mcp-imagegen: Facilitates image generation using Together AI's models via a configurable MCP server.
bitscorp-mcp/mcp-ffmpeg: A Node.js server leveraging FFmpeg to provide APIs for video resizing and audio extraction, seamlessly integrating with Claude Desktop for natural language video processing.
zym9863/pollinations-ai-image-server: A TypeScript-based server for generating images using Pollinations AI, featuring integration with Claude Desktop.
everaldo/mcp-mistral-ocr: Facilitates OCR processing of images and PDFs using Mistral AI's API, with support for local and URL-based files.
GongRzhe/Audio-MCP-Server: Facilitates audio input/output for AI assistants, enabling interaction with computer audio systems for recording and playback.
VxASI/blender-mcp-vxai: Blender MCP VXAI enables seamless natural language control over Blender for creating and manipulating 3D models and animations, integrating AI-driven automation for enhanced creative workflows.
CLOUDWERX-DEV/DiffuGen: DiffuGen offers a seamless interface for generating images using Stable Diffusion models, integrating directly with IDEs via MCP for efficient development workflows.
tomcat65/flux-images-mcp: A server for generating, modifying, and inpainting images using Replicate's Flux models, with features for obtaining image URLs and browsing stored images.
MushroomFleet/TranscriptionTools-MCP: Enhances transcript processing with intelligent formatting, error repair, and summarization using advanced LLMs.
zym9863/elevenlabs-sound-effect-server: Generates sound effects from text descriptions using ElevenLabs API, saving them as MP3 files.
AngeloGiacco/elevenlabs-mcp: Exposes OpenAPI-defined ElevenLabs endpoints as MCP resources for seamless integration with Claude Desktop.
Krekun/mcp-voicevox: Facilitates text-to-speech conversion using VoiceVox through an MCP server, enabling Claude to generate diverse audio outputs.
Handwriting-OCR/handwriting-ocr-mcp-server: Facilitates integration between MCP clients and the Handwriting OCR service for document transcription and retrieval.
haltakov/meme-mcp: Generate memes from user prompts using the ImgFlip API with this MCP server.
okooo5km/zipic-mcp-server: Swift-based server enabling AI assistants to compress and optimize images using advanced tools and batch processing capabilities.
bendusy/pollinations-mcp: Connects AI models to Pollinations.ai for image and text generation via MCP protocol.
gregkop/sketchfab-mcp-server: Facilitates seamless interaction with Sketchfab's 3D model platform, enabling search, detailed viewing, and downloading of models.
peng-shawn/mermaid-mcp-server: Transforms Mermaid diagram code into PNG images using Puppeteer for seamless AI integration.
rmcendarfer2017/MCP-image-gen: Connects to Replicate's API to generate and manage images with customizable prompts and styles.
giannisanni/kokoro-tts-mcp: Facilitates text-to-speech synthesis using the Kokoro TTS engine, enabling seamless integration of speech capabilities into applications via MCP tools.
supercurses/powerpoint: Facilitates the creation and editing of PowerPoint presentations with dynamic slide generation and multimedia integration.
Tomocrystal/generate_image-mcp-server: A server implementation for generating images using Model Context Protocol, compatible with OpenAI DALL-E and other large language model APIs.
rhish9h/image-filter-mcp: Provides 10 image filters using Python and Pillow, integrating seamlessly with Claude Desktop.
setkyar/youtube-subtitles-mcp: Facilitates AI assistants in downloading and analyzing YouTube video subtitles with seamless integration capabilities.
DaInfernalCoder/unsplash-server: Integrates Unsplash image search and retrieval with AI assistants, enabling seamless access to high-quality images.
DeepSRT/deepsrt-mcp: Facilitates YouTube video summarization through DeepSRT API integration, offering narrative and bullet-point summaries with multi-language support.
manascb1344/together-mcp-server: Enables high-quality image generation using the Flux.1 Schnell model via Together AI.
Simon-Kansara/ableton-live-mcp-server: Facilitates communication between LLMs and Ableton Live using OSC for seamless music production control.
hirokidaichi/imark: A versatile CLI tool that leverages AI for image recognition, cataloging, and generation, with MCP server capabilities for enhanced image processing.
ckz/flux-schnell-mcp: Generates images using the Flux Schnell model via Replicate API with customizable prompts.
index01d/ytrnscrpt-mcp-server: Facilitates Claude's ability to retrieve and analyze YouTube video transcripts through an MCP server.
jkawamoto/mcp-florence2: Processes images and PDFs to extract text or generate descriptive captions using Florence-2.
glifxyz/glif-mcp-server: Facilitates AI workflows by managing glifs and bots, offering customizable tools and metadata access through MCP.
maoxiaoke/mcp-media-processor: Node.js server for advanced video and image processing using MCP, featuring conversion, compression, and editing capabilities.
lalanikarim/comfy-mcp-server: Utilizes the FastMCP framework to generate images from prompts by interfacing with a remote Comfy server.
husniadil/mcp-image-placeholder: Generates placeholder images from multiple providers, offering seamless integration with tools like Claude for Desktop and Cursor.
dandeliongold/mcp-decent-sampler-drums: Generates DecentSampler drum kit configurations with WAV analysis and XML generation tools.
kennethreitz/mcp-applemusic: Control Apple Music on macOS using AppleScript commands via a FastMCP server.
jkawamoto/mcp-youtube-transcript: Retrieve transcripts from YouTube videos using a Model Context Protocol server.
mondweep/youtube-music-mcp-server: Facilitates AI-driven YouTube Music playback through Chrome, enabling song search and play by song or artist name.
superseoworld/mcp-spotify: ArtistLens provides seamless access to Spotify's music catalog, enabling advanced search and retrieval of artist and album information through a robust MCP server.
catalystneuro/mcp_read_images: Facilitates image analysis using OpenRouter vision models with configurable model selection and automatic image optimization.
felores/placid-mcp-server: Integrate with Placid.app's API to dynamically generate images and videos using customizable templates.
Here-and-Tomorrow-LLC/audio-player-mcp: Enables Claude to manage audio playback on your computer through a Model Context Protocol server.
felores/cloudinary-mcp-server: Facilitates media uploads to Cloudinary via Claude Desktop and compatible MCP clients.
truaxki/mcp-Pdf2png: Facilitates the conversion of PDF documents into PNG images using a Model Context Protocol server.
Kush36Agrawal/Video_Editor_MCP: A robust video editing server utilizing FFmpeg for executing video operations via natural language commands.
tadasant/mcp-server-stability-ai: Integrates MCP Clients with Stability AI for advanced image manipulation, including generation, editing, and upscaling.
apinetwork/piapi-mcp-server: A TypeScript-based MCP server that enables media content generation through PiAPI's API, integrating seamlessly with platforms like Midjourney and Claude.
zxkane/mcp-server-amazon-bedrock: Integrates Amazon Bedrock's Nova Canvas model for generating high-quality images from text descriptions with advanced control options.
punkpeye/webperfect-mcp-server: WebPerfect leverages AI to transform images into web-optimized masterpieces with advanced processing techniques and batch automation.
kamekamek/runway-vedeo-server: Transform images into videos using RunwayAPI with customizable prompts for dynamic video generation.
sparfenyuk/mcp-youtube: Facilitates AI assistants' interaction with YouTube by bridging the YouTube API and AI tools for tasks like downloading captions and summarizing videos.
spencerhhubert/illustrator-mcp-server: Facilitates script execution on Adobe Illustrator via an MCP server, leveraging JavaScript and AppleScript for MacOS compatibility.
burningion/video-editing-mcp: Facilitates video editing and management through an MCP interface, integrating with Video Jungle for uploading, editing, and searching videos using LLMs.
angheljf/dalle-image-server: A TypeScript-based server that generates images using DALL·E 2 from text prompts, requiring an OpenAI API key.
sammyl720/image-generator-mcp-server: Generates images from prompts using OpenAI's DALL-E-3 model, with TypeScript implementation for seamless integration with Claude Desktop.
kimtaeyoon83/mcp-server-youtube-transcript: Facilitates direct transcript retrieval from YouTube videos with language-specific options.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

🖼️ Multimedia Processing

FilesExpand file tree

multimedia-processing.md

Latest commit

History

multimedia-processing.md

File metadata and controls

🖼️ Multimedia Processing