Video editing
for AI agents and humans.

82 MCP tools that give Claude Code, Cursor, and any AI agent the ability to trim, merge, overlay text, sync audio, apply filters, stabilize video, detect scenes, generate subtitles, extract waveforms, upscale with AI, separate audio stems, and more. Plus a rich CLI with templates for humans. Local, fast, free. All 690+ features tested with real media.

690+ tests passing
82 MCP tools
3 interfaces
View on GitHub pip install mcp-video

81-second overview of mcp-video features — created with Remotion + mcp-video


The Problem
AI agents can't edit video. Until now.

AI agents can write code, analyze documents, browse the web, create images — but video editing? Existing tools are either GUI-only (agents can't click buttons), raw FFmpeg wrappers (agents can't memorize hundreds of flags), or cloud APIs (expensive, slow, vendor lock-in). mcp-video bridges this gap.

82 Video Editing Tools

Trim, merge, add text, sync audio, resize, convert, speed change, watermarks, subtitles, storyboards, crop, rotate, fade, filters, color grading, audio processing, stabilization, scene detection, AI-powered editing, transitions, and more.

Local & Private

Runs on your machine. No cloud uploads, no API keys, no per-minute billing. Your video never leaves your computer.

Auto-Fix Error Handling

Parses FFmpeg errors into structured responses with actionable suggestions. "Codec error: vp9" becomes "Auto-convert from vp9 to H.264/AAC."

Filters & Color Grading

Blur, sharpen, brightness, contrast, saturation, grayscale, sepia, invert, vignette, Ken Burns animation. Plus color presets: warm, cool, vintage, cinematic, noir.

Audio Processing Suite

Normalize to LUFS, add reverb, compress, pitch shift, reduce noise. Extract waveforms with peak detection and silence region analysis.

Composition Tools

Picture-in-picture, split-screen, masks with feathering, and multi-track timeline editing for complex projects.

Video Analysis

Auto scene detection with threshold control, quality metrics (PSNR/SSIM) comparison, and comprehensive metadata editing.

Stabilization & Effects

Stabilize shaky footage with motion vectors. Create videos from image sequences. Generate and burn subtitles from text.

Batch Processing

Apply the same operation to multiple files in one call. Trim 10 videos, blur 5, or convert a whole directory at once.

Validation Before Execution

Filter types, color presets, and parameters are validated before hitting FFmpeg. Get clear error messages, not cryptic KeyError tracebacks.


AI-Powered
AI-Powered Editing

Machine learning features for intelligent video processing.

Silence Removal

Auto-remove dead air from audio tracks with configurable threshold and padding.

Transcription

Whisper-powered speech-to-text with automatic subtitle generation and SRT export.

Scene Detection

ML-enhanced cut detection for automatic scene boundary identification.

Stem Separation

Isolate audio components — vocals, drums, bass, and other instruments.

AI Upscale

Super-resolution upscaling for enhancing video resolution with ML models.

Color Grading

Auto color correction with ML-powered scene analysis and LUT application.

Spatial Audio

3D audio positioning for immersive surround sound experiences.


Effects
Video Transitions

Professional transitions for seamless clip connections.

Glitch

RGB shift effects for digital distortion transitions.

Pixelate

Block dissolve transition with configurable pixel size.

Morph

Mesh warp transitions for smooth shape-shifting effects.


Audio
Procedural Audio

Generate sound effects from code without external audio files.

Waveform Generation

Generate sine, square, sawtooth, and noise waveforms programmatically.

Audio Presets

18+ ready-to-use synthesized sounds: beeps, boops, sweeps, and more.

Sequencing

Time-based composition for building complex audio sequences.

Effects Chain

Apply reverb, filter, normalize, and other effects to synthesized audio.


Why mcp-video
Not just another FFmpeg wrapper.

"Can't I just have Claude Code run ffmpeg commands?" Sure — if you enjoy debugging 200-character filter graphs, wrestling with flag order semantics, and paying dollars per short clip on cloud APIs. Here's why mcp-video is different.

mcp-video Claude Code + FFmpeg Cloud APIs
Cost Free, local Free, but your time isn't $0.28–$2.50 per 5s video
Privacy 100% local 100% local Uploads to cloud
Error messages Structured + auto-fix FFmpeg stderr (cryptic) HTTP errors
Parameter validation Before execution Only at runtime API schema check
Text overlay add_text(text="Hi") -vf "drawtext=text='Hi':fontfile=...:fontsize=24:x=(w-text_w)/2:y=h-50" Varies by API
Speed change Handles audio-less videos Fails if no audio stream Usually handled
Discovery 82 self-documenting tools Must know hundreds of flags API docs
Testing 690+ real media tests None (ad-hoc) Provider's tests
The real problems with raw FFmpeg

Flag order changes behavior silently

Put -ss 10 before -i input.mp4 and FFmpeg seeks instantly. Put it after, and it decodes every frame up to 10s first. Same flags, completely different performance. Claude Code can't know which you meant.

Cryptic error messages

Try adding audio to a silent video and you get: "Stream specifier ':a' matches no streams." No hint that the video has no audio. mcp-video checks before running and gives you a clear error.

Intermediate files = quality loss

Every FFmpeg call re-encodes. Chain 5 operations and you've re-encoded 5 times. mcp-video's timeline DSL composes operations into a single FFmpeg command where possible.

Claude forgets between commands

In a 10-step workflow, Claude must re-discover file paths, resolutions, and durations at every step. mcp-video returns structured results (path, duration, resolution) so the agent always has context.

Why not cloud APIs?

Expensive at scale

Replicate charges $0.28–$0.50 per second of generated video. A 60-second clip costs $17–$30. RunwayML's Pro plan gives you ~90 seconds of Gen-4.5 for $28/month. mcp-video: $0, forever.

Vendor lock-in

Your workflow depends on their API staying up, their pricing not changing, and their model versions not breaking your prompts. With mcp-video, FFmpeg runs on your machine — no API deprecation risk.


Capabilities
82 tools. Every operation you need.

Every tool returns structured JSON. On success, you get output paths, duration, resolution. On failure, you get error types with auto-fix suggestions.

ToolWhat It Does
video_infoGet metadata: duration, resolution, codec, fps, file size
video_trimTrim clip by start time + duration or end time
video_mergeConcatenate multiple clips with optional transitions
video_add_textOverlay text, titles, captions with custom positioning
video_add_audioAdd, replace, or mix audio tracks with fade effects
video_resizeChange resolution or apply preset aspect ratios
video_convertConvert between mp4, webm, gif, mov (with two-pass encoding)
video_speedSpeed up or slow down (slow-mo, time-lapse)
video_thumbnailExtract a single frame at any timestamp
video_previewGenerate fast low-res preview for quick review
video_storyboardExtract key frames + create grid for review
video_subtitlesBurn SRT/VTT subtitles into the video
video_generate_subtitlesCreate SRT subtitles from text entries, optionally burn in
video_watermarkAdd image watermark with opacity control
video_exportRender final video with quality presets
video_editFull timeline-based edit (JSON DSL for complex edits)
video_extract_audioExtract audio as mp3, wav, aac, ogg, or flac
video_cropCrop to rectangular region with custom offset
video_rotateRotate 90/180/270 degrees or flip horizontal/vertical
video_fadeAdd fade in/out effects to video
video_filterApply visual filter (blur, sharpen, grayscale, ken_burns, and more)
video_blurBlur video with custom radius and strength
video_color_gradeApply color preset (warm, cool, vintage, cinematic, noir)
video_normalize_audioNormalize loudness to LUFS target (YouTube, broadcast, Spotify)
video_audio_waveformExtract audio waveform data (peaks and silence regions)
video_audio_reverbAdd echo/reverb effect to audio
video_audio_compressorDynamic range compression for balanced audio
video_audio_pitch_shiftChange audio pitch while maintaining duration
video_audio_noise_reductionRemove background noise from audio tracks
video_overlayPicture-in-picture overlay with positioning and opacity
video_split_screenSide-by-side or top/bottom layout for two videos
video_stabilizeStabilize shaky footage using motion vectors (requires vidstab)
video_apply_maskApply image mask with edge feathering for compositing
video_detect_scenesAutomatically identify scene changes in videos
video_create_from_imagesCreate videos from image sequences
video_export_framesExport video as image frames
video_compare_qualityCompare PSNR/SSIM quality metrics between videos
video_read_metadataRead video metadata tags
video_write_metadataWrite video metadata tags
video_batchApply same operation to multiple files at once

Three Interfaces
However you want to use it.

mcp-video works as an MCP server for AI agents, a Python library for scripts, and a rich CLI tool for the terminal with human-friendly output and templates.

MCP Server (for AI agents)

Add to your Claude Code or Cursor MCP config. Then just talk to your agent.

{
  "mcpServers": {
    "mcp-video": {
      "command": "uvx",
      "args": ["mcp-video"]
    }
  }
}

Then: *"Trim this video from 0:30 to 1:00, add a title, and export for TikTok."*

Python Client

Clean API for automation, pipelines, and batch processing.

from mcp_video import Client

editor = Client()
clip = editor.trim("v.mp4", start="0:30", duration="15")
final = editor.resize(clip.output_path, aspect_ratio="9:16")
result = editor.export(final.output_path)
print(result.resolution)  # 1080x1920

CLI Tool Rich output · Spinners · Templates

Human-friendly terminal output by default. Add --format json for scripts and piping.

# Rich table output (default)
$ mcp-video info video.mp4
# ┌─────────────┬──────────────────┐
# │ Property    │ Value            │
# ├─────────────┼──────────────────┤
# │ Duration    │ 45.23s           │
# │ Resolution  │ 1920x1080        │
# │ FPS         │ 30               │
# └─────────────┴──────────────────┘

# JSON output for scripts
$ mcp-video --format json info video.mp4

# Trim with progress spinner
$ mcp-video trim video.mp4 -s 0:30 -d 15

# Apply a TikTok template
$ mcp-video template tiktok video.mp4 --caption "Check this out!"

# List all templates
$ mcp-video templates

Timeline DSL
Complex edits in a single JSON object.

Describe multi-track edits with video clips, audio, text overlays, transitions, and export settings — all in one call.

editor.edit({
    "width": 1080, "height": 1920,
    "tracks": [
        {
            "type": "video",
            "clips": [
                {"source": "intro.mp4", "duration": 5},
                {"source": "main.mp4", "trim_start": 10, "duration": 30},
            ],
            "transitions": [{"after_clip": 0, "type": "fade", "duration": 1.0}],
        },
        {
            "type": "audio",
            "clips": [{"source": "music.mp3", "volume": 0.7}],
        },
        {
            "type": "text",
            "elements": [{"text": "EPISODE 42", "position": "top-center"}],
        },
    ],
    "export": {"format": "mp4", "quality": "high"},
})

Templates
One function per platform.

Pre-built templates handle the correct dimensions, text positioning, and export settings for each social media platform.

TikTok

9:16, 1080x1920. Caption at bottom-center. Optional background music at volume 0.5.

YouTube Shorts

9:16, title at top-center with size 42. Same format as TikTok but different text placement.

Instagram Reel

9:16, caption at bottom-center. Clean, simple template.

YouTube Video

16:9, 1920x1080. Supports title card (3s), outro clip, and background music.

Instagram Post

1:1, 1080x1080. Square format with optional caption overlay.

Custom

Build your own with the Timeline DSL. Any dimensions, any number of tracks.


Quality
690+ tests passing. Real media, no mocks.

Every feature tested with actual FFmpeg operations on real video files. No mocks, no stubs — just genuine video processing validation. From AI upscaling to video stabilization, all 690+ features verified with real media.

690+
Comprehensive real media tests
All 690+ features tested with real media
8
Test categories
Core, Audio, Effects, Transitions, AI, Layout, Quality, Utility
100%
Feature coverage
AI features, stabilization, stem separation

Test Categories

Core Video Editing 18/18 ✓
Audio Features 10/10 ✓
Visual Effects 8/8 ✓
Transitions 3/3 ✓
AI Features 8/8 ✓
Layout & Composition 8/8 ✓
Quality & Metadata 8/8 ✓
Utility 7/7 ✓

AI Features tested: Scene detection, silence removal, transcription (Whisper), stem separation (Demucs), AI upscale (OpenCV DNN), color grading, spatial audio, color extraction
Dependencies installed: FFmpeg with vidstab, demucs, torchcodec, opencv-contrib-python


Get Started
Up and running in 60 seconds.
1. Install FFmpeg
2. pip install mcp-video
3. Add to MCP config
4. Start editing
# 1. Install FFmpeg (if you don't have it)
brew install ffmpeg

# For full text overlay support (drawtext filter):
# brew install freetype harfbuzz
# brew reinstall --build-from-source ffmpeg
# Verify: ffmpeg -filters | grep drawtext

# 2. Install mcp-video
pip install mcp-video

# 3. Add to your Claude Code MCP config
# (Settings > MCP Servers > Add > mcp-video)

# 4. Start editing!
# Just tell your agent what you want:
# "Trim this video to 30 seconds and add a title"