video-use is an open-source skill for Claude Code (and other coding agents) that edits raw video footage via natural language. You drop footage in a folder, describe what you want, and the agent produces final.mp4 — cutting filler words, color grading, burning subtitles, and generating animation overlays. It is built by the browser-use team and has 11.6k GitHub stars.

Does video-use require Premiere Pro or any video editor?

No. video-use uses ffmpeg (open source, CLI) for all rendering. You never open a video editor. The editing decision layer is the LLM — the actual cut execution is ffmpeg commands generated from the LLM's edit decision list (EDL).
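To make the EDL-to-ffmpeg step concrete, here is a minimal sketch of how one EDL entry could be turned into an ffmpeg argument list. The `Segment` type and `ffmpeg_cut_args` function are hypothetical names for illustration, not video-use's actual helpers; the ffmpeg options used (`-ss`, `-t`, `-i`, `-c:v`, `-c:a`) are standard.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    """One kept range of a source clip (hypothetical EDL entry)."""
    source: str   # path to the source clip
    start: float  # seconds into the source
    end: float    # seconds into the source


def ffmpeg_cut_args(seg: Segment, out_path: str) -> list[str]:
    """Build an ffmpeg argument list that extracts one segment.

    Re-encodes instead of stream-copying so the cut does not have to
    land on a keyframe.
    """
    if seg.end <= seg.start:
        raise ValueError("segment end must be after start")
    duration = seg.end - seg.start
    return [
        "ffmpeg", "-y",
        "-ss", f"{seg.start:.3f}",  # seek in the input to the segment start
        "-t", f"{duration:.3f}",    # read only this many seconds
        "-i", seg.source,
        "-c:v", "libx264", "-c:a", "aac",
        out_path,
    ]


if __name__ == "__main__":
    args = ffmpeg_cut_args(Segment("takes/a.mp4", 1.5, 4.0), "edit/seg_000.mp4")
    print(" ".join(args))
```

Building the command as a list, not a shell string, means it can be passed to `subprocess.run` without quoting problems when file names contain spaces.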

How does video-use transcribe footage?

video-use calls the ElevenLabs Scribe API to get word-level timestamps, speaker diarization, and audio events for every source clip. These are packed into a ~12KB markdown file that the LLM reads as its primary editing surface.

What agents does video-use work with?

Claude Code, Codex, Hermes, OpenClaw, and any other agent with shell access. The setup prompt handles skill registration for whichever agent you are using.

Is video-use free and open source?

Yes — MIT license on GitHub at github.com/browser-use/video-use. ElevenLabs Scribe requires an API key (has a free tier). ffmpeg is free. The LLM (Claude Code) requires an Anthropic subscription or API key.

How is video-use different from Premiere Pro or DaVinci Resolve?

Traditional non-linear editors (NLEs) such as Premiere Pro, DaVinci Resolve, and Final Cut Pro are timeline-based: you scrub through footage, drag clips, and apply effects. video-use is description-based: you describe the edit in plain language and the agent executes it. video-use is faster for rough cuts and filler removal; NLEs are still better for frame-precise creative control.

video-use: AI Video Editing With Claude Code 2026 | explainx.ai Blog


Adobe Premiere Pro costs $60/month and takes hundreds of hours to master. DaVinci Resolve is free but has a steep learning curve. Final Cut Pro is Mac-only and $300 upfront.

For a large class of video editing tasks — cutting filler words, removing dead air, color grading, burning subtitles, combining multiple takes into a launch video — none of that complexity is necessary anymore.

video-use is an open-source tool from the browser-use team that turns Claude Code into a video editor. You drop raw footage in a folder, describe what you want, and get edit/final.mp4 back. No timeline. No menus. No NLE.

11.6k GitHub stars in roughly two months. MIT license. Here is how it works and how to set it up.

What video-use Actually Does

video-use is a skill: a self-contained plugin that registers with Claude Code (or Codex, Hermes, OpenClaw) and gives it video editing capabilities via shell access.

It handles:

Filler word removal — cuts "umm", "uh", false starts, and dead space between takes with 30ms audio fades at every cut so there is no pop
Auto color grading — warm cinematic, neutral punch, or any custom ffmpeg chain applied per segment
Subtitle generation — burned in your style (2-word UPPERCASE chunks by default, fully configurable)
Animation overlays — generates HyperFrames, Remotion, Manim, or PIL animations in parallel sub-agents, one per animation block
Self-evaluation — renders, checks every cut boundary against the source, fixes issues and re-renders before showing you anything (max 3 tries)
Session memory — writes project.md so next week's session picks up exactly where it left off
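The filler-removal item in the list above can be sketched as follows. This is an illustration under stated assumptions, not video-use's implementation: the filler list, the `max_gap` threshold for dead air, and the function names are all hypothetical. Only the 30ms fade length comes from the description above, and `afade` is a standard ffmpeg audio filter.

```python
# Illustrative filler list; the real tool's list is not given in the article.
FILLERS = {"um", "umm", "uh", "uhh", "er"}


def keep_ranges(words, max_gap=0.6):
    """Return merged (start, end) ranges to keep.

    words: list of (text, start, end) tuples sorted by time.
    A new range starts after any filler word or any silence longer
    than max_gap seconds (an assumed threshold for dead air).
    """
    ranges = []
    cut_pending = False  # True when a filler was skipped since the last kept word
    for text, start, end in words:
        if text.lower().strip(".,!?") in FILLERS:
            cut_pending = True
            continue
        if ranges and not cut_pending and start - ranges[-1][1] <= max_gap:
            ranges[-1] = (ranges[-1][0], end)  # extend the current range
        else:
            ranges.append((start, end))        # start a new range
        cut_pending = False
    return ranges


def fade_filter(duration, fade=0.030):
    """Build an ffmpeg afade chain: fade in at the start, fade out at the end."""
    return (
        f"afade=t=in:st=0:d={fade:.3f},"
        f"afade=t=out:st={duration - fade:.3f}:d={fade:.3f}"
    )


if __name__ == "__main__":
    words = [("So", 0.0, 0.3), ("um", 0.35, 0.6), ("we", 0.65, 0.9),
             ("ship", 0.95, 1.3), ("today", 3.0, 3.4)]
    for start, end in keep_ranges(words):
        print(start, end, fade_filter(end - start))
```

Each kept range becomes one segment, and the short fade on both edges is what prevents an audible pop where two segments are joined.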

The output lands in <your_videos_dir>/edit/ — the skill directory stays clean.
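The default subtitle style mentioned in the list above (2-word UPPERCASE chunks) is simple to derive from word-level timestamps. The sketch below is an assumption about how such chunking could work; `subtitle_chunks` and `srt_time` are hypothetical names, and the SRT-style timestamp is just one common target format.

```python
def subtitle_chunks(words, size=2):
    """Group (text, start, end) words into UPPERCASE chunks of `size` words.

    Each chunk spans from the start of its first word to the end of its last.
    """
    chunks = []
    for i in range(0, len(words), size):
        group = words[i:i + size]
        text = " ".join(w[0] for w in group).upper()
        chunks.append((text, group[0][1], group[-1][2]))
    return chunks


def srt_time(seconds):
    """Format seconds as an SRT timestamp, HH:MM:SS,mmm."""
    ms = round(seconds * 1000)
    hours, rem = divmod(ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, millis = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


if __name__ == "__main__":
    words = [("ship", 0.0, 0.3), ("it", 0.3, 0.5), ("today", 0.6, 1.0)]
    for n, (text, start, end) in enumerate(subtitle_chunks(words), 1):
        print(f"{n}\n{srt_time(start)} --> {srt_time(end)}\n{text}\n")
```

Changing `size` or dropping the `.upper()` call is all it takes to move to a different style, such as sentence-case captions.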

The Core Insight: Read Video as Text

The LLM never watches the video. It reads it.
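A rough sketch of what "reading" a video could look like: word-level timestamps with speaker labels are collapsed into one short markdown line per phrase. The input field names (`text`, `start`, `end`, `speaker`), the line format, and the `max_gap` threshold are assumptions for illustration; they are not the ElevenLabs response schema or video-use's actual file layout.

```python
def pack_transcript(clip_name, words, max_gap=0.5):
    """Render word-level timestamps as compact markdown, one line per phrase.

    A new phrase starts when the speaker changes or the silence between
    two words exceeds max_gap seconds.
    """
    lines = [f"## {clip_name}"]
    phrase, start, prev_end, speaker = [], None, None, None

    def flush():
        lines.append(f"[{start:.2f}-{prev_end:.2f}] {speaker}: {' '.join(phrase)}")

    for w in words:
        if phrase and (w["speaker"] != speaker or w["start"] - prev_end > max_gap):
            flush()
            phrase = []
        if not phrase:
            start, speaker = w["start"], w["speaker"]
        phrase.append(w["text"])
        prev_end = w["end"]
    if phrase:
        flush()
    return "\n".join(lines)


if __name__ == "__main__":
    words = [
        {"text": "Hello", "start": 0.0, "end": 0.4, "speaker": "S1"},
        {"text": "world", "start": 0.45, "end": 0.9, "speaker": "S1"},
        {"text": "Hi", "start": 2.0, "end": 2.2, "speaker": "S2"},
    ]
    print(pack_transcript("take1.mp4", words))
```

A text file like this costs far fewer tokens than video frames, and every phrase still carries the timestamps the model needs to propose cuts.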

The Full Pipeline

Transcribe ──> Pack ──> LLM Reasons ──> EDL ──> Render ──> Self-Eval
                                                              │
                                                              └─ issue? fix + re-render (max 3x)
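The self-evaluation loop at the end of the pipeline can be sketched as a bounded retry. The `render`, `evaluate`, and `fix` callables below are hypothetical stand-ins for the real steps, and the sketch assumes "max 3x" means at most three renders in total; the article does not say whether the first render counts.

```python
def render_with_self_eval(edl, render, evaluate, fix, max_tries=3):
    """Render, check, and repair until clean or max_tries renders are used.

    render(edl) -> output path
    evaluate(edl, output) -> list of issues (empty list means clean)
    fix(edl, issues) -> revised edl
    Returns (output, renders_used, remaining_issues).
    """
    for attempt in range(1, max_tries + 1):
        output = render(edl)
        issues = evaluate(edl, output)
        if not issues:
            return output, attempt, []
        if attempt < max_tries:
            edl = fix(edl, issues)  # revise only if another render is allowed
    return output, max_tries, issues


if __name__ == "__main__":
    # Toy stand-ins: a cut ending in "!" counts as a bad boundary.
    def render(edl):
        return "edit/final.mp4"

    def evaluate(edl, output):
        return [cut for cut in edl if cut.endswith("!")]

    def fix(edl, issues):
        return [cut.rstrip("!") for cut in edl]

    print(render_with_self_eval(["intro", "demo!"], render, evaluate, fix))
```

Capping the loop matters because each render is expensive, and an issue the agent cannot fix would otherwise keep it re-rendering indefinitely.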

One-Command Setup (Recommended)

Set up https://github.com/browser-use/video-use for me.

Read install.md first to install and wire up ffmpeg, register the skill with whichever agent you're running under, and set up the ElevenLabs API key. Ask me to paste it when you need it. Then read SKILL.md for daily usage, and always read helpers/ because that's where the editing scripts live. After install, don't transcribe anything on your own; just tell me it's ready and wait for me to drop footage into a folder.

Manual Setup

# 1. Clone and symlink into your agent's skills directory
git clone https://github.com/browser-use/video-use ~/Developer/video-use
ln -sfn ~/Developer/video-use ~/.claude/skills/video-use        # Claude Code
# ln -sfn ~/Developer/video-use ~/.codex/skills/video-use       # Codex

# 2. Install Python deps
cd ~/Developer/video-use
uv sync                         # or: pip install -e .

# 3. Install system deps
brew install ffmpeg              # required
brew install yt-dlp             # optional — for downloading online sources

# 4. API key
cp .env.example .env
# Edit .env and add: ELEVENLABS_API_KEY=your_key_here
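After the manual steps above, a quick preflight check can confirm that ffmpeg is on the PATH and that the API key has been filled in. This is a hypothetical helper, not part of video-use; it assumes the `.env` file uses plain `KEY=VALUE` lines and that `your_key_here` is the unedited placeholder shown above.

```python
import shutil
from pathlib import Path


def read_env(path):
    """Parse KEY=VALUE lines from a .env file, ignoring blanks and comments."""
    values = {}
    for line in Path(path).read_text().splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def preflight(env_path=".env"):
    """Return a list of human-readable problems; an empty list means ready."""
    problems = []
    if shutil.which("ffmpeg") is None:
        problems.append("ffmpeg not found on PATH")
    env = read_env(env_path) if Path(env_path).exists() else {}
    key = env.get("ELEVENLABS_API_KEY", "")
    if not key or key == "your_key_here":
        problems.append("ELEVENLABS_API_KEY missing in .env")
    return problems


if __name__ == "__main__":
    issues = preflight()
    print("ready" if not issues else "\n".join(issues))
```

Run it from the cloned repository directory so that the relative `.env` path resolves to the file created in step 4.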

Animation Overlays

Engine	Best for	Notes
HyperFrames	Motion graphics, lower thirds	Default recommendation
Remotion	React-based frame-accurate animations	Requires Node.js
Manim	Mathematical/technical visualizations	Requires Python Manim
PIL	Simple image overlays, static graphics	Fastest, no extra deps

video-use vs. Premiere Pro vs. DaVinci Resolve

Feature	video-use	Adobe Premiere Pro	DaVinci Resolve
Price	Free (MIT)	$60/month	Free / $295 (Studio)
Interface	Natural language	Timeline GUI	Timeline GUI
Learning curve	Minutes	Weeks–months	Weeks
Filler word removal	Automatic	Manual or plugin	Manual or scripted
Frame-precise control	Via description	Yes	Yes
Color grading	Auto (ffmpeg presets)	Full professional suite	Industry-leading
Subtitle generation	Automatic	Via plugin/AI	Auto-caption feature
Animation overlays	HyperFrames / Remotion / Manim / PIL	After Effects integration	Fusion (built-in)
Self-evaluation	Built-in	Manual preview	Manual preview
Session memory	project.md	Project files	Project files
Best for	Fast rough cuts, filler removal, launch videos	Professional broadcast, film	Color work, feature-length

Quick Reference

Task	Command in session
First edit	`edit these into a launch video`
Specific style	`keep all the takes, just remove filler words and grade warm`
Subtitles only	`burn white sentence-case subtitles, no other edits`
Social clips	`find the 3 best 90-second clips for Instagram`
Animation	`add a lower-third title animation for each speaker`
Download + edit	`download [URL] and edit into a 3-minute summary`
