videoagent-video-studio

Generate short AI videos from text or images using 7 backend models with zero API key setup.

pexoai/pexo-skillsUpdated Apr 8, 2026

Works with

Claude CodeCursorClineWindsurfCodexGooseGitHub CopilotZed

0

total installs

0

this week

434

GitHub stars

0

upvotes

Install Skill

Run in your terminal

$npx skills add https://github.com/pexoai/pexo-skills --skill videoagent-video-studio

0

installs

0

this week

434

stars

What it does

  • Supports three generation modes: text-to-video, image-to-video, and reference-based generation for consistent output

  • Seven models available (minimax, kling, veo, hunyuan, grok, seedance, pixverse) with automatic selection or manual override via --model flag

  • Configurable duration (4–12 seconds), aspect ratios (16:9, 9:16, 1:1, 4:3, 3:4), and automatic prompt enhancement for better results

  • S

Category

Video

Last updated

Apr 8, 2026

Installation Guide

How to use videoagent-video-studio on Cursor

AI-first code editor with Composer

1

Prerequisites

Before installing skills in Cursor, ensure your development environment meets these requirements:

  • Cursor installed and configured on your machine
  • Node.js 16+ with npm — verify with node --version
  • Active project directory where you want to add videoagent-video-studio
2

Run the install command

Execute the skills CLI command in your project's root directory to begin installation:

$npx skills add https://github.com/pexoai/pexo-skills --skill videoagent-video-studio

Fetches videoagent-video-studio from pexoai/pexo-skills and configures it for Cursor.

3

Select Cursor when prompted

The CLI shows a list of agents. Use arrow keys and space to select Cursor:

◆ Which agents do you want to install to?
│ ── Universal (.agents/skills) ────────────────
│ · Cline · Codex · Goose · Windsurf
│ ●Cursor(selected)
│ · Cursor · Aider · Continue
4

Verify installation

Confirm successful installation by checking the skill directory location:

.cursor/skills/videoagent-video-studio

Restart Cursor to activate videoagent-video-studio. Access via /videoagent-video-studio in your agent's command palette.

Security Notice

We perform automated surface-level scans (Gen AI Scanner, Socket, Snyk) during installation. These checks detect common vulnerabilities but do not guarantee complete security. Always review skill source code and verify the publisher's reputation before production use.

Skills execute code in your environment. Always review source, verify the publisher, and test in isolation before production.

Documentation

🎬 VideoAgent Video Studio

Use when: User asks to generate a video, create a video from text, animate an image, make a short clip, or produce AI video.

Generate short AI videos with 7 backends. This skill picks the right mode (text-to-video or image-to-video), enhances the prompt for best results, and returns the video URL.


Quick Reference

User Intent Mode Typical Duration
"Make a video of..." (no image) text-to-video 4–10 s
"Animate this image" / "Make this move" image-to-video 4–6 s
"Turn this into a video with..." image-to-video 4–6 s
Cinematic, story, ad Prefer text-to-video with detailed prompt 5–10 s

Generation Modes

Mode Description Models
text-to-video Text prompt only → video minimax, kling, veo, hunyuan, grok, seedance
image-to-video Single image + prompt → animated clip minimax, kling, veo, pixverse, grok, seedance
reference-based Reference images/video → consistent output minimax, kling, veo, hunyuan, grok, seedance

Models (use --model <id>)

Model ID T2V I2V Reference Notes
minimax Subject reference image, character consistency
kling Multi-element / character / keyframe (O3)
veo Google Veo 3.1, multiple reference images
hunyuan Video-to-video style transfer
pixverse Stylized image-to-video
grok Video editing via reference video
seedance Seedance 1.5 Pro, synchronized audio, 4–12 s

Full model details and endpoint reference: references/models.md.


How to Generate a Video

Step 1 — Choose mode and enhance the prompt

  • Text-to-video: Expand with subject, action, camera movement, lighting, and style. Be specific about motion (e.g. "camera slowly zooms in", "character walks left to right").
  • Image-to-video: Describe the motion to apply to the image (e.g. "gentle breeze in the hair", "camera pans across the scene"). See references/prompt_guide.md for patterns.

Step 2 — Run the script

Text-to-video:

node {baseDir}/tools/generate.js \
  --mode text-to-video \
  --prompt "<enhanced prompt>" \
  --duration <seconds> \
  --aspect-ratio <ratio>

Image-to-video:

node {baseDir}/tools/generate.js \
  --mode image-to-video \
  --prompt "<motion description>" \
  --image-url "<public image URL>" \
  --duration <seconds> \
  --aspect-ratio <ratio>

Parameters:

Parameter Default Description
--mode text-to-video text-to-video or image-to-video
--prompt (required) Scene or motion description
--image-url Required for image-to-video; public image URL
--duration 5 Length in seconds (typically 4–10)
--aspect-ratio 16:9 16:9, 9:16, 1:1, 4:3, 3:4
--model auto Model ID (e.g. kling, veo, grok, seedance); auto = proxy picks

Other commands:

Command Description
node tools/generate.js --list-models List available models from the proxy
node tools/generate.js --status --job-id <id> Check async job status

Step 3 — Return the result

The script returns JSON:

{
  "success": true,
  "mode": "text-to-video",
  "videoUrl": "https://...",
  "duration": 5,
  "aspectRatio": "16:9"
}

Send videoUrl to the user.


Example Conversations

User: "Generate a short video of a cat walking in the rain, cinematic."

node {baseDir}/tools/generate.js \
  --mode text-to-video \
  --prompt "A cat walking through rain, wet streets, neon reflections, cinematic lighting, slow motion, 4K" \
  --duration 5 \
  --aspect-ratio 16:9

User: "Animate this photo" (user uploads a landscape)

node {baseDir}/tools/generate.js \
  --mode image-to-video \
  --prompt "Gentle clouds moving across the sky, subtle grass movement, cinematic atmosphere" \
  --image-url "https://..." \
  --duration 5 \
  --aspect-ratio 16:9

User: "Make a 10-second vertical video of a coffee pour, slow motion."

node {baseDir}/tools/generate.js \
  --mode text-to-video \
  --prompt "Close-up of coffee pouring into a white cup, slow motion, steam rising, soft lighting, product shot" \
  --duration 10 \
  --aspect-ratio 9:16

User: "Use Google Veo for a cinematic shot."

node {baseDir}/tools/generate.js \
  --mode text-to-video \
  --model veo \
  --prompt "A dragon flying through cloudy skies, cinematic lighting, 8s" \
  --duration 8 \
  --aspect-ratio 16:9

User: "Animate this portrait."

node {baseDir}/tools/generate.js \
  --mode image-to-video \
  --model grok \
  --prompt "Gentle smile, subtle head turn" \
  --image-url "https://..." \
  --duration 5

Setup

Zero API keys by default. Requests go through a hosted proxy. Set these for a custom proxy or token:

Variable Required Description
VIDEO_STUDIO_PROXY_URL No Proxy base URL
VIDEO_STUDIO_TOKEN No Auth token if the proxy requires it

Knowledge Base

List & Monetize Your Skill

Submit your Claude Code skill and start earning

Get started →

Use Cases

Task Automation & Efficiency

Automate repetitive workflows and reduce manual effort

Example

Generate reports, summarize documents, draft communications

Save 3-5 hours per week on routine tasks

Knowledge Enhancement

Learn new skills, understand complex topics, get expert guidance

Example

Explain concepts, provide examples, suggest learning resources

Accelerate learning and skill development by 2x

Quality Improvement

Enhance output quality through reviews, suggestions, and refinements

Example

Review drafts, suggest improvements, catch errors

Improve work quality by 30-40% with less effort

Implementation Guide

Prerequisites

  • Claude Desktop or compatible AI client with skill support
  • Clear understanding of task or problem to solve
  • Willingness to iterate and refine outputs

Time Estimate

15-45 minutes depending on use case complexity

Steps

  1. 1Install skill using provided installation command
  2. 2Test with simple use case relevant to your work
  3. 3Evaluate output quality and relevance
  4. 4Iterate on prompts to improve results
  5. 5Integrate into regular workflow if valuable

Common Pitfalls

  • Expecting perfect results without iteration
  • Not providing enough context in prompts
  • Using skill for tasks outside its intended scope
  • Accepting outputs without review and validation

Best Practices

✓ Do

  • +Start with clear, specific prompts
  • +Provide relevant context and constraints
  • +Review and refine all outputs before using
  • +Iterate to improve output quality
  • +Document successful prompt patterns

✗ Don't

  • Don't use without understanding skill limitations
  • Don't skip validation of outputs
  • Don't share sensitive information in prompts
  • Don't expect skill to replace human judgment

💡 Pro Tips

  • Be specific about desired format and style
  • Ask for multiple options to choose from
  • Request explanations to understand reasoning
  • Combine AI efficiency with human expertise

When to Use This

✓ Use when

Use when skill capabilities match your task, clear ROI on time saved, and you can validate outputs. Best for repetitive tasks, learning, and quality improvement.

✗ Avoid when

Avoid when task requires deep expertise you can't validate, involves sensitive decisions, or when learning process is more valuable than speed of completion.

Learning Path

  1. 1Familiarize yourself with skill capabilities and limitations
  2. 2Start with low-risk, non-critical tasks
  3. 3Progress to more complex and valuable use cases
  4. 4Build expertise through regular use and experimentation

Related Skills

Reviews

4.846 reviews
  • C
    Chaitanya PatilDec 24, 2024

    Solid pick for teams standardizing on skills: videoagent-video-studio is focused, and the summary matches what you get after install.

  • K
    Kiara JainDec 24, 2024

    Solid pick for teams standardizing on skills: videoagent-video-studio is focused, and the summary matches what you get after install.

  • N
    Noor HuangDec 4, 2024

    We added videoagent-video-studio from the explainx registry; install was straightforward and the SKILL.md answered most questions upfront.

  • A
    Advait HarrisNov 23, 2024

    Solid pick for teams standardizing on skills: videoagent-video-studio is focused, and the summary matches what you get after install.

  • P
    Piyush GNov 15, 2024

    We added videoagent-video-studio from the explainx registry; install was straightforward and the SKILL.md answered most questions upfront.

  • M
    Mia ReddyNov 15, 2024

    We added videoagent-video-studio from the explainx registry; install was straightforward and the SKILL.md answered most questions upfront.

  • N
    Noor AndersonNov 15, 2024

    videoagent-video-studio reduced setup friction for our internal harness; good balance of opinion and flexibility.

  • L
    Lucas DesaiOct 14, 2024

    videoagent-video-studio has been reliable in day-to-day use. Documentation quality is above average for community skills.

  • S
    Shikha MishraOct 6, 2024

    videoagent-video-studio fits our agent workflows well — practical, well scoped, and easy to wire into existing repos.

  • M
    Mia HuangOct 6, 2024

    videoagent-video-studio fits our agent workflows well — practical, well scoped, and easy to wire into existing repos.

showing 1-10 of 46

1 / 5

Discussion

Comments — not star reviews
  • No comments yet — start the thread.