videoagent-video-studio
Generate short AI videos from text or images using 7 backend models with zero API key setup.
Works with
0
total installs
0
this week
434
GitHub stars
0
upvotes
Install Skill
Run in your terminal
0
installs
0
this week
434
stars
What it does
Supports three generation modes: text-to-video, image-to-video, and reference-based generation for consistent output
Seven models available (minimax, kling, veo, hunyuan, grok, seedance, pixverse) with automatic selection or manual override via --model flag
Configurable duration (4–12 seconds), aspect ratios (16:9, 9:16, 1:1, 4:3, 3:4), and automatic prompt enhancement for better results
S
Installation Guide
How to use videoagent-video-studio on Cursor
AI-first code editor with Composer
Prerequisites
Before installing skills in Cursor, ensure your development environment meets these requirements:
- ›Cursor installed and configured on your machine
- ›Node.js 16+ with npm — verify with
node --version - ›Active project directory where you want to add
videoagent-video-studio
Run the install command
Execute the skills CLI command in your project's root directory to begin installation:
Fetches videoagent-video-studio from pexoai/pexo-skills and configures it for Cursor.
Select Cursor when prompted
The CLI shows a list of agents. Use arrow keys and space to select Cursor:
Verify installation
Confirm successful installation by checking the skill directory location:
Restart Cursor to activate videoagent-video-studio. Access via /videoagent-video-studio in your agent's command palette.
Security Notice
We perform automated surface-level scans (Gen AI Scanner, Socket, Snyk) during installation. These checks detect common vulnerabilities but do not guarantee complete security. Always review skill source code and verify the publisher's reputation before production use.
Skills execute code in your environment. Always review source, verify the publisher, and test in isolation before production.
Documentation
🎬 VideoAgent Video Studio
Use when: User asks to generate a video, create a video from text, animate an image, make a short clip, or produce AI video.
Generate short AI videos with 7 backends. This skill picks the right mode (text-to-video or image-to-video), enhances the prompt for best results, and returns the video URL.
Quick Reference
| User Intent | Mode | Typical Duration |
|---|---|---|
| "Make a video of..." (no image) | text-to-video |
4–10 s |
| "Animate this image" / "Make this move" | image-to-video |
4–6 s |
| "Turn this into a video with..." | image-to-video |
4–6 s |
| Cinematic, story, ad | Prefer text-to-video with detailed prompt |
5–10 s |
Generation Modes
| Mode | Description | Models |
|---|---|---|
| text-to-video | Text prompt only → video | minimax, kling, veo, hunyuan, grok, seedance |
| image-to-video | Single image + prompt → animated clip | minimax, kling, veo, pixverse, grok, seedance |
| reference-based | Reference images/video → consistent output | minimax, kling, veo, hunyuan, grok, seedance |
Models (use --model <id>)
| Model ID | T2V | I2V | Reference | Notes |
|---|---|---|---|---|
minimax |
✅ | ✅ | ✅ | Subject reference image, character consistency |
kling |
✅ | ✅ | ✅ | Multi-element / character / keyframe (O3) |
veo |
✅ | ✅ | ✅ | Google Veo 3.1, multiple reference images |
hunyuan |
✅ | — | ✅ | Video-to-video style transfer |
pixverse |
— | ✅ | — | Stylized image-to-video |
grok |
✅ | ✅ | ✅ | Video editing via reference video |
seedance |
✅ | ✅ | ✅ | Seedance 1.5 Pro, synchronized audio, 4–12 s |
Full model details and endpoint reference: references/models.md.
How to Generate a Video
Step 1 — Choose mode and enhance the prompt
- Text-to-video: Expand with subject, action, camera movement, lighting, and style. Be specific about motion (e.g. "camera slowly zooms in", "character walks left to right").
- Image-to-video: Describe the motion to apply to the image (e.g. "gentle breeze in the hair", "camera pans across the scene"). See references/prompt_guide.md for patterns.
Step 2 — Run the script
Text-to-video:
node {baseDir}/tools/generate.js \
--mode text-to-video \
--prompt "<enhanced prompt>" \
--duration <seconds> \
--aspect-ratio <ratio>
Image-to-video:
node {baseDir}/tools/generate.js \
--mode image-to-video \
--prompt "<motion description>" \
--image-url "<public image URL>" \
--duration <seconds> \
--aspect-ratio <ratio>
Parameters:
| Parameter | Default | Description |
|---|---|---|
--mode |
text-to-video |
text-to-video or image-to-video |
--prompt |
(required) | Scene or motion description |
--image-url |
— | Required for image-to-video; public image URL |
--duration |
5 |
Length in seconds (typically 4–10) |
--aspect-ratio |
16:9 |
16:9, 9:16, 1:1, 4:3, 3:4 |
--model |
auto |
Model ID (e.g. kling, veo, grok, seedance); auto = proxy picks |
Other commands:
| Command | Description |
|---|---|
node tools/generate.js --list-models |
List available models from the proxy |
node tools/generate.js --status --job-id <id> |
Check async job status |
Step 3 — Return the result
The script returns JSON:
{
"success": true,
"mode": "text-to-video",
"videoUrl": "https://...",
"duration": 5,
"aspectRatio": "16:9"
}
Send videoUrl to the user.
Example Conversations
User: "Generate a short video of a cat walking in the rain, cinematic."
node {baseDir}/tools/generate.js \
--mode text-to-video \
--prompt "A cat walking through rain, wet streets, neon reflections, cinematic lighting, slow motion, 4K" \
--duration 5 \
--aspect-ratio 16:9
User: "Animate this photo" (user uploads a landscape)
node {baseDir}/tools/generate.js \
--mode image-to-video \
--prompt "Gentle clouds moving across the sky, subtle grass movement, cinematic atmosphere" \
--image-url "https://..." \
--duration 5 \
--aspect-ratio 16:9
User: "Make a 10-second vertical video of a coffee pour, slow motion."
node {baseDir}/tools/generate.js \
--mode text-to-video \
--prompt "Close-up of coffee pouring into a white cup, slow motion, steam rising, soft lighting, product shot" \
--duration 10 \
--aspect-ratio 9:16
User: "Use Google Veo for a cinematic shot."
node {baseDir}/tools/generate.js \
--mode text-to-video \
--model veo \
--prompt "A dragon flying through cloudy skies, cinematic lighting, 8s" \
--duration 8 \
--aspect-ratio 16:9
User: "Animate this portrait."
node {baseDir}/tools/generate.js \
--mode image-to-video \
--model grok \
--prompt "Gentle smile, subtle head turn" \
--image-url "https://..." \
--duration 5
Setup
Zero API keys by default. Requests go through a hosted proxy. Set these for a custom proxy or token:
| Variable | Required | Description |
|---|---|---|
VIDEO_STUDIO_PROXY_URL |
No | Proxy base URL |
VIDEO_STUDIO_TOKEN |
No | Auth token if the proxy requires it |
Knowledge Base
- references/prompt_guide.md — Prompt patterns for text-to-video and image-to-video.
- references/models.md — Model list, capabilities, and selection guide.
- references/calling_guide.md — Per-model endpoint details, input parameters, and special handling.
List & Monetize Your Skill
Submit your Claude Code skill and start earning
Use Cases
Task Automation & Efficiency
Automate repetitive workflows and reduce manual effort
Example
Generate reports, summarize documents, draft communications
Save 3-5 hours per week on routine tasks
Knowledge Enhancement
Learn new skills, understand complex topics, get expert guidance
Example
Explain concepts, provide examples, suggest learning resources
Accelerate learning and skill development by 2x
Quality Improvement
Enhance output quality through reviews, suggestions, and refinements
Example
Review drafts, suggest improvements, catch errors
Improve work quality by 30-40% with less effort
Implementation Guide
Prerequisites
- ›Claude Desktop or compatible AI client with skill support
- ›Clear understanding of task or problem to solve
- ›Willingness to iterate and refine outputs
Time Estimate
15-45 minutes depending on use case complexity
Steps
- 1Install skill using provided installation command
- 2Test with simple use case relevant to your work
- 3Evaluate output quality and relevance
- 4Iterate on prompts to improve results
- 5Integrate into regular workflow if valuable
Common Pitfalls
- ⚠Expecting perfect results without iteration
- ⚠Not providing enough context in prompts
- ⚠Using skill for tasks outside its intended scope
- ⚠Accepting outputs without review and validation
Best Practices
✓ Do
- +Start with clear, specific prompts
- +Provide relevant context and constraints
- +Review and refine all outputs before using
- +Iterate to improve output quality
- +Document successful prompt patterns
✗ Don't
- −Don't use without understanding skill limitations
- −Don't skip validation of outputs
- −Don't share sensitive information in prompts
- −Don't expect skill to replace human judgment
💡 Pro Tips
- ★Be specific about desired format and style
- ★Ask for multiple options to choose from
- ★Request explanations to understand reasoning
- ★Combine AI efficiency with human expertise
When to Use This
✓ Use when
Use when skill capabilities match your task, clear ROI on time saved, and you can validate outputs. Best for repetitive tasks, learning, and quality improvement.
✗ Avoid when
Avoid when task requires deep expertise you can't validate, involves sensitive decisions, or when learning process is more valuable than speed of completion.
Learning Path
- 1Familiarize yourself with skill capabilities and limitations
- 2Start with low-risk, non-critical tasks
- 3Progress to more complex and valuable use cases
- 4Build expertise through regular use and experimentation
Related Skills
seedance-2.0-prompter
7pexoai/pexo-skills
remotion-video-production
7supercent-io/skills-template
video-editing
7affaan-m/everything-claude-code
remotion-best-practices
6remotion-dev/skills
video-downloader
6davila7/claude-code-templates
video-analyzer
6zrong/skills
Reviews
- CChaitanya Patil★★★★★Dec 24, 2024
Solid pick for teams standardizing on skills: videoagent-video-studio is focused, and the summary matches what you get after install.
- KKiara Jain★★★★★Dec 24, 2024
Solid pick for teams standardizing on skills: videoagent-video-studio is focused, and the summary matches what you get after install.
- NNoor Huang★★★★★Dec 4, 2024
We added videoagent-video-studio from the explainx registry; install was straightforward and the SKILL.md answered most questions upfront.
- AAdvait Harris★★★★★Nov 23, 2024
Solid pick for teams standardizing on skills: videoagent-video-studio is focused, and the summary matches what you get after install.
- PPiyush G★★★★★Nov 15, 2024
We added videoagent-video-studio from the explainx registry; install was straightforward and the SKILL.md answered most questions upfront.
- MMia Reddy★★★★★Nov 15, 2024
We added videoagent-video-studio from the explainx registry; install was straightforward and the SKILL.md answered most questions upfront.
- NNoor Anderson★★★★★Nov 15, 2024
videoagent-video-studio reduced setup friction for our internal harness; good balance of opinion and flexibility.
- LLucas Desai★★★★★Oct 14, 2024
videoagent-video-studio has been reliable in day-to-day use. Documentation quality is above average for community skills.
- SShikha Mishra★★★★★Oct 6, 2024
videoagent-video-studio fits our agent workflows well — practical, well scoped, and easy to wire into existing repos.
- MMia Huang★★★★★Oct 6, 2024
videoagent-video-studio fits our agent workflows well — practical, well scoped, and easy to wire into existing repos.
showing 1-10 of 46
Discussion
Comments — not star reviews- No comments yet — start the thread.