short-video-production▌
vivy-yi/xiaohongshu-skills · updated May 3, 2026
MDX-style export adds YAML metadata + attribution linking explainx.ai and this canonical listing URL.
Short video production is the art and science of creating engaging 15-60 second vertical videos optimized specifically for Xiaohongshu's mobile-first, algorithm-driven platform. This skill encompasses the complete workflow from concept through filming, editing, and optimization—combining compelling storytelling, visual appeal, rapid pacing, and concise delivery to capture viewer attention within milliseconds and retain it through completion. The core principle: on mobile platforms with endless c
Short Video Production (短视频制作)
Overview
Short video production is the art and science of creating engaging 15-60 second vertical videos optimized specifically for Xiaohongshu's mobile-first, algorithm-driven platform. This skill encompasses the complete workflow from concept through filming, editing, and optimization—combining compelling storytelling, visual appeal, rapid pacing, and concise delivery to capture viewer attention within milliseconds and retain it through completion. The core principle: on mobile platforms with endless content competing for attention, you don't have luxury of slow builds—every second must earn the viewer's continued watching.
Key insight: Xiaohongshu's algorithm heavily weights video completion rate and watch time as signals of content quality. Well-produced short videos (15-60 seconds, vertical format, strong hooks) achieve 5-10x higher completion rates than unedited horizontal footage. The first 3 seconds determine 80% of a video's success—invest disproportionately in hooks, and make every frame earn its place through rapid pacing and clear value delivery.
When to Use
Use when:
- Creating vertical video content for Xiaohongshu (primary format)
- Filming vlogs, tutorials, reviews, or lifestyle content
- Editing short-form vertical video (under 60 seconds)
- Optimizing video for mobile-first consumption
- Repurposing longer content into short clips
- Creating product showcases or demonstrations
- Building personal brand through video storytelling
- Producing behind-the-scenes or day-in-the-life content
- Making educational or how-to video content
Do NOT use when:
- Creating static image posts (use photo-editing or graphic-layout instead)
- Editing feature-length video or documentary content (use professional video editing software)
- Production requires cinematic quality beyond smartphone capability
- Content is fundamentally audio-first and doesn't benefit from visuals (podcast-style content)
- Creating live streaming content (different production considerations)
- Platform requires horizontal format (certain cinematic content for other platforms)
Core Pattern
Before (unoptimized, boring video): ❌ "3+ minutes long, viewers scroll away" ❌ "Horizontal format, black bars on mobile" ❌ "Poor audio, can't hear clearly" ❌ "No clear structure, rambles" ❌ "Boring 15-second intro, lost viewers" ❌ "Weak or missing call-to-action" ❌ "Raw footage, no editing"
After (engaging, optimized short video): ✅ "15-60 seconds, respects viewer time" ✅ "Vertical 9:16 format, mobile-native" ✅ "Clear audio, professional quality" ✅ "Hook-value-CTA structure, purposeful" ✅ "Captivating first 3 seconds, immediate hook" ✅ "Clear CTA, tells viewer what to do" ✅ "Polished editing, dynamic pacing"
6 Essential Short Video Elements:
| Element | Purpose | Impact | Priority |
|---|---|---|---|
| Hook (0-3s) | Grab attention immediately | Critical | #1 - determines 80% of success |
| Value Delivery (3-45s) | Deliver on hook's promise | Critical | #2 - must earn continued watching |
| Pacing | Maintain engagement through speed | High | #3 - dynamic, fast cuts |
| Visual Quality | Professional, credible appearance | High | #4 - lighting, framing, stability |
| Audio Quality | Clear, pleasant sound experience | High | #5 - poor audio = video failure |
| Call-to-Action | Guide viewer to next step | Medium-High | #6 - engagement and conversion |
Quick Reference
Video Type Comparison for Xiaohongshu:
| Video Type | Optimal Length | Structure | Production | Engagement | Best For |
|---|---|---|---|---|---|
| Tutorial (教程) | 45-60s | Hook → Problem → Steps → Result | Medium | Very High | Educational value |
| Review (测评) | 30-45s | Hook → First impressions → Features → Verdict | Medium | High | Product content |
| Vlog (日常) | 30-60s | Scene → Action → Reaction → Reflection | Low-Medium | Medium | Authenticity, lifestyle |
| Tips List (干货) | 15-30s | Hook → Tip 1, 2, 3 → CTA | Low | Very High | Quick value delivery |
| Before-After (对比) | 15-30s | Before → Process → After reveal | Low-Medium | Very High | Transformations |
| Storytelling (故事) | 45-60s | Hook → Conflict → Journey → Resolution | High | High | Emotional connection |
| Challenge (挑战) | 30-45s | Intro → Attempt → Result → Reaction | Medium | High | Entertainment, relatability |
| Quick Tip (秒懂) | 7-15s | Hook → One tip → CTA | Very Low | High | Ultra-brief value |
Video Format Specifications:
| Specification | Recommended | Acceptable | Avoid |
|---|---|---|---|
| Aspect Ratio | 9:16 (vertical) | 1:1 (square) | 16:9 (horizontal) |
| Resolution | 1080×1920 (Full HD) | 720×1280 (HD) | Below 720p |
| Frame Rate | 30fps | 24fps, 60fps | Below 24fps |
| File Size | Under 500MB | Under 1GB | Over 1GB |
| Length | 15-60s | Up to 3 min (rare) | Over 3 min |
| Format | MP4, MOV | - | AVI, MKV (unsupported) |
Equipment by Production Quality:
| Quality Level | Camera | Audio | Lighting | Stabilization | Budget |
|---|---|---|---|---|---|
| Basic | Smartphone (built-in) | Phone mic | Natural light | Handheld careful | ¥0 |
| Improved | Smartphone + app | Lavalier mic (¥50) | Ring light (¥100) | Mini tripod (¥50) | ¥200 |
| Advanced | Smartphone + lens kit | Wireless mic (¥300) | Softbox + ring light | Gimbal stabilizer (¥500) | ¥1,000 |
| Professional | Mirrorless camera | Shotgun mic + recorder | 3-point lighting | Professional rig | ¥10,000+ |
Implementation
Step 1: Plan Video Structure and Concept
Great videos start with intentional planning, not random filming.
Define Your Video Type (from Quick Reference table above):
- Tutorial: Teaching a skill or process
- Review: Evaluating a product or service
- Vlog: Documenting daily life or experience
- Tips List: Delivering multiple quick tips
- Before-After: Showing transformation
- Storytelling: Narrative arc with emotion
- Challenge: Attempting something difficult
- Quick Tip: Ultra-brief single-value delivery
Structure Formula by Length:
15-Second Videos (ultra-fast):
- 0-2s: Hook (grab attention immediately)
- 2-13s: Single value point (one tip, one reveal, one moment)
- 13-15s: CTA (follow, like, save)
30-Second Videos (standard):
- 0-3s: Hook (setup, promise, curiosity)
- 3-25s: Main content (2-3 key points, examples)
- 25-30s: CTA + preview (what's next, why follow)
60-Second Videos (maximum):
- 0-3s: Hook (strongest opener)
- 3-50s: Main content (3-5 points, demonstration, examples)
- 50-55s: Summary or key takeaway
- 55-60s: CTA (specific action)
Pre-Filming Checklist:
- Video type and structure defined
- Hook concept finalized (first 3 seconds scripted)
- Key points outlined (3-5 max)
- Location and lighting scouted
- Props and materials ready
- Battery charged, storage available
- Practice run-through (optional but recommended)
Step 2: Film High-Quality Vertical Footage
Good footage makes editing easy; bad footage can't be fully fixed.
Camera Settings for Smartphone:
| Setting | Recommended | Why |
|---|---|---|
| Resolution | 4K (downscale to 1080p in edit) | Future-proofing, editing flexibility |
| Frame Rate | 30fps | Standard for social media, smooth motion |
| Grid | On (rule of thirds) | Composition guide |
| HDR | Off (can cause editing issues) | Consistent exposure |
| Stabilization | On (but use gimbal when possible) | Reduce shake |
Framing and Composition:
Vertical Framing Rules:
- Subject placement: Center or rule-of-thirds intersection
- Headroom: Leave 10-15% space above head (not too much, not too little)
- Eye level: Camera at subject's eye level (most flattering)
- Close-ups: Chest up for talking head, closer for detail shots
- Variety: Mix close-ups, medium shots, and wide shots for dynamic editing
Lighting Fundamentals:
| Lighting Type | Quality | Best For | Setup |
|---|---|---|---|
| Natural (window) | Soft, flattering | Talking head, beauty | Face near window, not direct sun |
| Ring light | Even, circular catchlights | Product, tutorials | Ring light centered on face/product |
| Softbox | Professional, diffused | Any content | Softbox 45° angle to subject |
| Natural (outdoor) | Varied by weather | Vlog, lifestyle | Golden hour (sunrise/sunset) best |
Lighting Setup Tips:
- Avoid: Backlighting (subject becomes silhouette)
- Avoid: Harsh overhead light (unflattering shadows under eyes)
- Preferred: Light source in front of subject (illuminates face)
- Test: Record 5-second test, check lighting before full shoot
Audio Recording Best Practices:
Microphone Selection:
- Built-in phone mic: Acceptable for quiet indoor settings, worst choice
- Lavalier (clip-on): Best for voice clarity, consistent distance (¥50-300)
- Wireless mic: Freedom of movement, professional quality (¥300-800)
- Shotgun mic: Directional, outdoor use, requires operator (¥500+)
Audio Setup:
- Distance: 15-30cm from mic to mouth (too close = distortion, too far = echo)
- Environment: Quietest room available, close windows, turn off fans/AC
- Monitoring: Use headphones to monitor audio live (catch issues immediately)
- Test record: Always do 10-second test, check audio quality
Stabilization Techniques:
| Method | Effectiveness | Cost | Best For |
|---|---|---|---|
| Handheld careful | Low | Free | Stationary shots |
| Two-handed grip | Low-Medium | Free | Brief shots |
| Elbow lock (body as tripod) | Medium | Free | Medium-length shots |
| Mini tripod | Medium | ¥30-50 | Stationary, tutorials |
| Gimbal stabilizer | Very High | ¥400-800 | Moving shots, vlogs |
| Fixed tripod | Very High | ¥100-300 | Talking head, tutorials |
Step 3: Edit for Maximum Engagement
Editing transforms raw footage into compelling content.
Editing Workflow (using JianYing/CapCut/VietCut):
1. Import and Organize (5 minutes):
- Import footage to editing app
- Create bins/folders for organized clips
- Watch all footage, mark best takes
2. Rough Cut (10-15 minutes):
- Arrange clips in planned structure order
- Trim to approximate target length (20-30% over target)
- Remove mistakes, pauses, "ums" and "uhs"
- Focus on narrative flow, not perfection yet
3. Fine Cut (15-20 minutes):
- Tighten pacing: remove unnecessary frames
- Target exact length (15, 30, or 60 seconds)
- Ensure each shot earns its place (advances story or value)
- Cut on action (movement masks cuts)
4. Add Visual Elements (10-15 minutes):
- Text overlays: Key points, product names, tips
- Captions: Auto-generate, manually edit for accuracy
- Transitions: Keep subtle (0.3-0.5s), avoid cheesy effects
- Speed changes: Speed up boring parts (1.5-2x), slow down reveals (0.5-0.75x)
5. Audio Enhancement (5-10 minutes):
- Voiceover: Record clear narration if needed
- Music: Add background music (10-20% volume, not competing with voice)
- Sound effects: Subtle whooshes, pops for emphasis (use sparingly)
- Volume normalization: Ensure consistent volume throughout
6. Export and Quality Check (5 minutes):
- Export settings: 1080p, 30fps, high quality
- Format: MP4 for Xiaohongshu compatibility
- File size: Under 500MB for fast upload
- Test playback: Watch on phone before publishing
Editing Best Practices:
| Technique | When to Use | Effect |
|---|---|---|
| Jump cuts | Remove pauses, maintain energy | Dynamic, fast-paced |
| B-roll overlay | Cover cuts, add visual interest | Professional, engaging |
| Text animation | Emphasize key points | Attention-grabbing |
| Speed ramping | Build anticipation to reveal | Dramatic effect |
| Zoom in/out | Focus on detail, show context | Dynamic movement |
Step 4: Optimize the First 3 Seconds (Critical)
The hook determines 80% of your video's success. Invest disproportionate time here.
Hook Formula Types:
| Formula | Example | Why It Works |
|---|---|---|
| Curiosity gap | "这个产品真的让我惊呆了..." | Creates mystery, must watch to learn |
| Specific promise | "5个学生党省钱技巧" | Clear value, sets expectations |
| Problem-solution | "别再犯这个错误了" | Identifies pain, promises solution |
| Counter-intuitive | " unpopular opinion: [contrarian take]" | Challenges assumptions, provokes |
| Immediate result | [Show before-after instantly] | Visual proof, instant gratification |
| Question hook | "你知道为什么...吗?" | Engages curiosity, interactive |
| Bold statement | "这是我用过最好的..." | Strong opinion, controversial |
Visual Hook Techniques:
1. Face-to-Camera:
- Direct eye contact (creates connection)
- Genuine expression (smile, surprise, concern)
- Close-up framing (chest up)
2. Dynamic Movement:
- Quick camera move (zoom, pan, reveal)
- Subject movement (walk toward camera, gesture)
- Transition effect (whip pan, zoom transition)
3. Text Teaser:
- Bold text overlay: "Wait for the end!"
- Split screen: "Before" vs "After"
- Countdown: "3 things that will change your..."
4. Action Start:
- Start mid-action (not "hi everyone, welcome back")
- Immediate demonstration (show, don't tell)
- In media res (in the middle of action)
Hook Testing:
- Record 3 different hook variations
- Show to friends/audience: "Which makes you want to keep watching?"
- Use analytics: Which hook has highest 3-second retention?
Step 5: Add Clear Call-to-Action
Don't leave viewers guessing—tell them exactly what to do.
CTA Types and Examples:
| CTA Type | Example | Best For |
|---|---|---|
| Follow | "关注我看更多 [topic] 内容" | Building audience |
| Like/Save | "点赞收藏,下次容易找到" | Algorithm boost, utility |
| Comment | "评论区告诉我你的看法" | Engagement boost |
| Share | "转发给需要的朋友" | Viral potential |
| Link in bio | "链接在主页" | Traffic/Conversion |
| Challenge | "试试这个,然后告诉我结果" | Community, UGC |
| Series CTA | "这是Part 1, Part 2明天见" | Retention, series viewership |
CTA Placement Strategies:
1. End CTA (standard):
- Last 3-5 seconds of video
- Clear, specific, singular action (not 3 CTAs at once)
- Example: "关注我,明天分享Part 2"
2. Mid-roll CTA (for longer videos):
- At 50% mark (e.g., 25 seconds into 50-second video)
- Tease remaining content: "最重要的技巧在最后..."
- Returns viewer attention for second half
3. Text CTA (always visible):
- Persistent text overlay during video
- Example: "关注看更多" in corner throughout
- Reinforces verbal CTA
4. Comment CTA (engagement):
- Ask specific question: "你更喜欢A还是B? 评论区告诉我"
- Pin your response to top comment
- Builds community and algorithm signals
Step 6: Optimize for Xiaohongshu Platform
Platform-specific optimization maximizes reach and engagement.
Xiaohongshu Algorithm Signals:
| Signal | Weight | How to Optimize |
|---|---|---|
| Completion rate | Very High | Keep videos 15-60s, engaging throughout |
| Watch time | Very High | Deliver value early, don't bury lead |
| Engagement (likes, comments, saves, shares) | High | Include CTAs, ask questions |
| Re-watches | Medium | Create loops, make worth rewatching |
| Session time | Medium | Create binge-worthy content series |
Platform-Specific Best Practices:
Title and Description:
- Title: 10-20 characters, compelling, descriptive
- First line: Most important, visible in feed
- Hashtags: 3-5 relevant tags (not spammy)
- Keywords: Include searchable terms in description
Thumbnail/Cover (first frame matters):
- High contrast: Bright, eye-catching
- Text overlay: Tease content (not give away)
- Face: People respond to human faces
- Branding: Consistent style across videos
Posting Timing:
- Best times: 7-10 PM (high engagement hours)
- Best days: Weekends (Saturday, Sunday) for lifestyle
- Consistency: Post same time/days to build habit
- Frequency: 3-7 videos per week for growth
Cross-Promotion:
- WeChat Moments: Share video link to personal network
- Weibo: Post teaser with link back to Xiaohongshu
- Douyin: Repost vertical videos (cross-platform friendly)
Step 7: Test, Analyze, and Iterate
Continuous improvement through data-driven optimization.
Key Metrics to Track:
| Metric | How to Measure | What It Indicates |
|---|---|---|
| Views | Total impressions | Reach + algorithm boost |
| Completion Rate | Finished / Started × 100% | Content engagement quality |
| 3- how to use short-video-production How to use short-video-production on CursorAI-first code editor with Composer 1 PrerequisitesBefore installing skills in Cursor, ensure your development environment meets these requirements:
2 Execute installation commandExecute the skills CLI command in your project's root directory to begin installation: $npx skills add https://github.com/vivy-yi/xiaohongshu-skills --skill short-video-production The skills CLI fetches 3 Select Cursor when promptedThe CLI will show a list of available agents. Use arrow keys to navigate and space to select Cursor: ◆ Which agents do you want to install to? │ │ ── Universal (.agents/skills) ── always included ──── │ • Amp │ • Antigravity │ • Cline │ • Codex │ ●Cursor(selected) │ • Cursor │ • Windsurf 4 Verify installationConfirm successful installation by checking the skill directory location: .cursor/skills/short-video-production Reload or restart Cursor to activate short-video-production. Access the skill through slash commands (e.g., ⚠ Security & Verification NoticeWe perform automated surface-level scans (Gen AI Scanner, Socket, Snyk) during installation. These checks detect common vulnerabilities but do not guarantee complete security. Always review skill source code and verify the publisher's reputation before production use. Skills execute code in your development environment. Always verify the publisher's identity, review recent commits, and test in isolated environments before production deployment. List & Monetize Your SkillSubmit your Claude Code skill and start earning Use Cases▌Task Automation & EfficiencyAutomate repetitive workflows and reduce manual effort Example Generate reports, summarize documents, draft communications ✓ Save 3-5 hours per week on routine tasks Knowledge EnhancementLearn new skills, understand complex topics, get expert guidance Example Explain concepts, provide examples, suggest learning resources ✓ Accelerate learning and skill development by 2x Quality ImprovementEnhance output quality through reviews, suggestions, and refinements Example Review drafts, suggest improvements, catch errors ✓ Improve work quality by 30-40% with less effort Implementation Guide▌Prerequisites
Time Estimate15-45 minutes depending on use case complexity Installation Steps
Common Pitfalls
Best Practices▌✓ Do
✗ Don't
💡 Pro Tips
When to Use This▌✓ Use WhenUse when skill capabilities match your task, clear ROI on time saved, and you can validate outputs. Best for repetitive tasks, learning, and quality improvement. ✗ Avoid WhenAvoid when task requires deep expertise you can't validate, involves sensitive decisions, or when learning process is more valuable than speed of completion. Learning Path▌
DiscussionProduct Hunt–style comments (not star reviews)
general reviews Ratings4.6★★★★★63 reviews
showing 1-10 of 63 1 / 7 |