generation
36 indexed skills · max 10 per page
ai-music-generation
inference-sh/skills · AI/ML
Generate music and songs via inference.sh CLI.
podcast-generation
bytedance/deer-flow · Productivity
This skill generates high-quality podcast audio from text content. The workflow includes creating a structured JSON script (conversational dialogue) and executing audio generation through text-to-speech synthesis.
image-generation
claude-office-skills/skills · Productivity
I help you create effective prompts for AI image generation tools like DALL-E, Midjourney, and Stable Diffusion. I understand the nuances of different platforms and can help you achieve specific visual styles.
image-generation
supercent-io/skills-template · Productivity
Generate high-quality images via Gemini models with structured prompts, aspect ratios, and brand validation. Supports three Gemini models (gemini-3-pro-image, gemini-2.5-flash-image, gemini-2.5-pro-image) optimized for different quality-speed tradeoffs. Enforces structured prompt format covering subject, style, lighting, mood, composition, aspect ratio, and brand colors to ensure consistency. Includes multi-agent workflow for prompt validation, style verification, and output delivery a
ai-video-generation
inferen-sh/skills · AI/ML
Generate videos with 40+ AI models including Veo, Seedance, Wan, and Grok via inference.sh CLI. Supports text-to-video, image-to-video, avatar animation, lipsync, video upscaling, and foley sound generation across multiple model families. Access 10+ text-to-video models (Veo 3.1, Seedance 1.5 Pro, Wan, Grok Video) and 5+ image-to-video variants optimized for speed, quality, or cost. Includes avatar and lipsync tools (OmniHuman, Fabric, PixVerse) for talking-head and character animation
ai-image-generation
inferen-sh/skills · AI/ML
Generate images with 50+ AI models including FLUX, Gemini, Grok, and Seedream via inference.sh CLI. Supports text-to-image, image-to-image, inpainting, LoRA customization, image editing, upscaling, and text rendering across multiple model families. Models range from ultra-fast budget options (FLUX Klein at $0.0001/image) to high-fidelity 4K outputs (Seedream 4.5, ImagineArt 1.5 Pro). Includes Google Gemini, xAI Grok, ByteDance Seedream, and Pruna P-Image variants with configurable aspe