tag

generation

36 indexed skills · max 10 per page

skills (36)

ai-music-generation

inference-sh/skills · AI/ML

0

Generate music and songs via inference.sh CLI.

podcast-generation

bytedance/deer-flow · Productivity

0

This skill generates high-quality podcast audio from text content. The workflow includes creating a structured JSON script (conversational dialogue) and executing audio generation through text-to-speech synthesis.

image-generation

claude-office-skills/skills · Productivity

0

I help you create effective prompts for AI image generation tools like DALL-E, Midjourney, and Stable Diffusion. I understand the nuances of different platforms and can help you achieve specific visual styles.

image-generation

supercent-io/skills-template · Productivity

0

Generate high-quality images via Gemini models with structured prompts, aspect ratios, and brand validation. \n \n Supports three Gemini models (gemini-3-pro-image, gemini-2.5-flash-image, gemini-2.5-pro-image) optimized for different quality-speed tradeoffs \n Enforces structured prompt format covering subject, style, lighting, mood, composition, aspect ratio, and brand colors to ensure consistency \n Includes multi-agent workflow for prompt validation, style verification, and output delivery a

ai-video-generation

inferen-sh/skills · AI/ML

0

Generate videos with 40+ AI models including Veo, Seedance, Wan, and Grok via inference.sh CLI. \n \n Supports text-to-video, image-to-video, avatar animation, lipsync, video upscaling, and foley sound generation across multiple model families \n Access 10+ text-to-video models (Veo 3.1, Seedance 1.5 Pro, Wan, Grok Video) and 5+ image-to-video variants optimized for speed, quality, or cost \n Includes avatar and lipsync tools (OmniHuman, Fabric, PixVerse) for talking-head and character animation

ai-image-generation

inferen-sh/skills · AI/ML

0

Generate images with 50+ AI models including FLUX, Gemini, Grok, and Seedream via inference.sh CLI. \n \n Supports text-to-image, image-to-image, inpainting, LoRA customization, image editing, upscaling, and text rendering across multiple model families \n Models range from ultra-fast budget options (FLUX Klein at $0.0001/image) to high-fidelity 4K outputs (Seedream 4.5, ImagineArt 1.5 Pro) \n Includes Google Gemini, xAI Grok, ByteDance Seedream, and Pruna P-Image variants with configurable aspe

prevpage 4 / 4next