audio▌
21 indexed skills · max 10 per page
alicloud-ai-audio-tts-voice-clone
cinience/alicloud-skills · Cloud
Voice cloning and text-to-speech synthesis using Alibaba Cloud Qwen TTS VC models. \n \n Supports two model variants: standard batch processing ( qwen3-tts-vc-2026-01-22 ) and real-time streaming ( qwen3-tts-vc-realtime-2026-01-15 ) \n Accepts voice samples as file paths or raw bytes; generates cloned voice IDs for reuse across multiple synthesis requests \n Normalized interface handles text input, voice enrollment, optional streaming output, and returns audio URLs or PCM chunks \n Requires DASH
web-audio-api
martinholovsky/claude-skills-generator · Backend
This skill provides Web Audio API expertise for creating audio feedback, voice processing, and sound effects in the JARVIS AI Assistant.
game-audio
opusgamelabs/game-creator · Productivity
You are an expert game audio engineer. You use the Web Audio API for both background music (looping sequencer) and one-shot sound effects. Zero dependencies — everything is built into the browser.
alicloud-ai-audio-asr-test
cinience/alicloud-skills · Cloud
Category: test
alicloud-ai-audio-tts-realtime
cinience/alicloud-skills · Cloud
Category: provider
alicloud-ai-audio-tts-voice-clone-test
cinience/alicloud-skills · Cloud
Category: test
videoagent-audio-studio
pexoai/pexo-skills · Video
Unified audio generation dispatcher routing TTS, music, sound effects, and voice cloning to optimal models. \n \n Routes requests to ElevenLabs (TTS, voice cloning, SFX) or fal.ai (music) based on request type, with latencies ranging from <1s to ~15s \n Supports five audio capabilities: multilingual text-to-speech with voice selection, low-latency turbo TTS, background music composition, sound effect generation (up to 22 seconds), and voice cloning from audio samples \n Requires only ELEVEN
audio-producer
daffy0208/ai-dev-standards · Productivity
I help you build audio players, process audio, and create interactive sound experiences for the web.
dialogue-audio
inference-sh/skills · Productivity
Create realistic multi-speaker dialogue with Dia TTS via inference.sh CLI.
audio-transcribe
infquest/vibe-ops-plugin · Productivity
使用 WhisperX 进行语音识别,支持多种语言和词级别时间戳对齐。