explainx.ainewsletter3.4k
trending🔥loopsskills
pricing
workshops ↗
explainx.ai

Learn to lead teams that combine humans and agents. Platform access, live workshops, bootcamps, and 50+ courses — plus skills, tools, and MCP to practice what you learn.

follow us

custom AI agents

[email protected]

get started

Join · $29/moUpcoming workshop

learn

platform · $29/moupcoming workshopworkshopsbootcampscoursescertificationscertification testsexplainx universitycorporate trainingfacilitatorshackathonslearn skills & mcp

discover

skillstoolsagentsmcp serversdesignsllmsagiranks

content

releasesvisionmissionaboutteamcareersresourcespromptsgenerators hubgenerator SEO hubprompt templatesprompt guidesblogfor LLMsdemo

Sister Products

Infloq

Infloq

Influencer marketing

BgBlur

BgBlur

Privacy-first blur

Olly Social

Olly Social

Social AI copilot

Ceptory

Ceptory

Video intelligence

BgRemover

BgRemover

Background removal

newsletter · weekly

Get AI news, tools, and insights in your inbox.

contactsupportprivacytermsdata rightssubmission guidelines

© 2026 AISOLO Technologies Pvt Ltd

skills/tag/audio
tag

audio▌

21 indexed skills · max 10 per page

skills (21)

alicloud-ai-audio-tts-voice-clone

cinience/alicloud-skills · Cloud

2

Voice cloning and text-to-speech synthesis using Alibaba Cloud Qwen TTS VC models. \n \n Supports two model variants: standard batch processing ( qwen3-tts-vc-2026-01-22 ) and real-time streaming ( qwen3-tts-vc-realtime-2026-01-15 ) \n Accepts voice samples as file paths or raw bytes; generates cloned voice IDs for reuse across multiple synthesis requests \n Normalized interface handles text input, voice enrollment, optional streaming output, and returns audio URLs or PCM chunks \n Requires DASH

web-audio-api

martinholovsky/claude-skills-generator · Backend

1

This skill provides Web Audio API expertise for creating audio feedback, voice processing, and sound effects in the JARVIS AI Assistant.

game-audio

opusgamelabs/game-creator · Productivity

1

You are an expert game audio engineer. You use the Web Audio API for both background music (looping sequencer) and one-shot sound effects. Zero dependencies — everything is built into the browser.

alicloud-ai-audio-asr-test

cinience/alicloud-skills · Cloud

0

Category: test

alicloud-ai-audio-tts-realtime

cinience/alicloud-skills · Cloud

0

Category: provider

alicloud-ai-audio-tts-voice-clone-test

cinience/alicloud-skills · Cloud

0

Category: test

videoagent-audio-studio

pexoai/pexo-skills · Video

0

Unified audio generation dispatcher routing TTS, music, sound effects, and voice cloning to optimal models. \n \n Routes requests to ElevenLabs (TTS, voice cloning, SFX) or fal.ai (music) based on request type, with latencies ranging from <1s to ~15s \n Supports five audio capabilities: multilingual text-to-speech with voice selection, low-latency turbo TTS, background music composition, sound effect generation (up to 22 seconds), and voice cloning from audio samples \n Requires only ELEVEN

audio-producer

daffy0208/ai-dev-standards · Productivity

0

I help you build audio players, process audio, and create interactive sound experiences for the web.

dialogue-audio

inference-sh/skills · Productivity

0

Create realistic multi-speaker dialogue with Dia TTS via inference.sh CLI.

audio-transcribe

infquest/vibe-ops-plugin · Productivity

0

使用 WhisperX 进行语音识别,支持多种语言和词级别时间戳对齐。

prevpage 1 / 3next