alicloud-ai-audio-tts▌
cinience/alicloud-skills · updated Apr 8, 2026
Category: provider
Category: provider
Model Studio Qwen TTS
Validation
mkdir -p output/alicloud-ai-audio-tts
python -m py_compile skills/ai/audio/alicloud-ai-audio-tts/scripts/generate_tts.py && echo "py_compile_ok" > output/alicloud-ai-audio-tts/validate.txt
Pass criteria: command exits 0 and output/alicloud-ai-audio-tts/validate.txt is generated.
Output And Evidence
- Save generated audio links, sample audio files, and request payloads to
output/alicloud-ai-audio-tts/. - Keep one validation log per execution.
Critical model names
Use one of the recommended models:
qwen3-tts-flashqwen3-tts-instruct-flashqwen3-tts-instruct-flash-2026-01-26
Prerequisites
- Install SDK (recommended in a venv to avoid PEP 668 limits):
python3 -m venv .venv
. .venv/bin/activate
python -m pip install dashscope
- Set
DASHSCOPE_API_KEYin your environment, or adddashscope_api_keyto~/.alibabacloud/credentials(env takes precedence).
Normalized interface (tts.generate)
Request
text(string, required)voice(string, required)language_type(string, optional; defaultAuto)instruction(string, optional; recommended for instruct models)stream(bool, optional; default false)
Response
audio_url(string, when stream=false)audio_base64_pcm(string, when stream=true)sample_rate(int, 24000)format(string, wav or pcm depending on mode)
Quick start (Python + DashScope SDK)
import os
import dashscope
# Prefer env var for auth: export DASHSCOPE_API_KEY=...
# Or use ~/.alibabacloud/credentials with dashscope_api_key under [default].
# Beijing region; for Singapore use: https://dashscope-intl.aliyuncs.com/api/v1
dashscope.base_http_api_url = "https://dashscope.aliyuncs.com/api/v1"
text = "Hello, this is a short voice line."
response = dashscope.MultiModalConversation.call(
model="qwen3-tts-instruct-flash",
api_key=os.getenv("DASHSCOPE_API_KEY"),
text=text,
voice="Cherry",
language_type="English",
instruction="Warm and calm tone, slightly slower pace.",
stream=False,
)
audio_url = response.output.audio.url
print(audio_url)
Streaming notes
stream=Truereturns Base64-encoded PCM chunks at 24kHz.- Decode chunks and play or concatenate to a pcm buffer.
- The response contains
finish_reason == "stop"when the stream ends.
Operational guidance
- Keep requests concise; split long text into multiple calls if you hit size or timeout errors.
- Use
language_typeconsistent with the text to improve pronunciation. - Use
instructiononly when you need explicit style/tone control. - Cache by
(text, voice, language_type)to avoid repeat costs.
Output location
- Default output:
output/alicloud-ai-audio-tts/audio/ - Override base dir with
OUTPUT_DIR.
Workflow
- Confirm user intent, region, identifiers, and whether the operation is read-only or mutating.
- Run one minimal read-only query first to verify connectivity and permissions.
- Execute the target operation with explicit parameters and bounded scope.
- Verify results and save output/evidence files.
References
-
references/api_reference.mdfor parameter mapping and streaming example. -
Realtime mode is provided by
skills/ai/audio/alicloud-ai-audio-tts-realtime/. -
Voice cloning/design are provided by
skills/ai/audio/alicloud-ai-audio-tts-voice-clone/andskills/ai/audio/alicloud-ai-audio-tts-voice-design/. -
Source list:
references/sources.md
Discussion
Product Hunt–style comments (not star reviews)- No comments yet — start the thread.
Ratings
4.6★★★★★31 reviews- ★★★★★Harper Rahman· Dec 24, 2024
I recommend alicloud-ai-audio-tts for anyone iterating fast on agent tooling; clear intent and a small, reviewable surface area.
- ★★★★★Chaitanya Patil· Dec 20, 2024
Keeps context tight: alicloud-ai-audio-tts is the kind of skill you can hand to a new teammate without a long onboarding doc.
- ★★★★★James Haddad· Nov 15, 2024
Solid pick for teams standardizing on skills: alicloud-ai-audio-tts is focused, and the summary matches what you get after install.
- ★★★★★Piyush G· Nov 11, 2024
Registry listing for alicloud-ai-audio-tts matched our evaluation — installs cleanly and behaves as described in the markdown.
- ★★★★★Emma White· Oct 6, 2024
alicloud-ai-audio-tts has been reliable in day-to-day use. Documentation quality is above average for community skills.
- ★★★★★Shikha Mishra· Oct 2, 2024
alicloud-ai-audio-tts reduced setup friction for our internal harness; good balance of opinion and flexibility.
- ★★★★★James White· Sep 25, 2024
Keeps context tight: alicloud-ai-audio-tts is the kind of skill you can hand to a new teammate without a long onboarding doc.
- ★★★★★Yash Thakker· Sep 21, 2024
I recommend alicloud-ai-audio-tts for anyone iterating fast on agent tooling; clear intent and a small, reviewable surface area.
- ★★★★★James Huang· Sep 13, 2024
Solid pick for teams standardizing on skills: alicloud-ai-audio-tts is focused, and the summary matches what you get after install.
- ★★★★★Alexander Abebe· Sep 1, 2024
alicloud-ai-audio-tts is among the better-maintained entries we tried; worth keeping pinned for repeat workflows.
showing 1-10 of 31