alicloud-ai-audio-tts-voice-design

cinience/alicloud-skills · updated Apr 8, 2026

$npx skills add https://github.com/cinience/alicloud-skills --skill alicloud-ai-audio-tts-voice-design
0 commentsdiscussion
summary

Category: provider

skill.md

Category: provider

Model Studio Qwen TTS Voice Design

Use voice design models to create controllable synthetic voices from natural language descriptions.

Critical model names

Use one of these exact model strings:

  • qwen3-tts-vd-2026-01-26
  • qwen3-tts-vd-realtime-2026-01-15

Prerequisites

  • Install SDK in a virtual environment:
python3 -m venv .venv
. .venv/bin/activate
python -m pip install dashscope
  • Set DASHSCOPE_API_KEY in your environment, or add dashscope_api_key to ~/.alibabacloud/credentials.

Normalized interface (tts.voice_design)

Request

  • voice_prompt (string, required) target voice description
  • text (string, required)
  • stream (bool, optional)

Response

  • audio_url (string) or streaming PCM chunks
  • voice_id (string)
  • request_id (string)

Operational guidance

  • Write voice prompts with tone, pace, emotion, and timbre constraints.
  • Build a reusable voice prompt library for product consistency.
  • Validate generated voice in short utterances before long scripts.

Local helper script

Prepare a normalized request JSON and validate response schema:

.venv/bin/python skills/ai/audio/alicloud-ai-audio-tts-voice-design/scripts/prepare_voice_design_request.py \
  --voice-prompt "A warm female host voice, clear articulation, medium pace" \
  --text "This is a voice-design demo"

Output location

  • Default output: output/ai-audio-tts-voice-design/audio/
  • Override base dir with OUTPUT_DIR.

Validation

mkdir -p output/alicloud-ai-audio-tts-voice-design
for f in skills/ai/audio/alicloud-ai-audio-tts-voice-design/scripts/*.py; do
  python3 -m py_compile "$f"
done
echo "py_compile_ok" > output/alicloud-ai-audio-tts-voice-design/validate.txt

Pass criteria: command exits 0 and output/alicloud-ai-audio-tts-voice-design/validate.txt is generated.

Output And Evidence

  • Save artifacts, command outputs, and API response summaries under output/alicloud-ai-audio-tts-voice-design/.
  • Include key parameters (region/resource id/time range) in evidence files for reproducibility.

Workflow

  1. Confirm user intent, region, identifiers, and whether the operation is read-only or mutating.
  2. Run one minimal read-only query first to verify connectivity and permissions.
  3. Execute the target operation with explicit parameters and bounded scope.
  4. Verify results and save output/evidence files.

References

  • references/sources.md

Discussion

Product Hunt–style comments (not star reviews)
  • No comments yet — start the thread.
general reviews

Ratings

4.728 reviews
  • Advait Okafor· Dec 20, 2024

    We added alicloud-ai-audio-tts-voice-design from the explainx registry; install was straightforward and the SKILL.md answered most questions upfront.

  • Noah Tandon· Nov 19, 2024

    alicloud-ai-audio-tts-voice-design is among the better-maintained entries we tried; worth keeping pinned for repeat workflows.

  • Noah Patel· Nov 11, 2024

    Keeps context tight: alicloud-ai-audio-tts-voice-design is the kind of skill you can hand to a new teammate without a long onboarding doc.

  • Chaitanya Patil· Oct 22, 2024

    I recommend alicloud-ai-audio-tts-voice-design for anyone iterating fast on agent tooling; clear intent and a small, reviewable surface area.

  • Benjamin Thompson· Oct 10, 2024

    Useful defaults in alicloud-ai-audio-tts-voice-design — fewer surprises than typical one-off scripts, and it plays nicely with `npx skills` flows.

  • Kwame Patel· Oct 2, 2024

    alicloud-ai-audio-tts-voice-design has been reliable in day-to-day use. Documentation quality is above average for community skills.

  • Kaira Bansal· Sep 25, 2024

    We added alicloud-ai-audio-tts-voice-design from the explainx registry; install was straightforward and the SKILL.md answered most questions upfront.

  • Rahul Santra· Sep 13, 2024

    Keeps context tight: alicloud-ai-audio-tts-voice-design is the kind of skill you can hand to a new teammate without a long onboarding doc.

  • Piyush G· Sep 1, 2024

    Useful defaults in alicloud-ai-audio-tts-voice-design — fewer surprises than typical one-off scripts, and it plays nicely with `npx skills` flows.

  • Shikha Mishra· Aug 20, 2024

    alicloud-ai-audio-tts-voice-design is among the better-maintained entries we tried; worth keeping pinned for repeat workflows.

showing 1-10 of 28

1 / 3