alicloud-ai-audio-tts-realtime

cinience/alicloud-skills · updated Apr 8, 2026

$ npx skills add https://github.com/cinience/alicloud-skills --skill alicloud-ai-audio-tts-realtime

Category: provider

Model Studio Qwen TTS Realtime

Use realtime TTS models for low-latency streaming speech output.

Critical model names

Use one of these exact model strings:

  • qwen3-tts-flash-realtime
  • qwen3-tts-instruct-flash-realtime
  • qwen3-tts-instruct-flash-realtime-2026-01-22
  • qwen3-tts-vd-realtime-2026-01-15
  • qwen3-tts-vc-realtime-2026-01-15
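
Because the model string must match exactly, a small guard can catch typos before any request is sent. The following is a minimal sketch; `REALTIME_TTS_MODELS` and `require_realtime_model` are hypothetical names introduced here, not part of the skill or the SDK.

```python
# Model strings copied verbatim from the list above.
REALTIME_TTS_MODELS = frozenset({
    "qwen3-tts-flash-realtime",
    "qwen3-tts-instruct-flash-realtime",
    "qwen3-tts-instruct-flash-realtime-2026-01-22",
    "qwen3-tts-vd-realtime-2026-01-15",
    "qwen3-tts-vc-realtime-2026-01-15",
})


def require_realtime_model(name: str) -> str:
    """Return the name unchanged if it is a listed model string, else raise ValueError."""
    if name not in REALTIME_TTS_MODELS:
        allowed = ", ".join(sorted(REALTIME_TTS_MODELS))
        raise ValueError(f"unknown realtime TTS model {name!r}; expected one of: {allowed}")
    return name
```

The check is case-sensitive and does no normalization, so a near miss such as a missing `-realtime` suffix fails loudly instead of reaching the API.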

Prerequisites

  • Install the SDK in a virtual environment:
python3 -m venv .venv
. .venv/bin/activate
python -m pip install dashscope
  • Set DASHSCOPE_API_KEY in your environment, or add dashscope_api_key to ~/.alibabacloud/credentials.
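
The key lookup described above could be sketched as follows. This is an illustration only: it assumes the environment variable takes precedence and that the credentials file is INI-style with at least one section header. Neither detail is stated in this document, and `resolve_api_key` is a hypothetical helper.

```python
import configparser
import os
from pathlib import Path
from typing import Mapping, Optional


def resolve_api_key(
    env: Mapping[str, str] = os.environ,
    credentials_path: Optional[Path] = None,
) -> Optional[str]:
    """Return the API key from the environment or the credentials file, else None."""
    # Assumption: DASHSCOPE_API_KEY wins when both sources are set.
    key = env.get("DASHSCOPE_API_KEY")
    if key:
        return key

    path = credentials_path or Path.home() / ".alibabacloud" / "credentials"
    if not path.is_file():
        return None

    # Assumption: INI-style file; interpolation is disabled so '%' in keys is safe.
    parser = configparser.ConfigParser(interpolation=None)
    try:
        parser.read(path, encoding="utf-8")
    except configparser.Error:
        return None

    # Take the first section that defines dashscope_api_key.
    for section in parser.sections():
        value = parser.get(section, "dashscope_api_key", fallback=None)
        if value:
            return value
    return None
```

Passing `env` and `credentials_path` explicitly keeps the function easy to exercise in tests without touching the real home directory.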

Normalized interface (tts.realtime)

Request

  • text (string, required)
  • voice (string, required)
  • instruction (string, optional)
  • sample_rate (int, optional)

Response

  • audio_base64_pcm_chunks (array)
  • sample_rate (int)
  • finish_reason (string)
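
As a rough illustration, the normalized shapes above map onto two dataclasses, and the base64 PCM chunks can be joined into a WAV file with the standard library. The class names and `write_wav` are hypothetical, and the sketch assumes 16-bit mono PCM, which this document does not specify.

```python
import base64
import wave
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class RealtimeTTSRequest:
    text: str
    voice: str
    instruction: Optional[str] = None
    sample_rate: Optional[int] = None


@dataclass
class RealtimeTTSResponse:
    audio_base64_pcm_chunks: List[str]
    sample_rate: int
    finish_reason: str


def write_wav(response: RealtimeTTSResponse, path: str) -> int:
    """Decode all chunks, write them as one WAV file, and return the frame count."""
    pcm = b"".join(base64.b64decode(chunk) for chunk in response.audio_base64_pcm_chunks)
    with wave.open(str(path), "wb") as wav:
        wav.setnchannels(1)   # assumption: mono audio
        wav.setsampwidth(2)   # assumption: 16-bit samples (2 bytes each)
        wav.setframerate(response.sample_rate)
        wav.writeframes(pcm)
    return len(pcm) // 2
```

Decoding each chunk before concatenating avoids problems with base64 padding in the middle of the stream.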

Operational guidance

  • Use a WebSocket or streaming endpoint for realtime mode.
  • Keep each utterance short for lower latency.
  • For instruction models, keep the instruction explicit and concise.
  • Some SDK/runtime combinations may reject realtime model calls over MultiModalConversation; use the probe script below to verify compatibility.
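
One way to keep utterances short is to split long input at sentence boundaries before synthesis. The sketch below is a hypothetical helper, not part of the skill; the 80-character default is an arbitrary assumption to tune for your own latency target.

```python
import re
from typing import List

# Split after Latin sentence punctuation followed by whitespace,
# or directly after CJK sentence punctuation.
_SENTENCE_BOUNDARY = re.compile(r"(?<=[.!?])\s+|(?<=[。！？])")


def split_utterances(text: str, max_chars: int = 80) -> List[str]:
    """Greedily pack whole sentences into chunks of at most max_chars characters.

    A single sentence longer than max_chars is kept whole rather than cut mid-sentence.
    """
    sentences = [s.strip() for s in _SENTENCE_BOUNDARY.split(text) if s.strip()]
    chunks: List[str] = []
    current = ""
    for sentence in sentences:
        candidate = f"{current} {sentence}".strip()
        if current and len(candidate) > max_chars:
            chunks.append(current)
            current = sentence
        else:
            current = candidate
    if current:
        chunks.append(current)
    return chunks
```

Each returned chunk can then be sent as its own request, so playback of the first chunk can begin while later ones are still being synthesized.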

Local demo script

Use the probe script to verify realtime compatibility in your current SDK/runtime, and optionally fall back to a non-realtime model for immediate output:

.venv/bin/python skills/ai/audio/alicloud-ai-audio-tts-realtime/scripts/realtime_tts_demo.py \
  --text "This is a realtime speech demo." \
  --fallback \
  --output output/ai-audio-tts-realtime/audio/fallback-demo.wav

Strict mode (for CI / gating):

.venv/bin/python skills/ai/audio/alicloud-ai-audio-tts-realtime/scripts/realtime_tts_demo.py \
  --text "realtime health check" \
  --strict

Output location

  • Default output: output/ai-audio-tts-realtime/audio/
  • Override base dir with OUTPUT_DIR.
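
A minimal sketch of how a caller might compute the same directory. It assumes `OUTPUT_DIR` replaces only the leading `output` component of the default path; the demo script's actual behavior may differ, and `resolve_audio_dir` is a hypothetical name.

```python
import os
from pathlib import Path
from typing import Mapping


def resolve_audio_dir(env: Mapping[str, str] = os.environ) -> Path:
    """Return the audio output directory, honoring OUTPUT_DIR when it is set."""
    # Assumption: OUTPUT_DIR stands in for the default "output" base directory.
    base = Path(env.get("OUTPUT_DIR") or "output")
    return base / "ai-audio-tts-realtime" / "audio"
```

For example, with `OUTPUT_DIR=/tmp/run1` this sketch yields `/tmp/run1/ai-audio-tts-realtime/audio`.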

Validation

mkdir -p output/alicloud-ai-audio-tts-realtime
for f in skills/ai/audio/alicloud-ai-audio-tts-realtime/scripts/*.py; do
  python3 -m py_compile "$f"
done
echo "py_compile_ok" > output/alicloud-ai-audio-tts-realtime/validate.txt

Pass criteria: the command exits with status 0 and output/alicloud-ai-audio-tts-realtime/validate.txt is generated.

Output and evidence

  • Save artifacts, command outputs, and API response summaries under output/alicloud-ai-audio-tts-realtime/.
  • Include key parameters (region/resource id/time range) in evidence files for reproducibility.
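
An evidence file could be written along these lines. This is a sketch under assumptions: the JSON layout, the field names (`recorded_at`, `parameters`, `summary`), and the `write_evidence` helper are all illustrative, not a format the skill defines.

```python
import json
from datetime import datetime, timezone
from pathlib import Path
from typing import Any, Dict


def write_evidence(out_dir: str, name: str, parameters: Dict[str, Any], summary: Dict[str, Any]) -> Path:
    """Write one JSON evidence record under out_dir and return its path."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    record = {
        "recorded_at": datetime.now(timezone.utc).isoformat(),  # UTC timestamp of the run
        "parameters": parameters,  # e.g. model, voice, region
        "summary": summary,        # e.g. finish_reason, chunk count
    }
    path = out / f"{name}.json"
    path.write_text(json.dumps(record, indent=2, ensure_ascii=False), encoding="utf-8")
    return path
```

A call such as `write_evidence("output/alicloud-ai-audio-tts-realtime", "probe", {"model": "qwen3-tts-flash-realtime"}, {"finish_reason": "stop"})` would produce `probe.json` in the evidence directory named above.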

Workflow

  1. Confirm user intent, region, identifiers, and whether the operation is read-only or mutating.
  2. Run one minimal read-only query first to verify connectivity and permissions.
  3. Execute the target operation with explicit parameters and bounded scope.
  4. Verify results and save output/evidence files.

References

  • references/sources.md
