alicloud-ai-audio-tts-voice-clone

cinience/alicloud-skills · updated Apr 8, 2026

$ npx skills add https://github.com/cinience/alicloud-skills --skill alicloud-ai-audio-tts-voice-clone
summary

Voice cloning and text-to-speech synthesis using Alibaba Cloud Qwen TTS VC models.

  • Supports two model variants: standard batch processing (qwen3-tts-vc-2026-01-22) and real-time streaming (qwen3-tts-vc-realtime-2026-01-15)
  • Accepts voice samples as file paths or raw bytes; generates cloned voice IDs for reuse across multiple synthesis requests
  • A normalized interface handles text input, voice enrollment, and optional streaming output, and returns audio URLs or PCM chunks
  • Requires DASH
skill.md

Category: provider

Model Studio Qwen TTS Voice Clone

Use voice cloning models to replicate timbre from enrollment audio samples.

Critical model names

Use one of these exact model strings:

  • qwen3-tts-vc-2026-01-22
  • qwen3-tts-vc-realtime-2026-01-15
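
Because the model strings must match exactly, it can help to keep them in one place and reject anything else before a request is sent. The sketch below is illustrative only: `select_model` and `check_model` are hypothetical helpers, and mapping the `stream` flag to the realtime model is an assumption based on the summary's description of the two variants.

```python
# Exact model strings copied from the list above.
BATCH_MODEL = "qwen3-tts-vc-2026-01-22"
REALTIME_MODEL = "qwen3-tts-vc-realtime-2026-01-15"
ALLOWED_MODELS = frozenset({BATCH_MODEL, REALTIME_MODEL})


def select_model(stream: bool = False) -> str:
    """Hypothetical helper: realtime model for streaming, batch model otherwise."""
    return REALTIME_MODEL if stream else BATCH_MODEL


def check_model(name: str) -> str:
    """Hypothetical helper: reject any string that is not an exact match."""
    if name not in ALLOWED_MODELS:
        raise ValueError(
            f"unknown model {name!r}; expected one of {sorted(ALLOWED_MODELS)}"
        )
    return name
```

Calling `check_model` on user-supplied configuration catches near-miss names (for example, a missing date suffix) before they reach the API.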

Prerequisites

  • Install the SDK in a virtual environment:
python3 -m venv .venv
. .venv/bin/activate
python -m pip install dashscope
  • Set DASHSCOPE_API_KEY in your environment, or add dashscope_api_key to ~/.alibabacloud/credentials.
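
The two credential sources above can be checked in order with a small helper. This is a minimal sketch, not the skill's own lookup code: `resolve_api_key` is a hypothetical name, and the credentials file is assumed to be INI-formatted with `dashscope_api_key` stored under some section such as `[default]`.

```python
import configparser
import os
from pathlib import Path
from typing import Mapping, Optional


def resolve_api_key(
    env: Mapping[str, str] = os.environ,
    credentials_path: Path = Path.home() / ".alibabacloud" / "credentials",
) -> Optional[str]:
    """Return the key from the environment, else from the credentials file."""
    key = env.get("DASHSCOPE_API_KEY", "").strip()
    if key:
        return key
    if not credentials_path.is_file():
        return None
    # Assumption: INI layout, key stored as dashscope_api_key in any section.
    parser = configparser.ConfigParser(interpolation=None)
    try:
        parser.read(credentials_path, encoding="utf-8")
    except configparser.Error:
        return None  # File exists but is not valid INI.
    for section in parser.sections():
        value = parser.get(section, "dashscope_api_key", fallback="").strip()
        if value:
            return value
    return None
```

The environment variable takes precedence here so that a per-shell override does not require editing the shared credentials file; that ordering is a design choice of the sketch.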

Normalized interface (tts.voice_clone)

Request

  • text (string, required)
  • voice_sample (string | bytes, required): enrollment sample, given as a file path or raw bytes
  • voice_name (string, optional)
  • stream (bool, optional)

Response

  • audio_url (string) or streaming PCM chunks
  • voice_id (string)
  • request_id (string)
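
The request and response fields listed above can be modeled as plain dataclasses with basic validation. This is a sketch under assumptions: the class names and `parse_response` are hypothetical, `stream` is assumed to default to false, and `audio_url` is treated as absent when audio is delivered as streamed PCM chunks.

```python
from dataclasses import dataclass
from typing import Optional, Union


@dataclass
class VoiceCloneRequest:
    text: str
    voice_sample: Union[str, bytes]  # file path or raw bytes
    voice_name: Optional[str] = None
    stream: bool = False  # assumed default

    def __post_init__(self) -> None:
        if not isinstance(self.text, str) or not self.text.strip():
            raise ValueError("text is required and must be a non-empty string")
        if not isinstance(self.voice_sample, (str, bytes)) or not self.voice_sample:
            raise ValueError("voice_sample is required (string or bytes)")


@dataclass
class VoiceCloneResponse:
    voice_id: str
    request_id: str
    audio_url: Optional[str] = None  # None when audio arrives as PCM chunks


def parse_response(payload: dict) -> VoiceCloneResponse:
    """Check that the always-present fields exist, then build the response."""
    missing = [k for k in ("voice_id", "request_id") if not payload.get(k)]
    if missing:
        raise ValueError(f"response is missing fields: {missing}")
    return VoiceCloneResponse(
        voice_id=payload["voice_id"],
        request_id=payload["request_id"],
        audio_url=payload.get("audio_url"),
    )
```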

Operational guidance

  • Use clean speech samples with low background noise.
  • Respect consent and policy requirements for cloned voices.
  • Persist the generated voice_id and reuse it for future synthesis requests.
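
One way to persist and reuse a voice_id is a small JSON file keyed by a hash of the enrollment sample, so the same audio never has to be enrolled twice. The sketch below is an assumption about how this could be done locally; the function names and the store layout are hypothetical and not part of the skill.

```python
import hashlib
import json
from pathlib import Path
from typing import Optional


def sample_fingerprint(sample: bytes) -> str:
    """Stable key for an enrollment sample: SHA-256 of its raw bytes."""
    return hashlib.sha256(sample).hexdigest()


def load_voice_id(store: Path, sample: bytes) -> Optional[str]:
    """Return the saved voice_id for this sample, or None if not enrolled yet."""
    if not store.is_file():
        return None
    entries = json.loads(store.read_text(encoding="utf-8"))
    return entries.get(sample_fingerprint(sample))


def save_voice_id(store: Path, sample: bytes, voice_id: str) -> None:
    """Record the voice_id returned by enrollment for later synthesis calls."""
    entries = json.loads(store.read_text(encoding="utf-8")) if store.is_file() else {}
    entries[sample_fingerprint(sample)] = voice_id
    store.parent.mkdir(parents=True, exist_ok=True)
    store.write_text(json.dumps(entries, indent=2, sort_keys=True), encoding="utf-8")
```

A caller would check `load_voice_id` first and only enroll the sample when it returns `None`.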

Local helper script

Prepare a normalized request JSON and validate the response schema:

.venv/bin/python skills/ai/audio/alicloud-ai-audio-tts-voice-clone/scripts/prepare_voice_clone_request.py \
  --text "Welcome to this voice-clone demo" \
  --voice-sample "https://example.com/voice-sample.wav"

Output location

  • Default output: output/ai-audio-tts-voice-clone/audio/
  • Override base dir with OUTPUT_DIR.
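
The override rule can be sketched as follows. This assumes that OUTPUT_DIR replaces only the leading `output` component of the default path; `resolve_output_dir` is a hypothetical helper, and the skill's own scripts may resolve the path differently.

```python
import os
from pathlib import Path
from typing import Mapping


def resolve_output_dir(env: Mapping[str, str] = os.environ) -> Path:
    """Default to output/ai-audio-tts-voice-clone/audio/, honoring OUTPUT_DIR."""
    # Assumption: OUTPUT_DIR swaps the base directory, the rest stays fixed.
    base = env.get("OUTPUT_DIR") or "output"
    return Path(base) / "ai-audio-tts-voice-clone" / "audio"
```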

Validation

mkdir -p output/alicloud-ai-audio-tts-voice-clone
for f in skills/ai/audio/alicloud-ai-audio-tts-voice-clone/scripts/*.py; do
  python3 -m py_compile "$f"
done
echo "py_compile_ok" > output/alicloud-ai-audio-tts-voice-clone/validate.txt

Pass criteria: command exits 0 and output/alicloud-ai-audio-tts-voice-clone/validate.txt is generated.

Output And Evidence

  • Save artifacts, command outputs, and API response summaries under output/alicloud-ai-audio-tts-voice-clone/.
  • Include key parameters (region/resource id/time range) in evidence files for reproducibility.
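
An evidence file can be as simple as a timestamped JSON record of the parameters used and a summary of the response. The helper below is a hypothetical sketch; the field names (`recorded_at`, `parameters`, `response_summary`) are assumptions and not a format the skill prescribes.

```python
import json
from datetime import datetime, timezone
from pathlib import Path

EVIDENCE_DIR = Path("output/alicloud-ai-audio-tts-voice-clone")


def write_evidence(
    name: str,
    parameters: dict,
    response_summary: dict,
    evidence_dir: Path = EVIDENCE_DIR,
) -> Path:
    """Write one JSON evidence file and return its path."""
    record = {
        "recorded_at": datetime.now(timezone.utc).isoformat(),
        "parameters": parameters,  # e.g. model, region, voice_name
        "response_summary": response_summary,  # e.g. request_id, voice_id
    }
    evidence_dir.mkdir(parents=True, exist_ok=True)
    path = evidence_dir / f"{name}.json"
    path.write_text(json.dumps(record, indent=2, sort_keys=True), encoding="utf-8")
    return path
```

Keep secrets such as the API key out of `parameters`, since evidence files are meant to be shared.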

Workflow

  1. Confirm user intent, region, identifiers, and whether the operation is read-only or mutating.
  2. Run one minimal read-only query first to verify connectivity and permissions.
  3. Execute the target operation with explicit parameters and bounded scope.
  4. Verify results and save output/evidence files.

References

  • references/sources.md

