asr
5 indexed skills · max 10 per page
alicloud-ai-audio-asr-test
cinience/alicloud-skills · Cloud
Category: test
asr
answerzhao/agent-skills · Productivity
This skill guides the implementation of speech-to-text (ASR) functionality using the z-ai-web-dev-sdk package, enabling accurate transcription of spoken audio into text.
alicloud-ai-audio-asr
cinience/alicloud-skills · Cloud
Category: provider
qwen-asr
aahl/skills · Productivity
Qwen ASR
Transcribe an audio file (wav/mp3/ogg...) to text using Qwen ASR. No configuration or API key required.
Usage:
    uv run scripts/main.py -f audio.wav
    cat audio.mp3 | uv run scripts/main.py > transcript.txt
    curl https://example.com/audio.ogg | uv run scripts/main.py
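For callers who would rather invoke the script from Python than from a shell, the usage lines above could be wrapped roughly as follows. This is a minimal sketch, not part of the skill itself: the helper names `build_command` and `transcribe` and the `skill_dir` parameter are hypothetical, and it assumes the transcript is written to stdout, as the `> transcript.txt` redirect in the usage suggests.

```python
import subprocess
from typing import List, Optional


def build_command(audio_path: Optional[str] = None) -> List[str]:
    """Build the argv for the script shown in the listing's usage lines.

    With a path, the file is passed via -f. Without one, the script is
    expected to read audio from stdin, as in the `cat ... |` example.
    """
    cmd = ["uv", "run", "scripts/main.py"]
    if audio_path is not None:
        cmd += ["-f", audio_path]
    return cmd


def transcribe(audio_path: str, skill_dir: str = ".") -> str:
    """Run the script and return its stdout.

    Assumption: skill_dir is a checkout of the skill that contains
    scripts/main.py, and `uv` is available on PATH.
    """
    result = subprocess.run(
        build_command(audio_path),
        cwd=skill_dir,
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError on a non-zero exit code
    )
    return result.stdout.strip()
```

Calling `transcribe("audio.wav", skill_dir="path/to/skill")` would then mirror the first usage line; the path shown is a placeholder.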
asr
marswaveai/skills · Productivity
Local offline audio transcription with multi-language support and optional AI polishing.
- Transcribes audio files to text using coli asr with no API keys required; supports Chinese, English, Japanese, Korean, and Cantonese via sensevoice model, or English-only via whisper-tiny
- Models download automatically on first use (~60MB) to ~/.coli/models/; requires coli CLI and ffmpeg (WAV files work without it)
- Optional AI polishing step corrects punctuation, removes filler words, and improve