speak-tts
emzod/speak · Productivity
Real-time text-to-speech with voice cloning on Apple Silicon, entirely on-device.

Supports multiple input sources (text files, markdown, stdin, web articles, PDFs) and output modes (streaming, file save, playback, or both)
Voice cloning from 10–30 second WAV samples at 24000 Hz mono; includes emotion tags like [laugh], [sigh], and [gasp] for audible effects
Batch processing with auto-chunking for long documents, concatenation utilities, and resume capability for interrupted generat