FluidVoice is a free, open source (GPLv3) voice-to-text dictation app for macOS. It captures speech via a global hotkey, transcribes locally using models like Parakeet, Nemotron, Whisper, or Apple Speech, and inserts text into any app via accessibility APIs. Optional AI enhancement runs through Fluid Intelligence (fully local) or cloud providers like OpenAI and Groq.

The core FluidVoice app is free and open source under GPLv3. Speech models download to your Mac and run on-device. Fluid Intelligence — the optional local AI enhancement layer — is a separate privately maintained runtime you can download during onboarding; the README positions the base dictation experience as free without a subscription.

How do I install FluidVoice?

Fastest path: brew install --cask fluidvoice. Or download the latest release from github.com/altic-dev/FluidVoice/releases. Grant microphone and accessibility permissions, set a global hotkey, and complete onboarding to pick a speech model.

What is Fluid Intelligence?

Fluid Intelligence is FluidVoice's optional on-device AI enhancement layer for smart formatting, context-aware capitalization, and post-processing. It runs entirely locally (~3.5 GB download) with no API keys and no data leaving your Mac. The runtime itself is privately maintained — separate from the GPLv3 open source app — so the team can sustain a free core product while offering advanced local enhancement.

Which speech models does FluidVoice support?

Nemotron Speech 3.5 (ultra-fast streaming, ~40 languages), Nemotron 3.5 Multilingual, Parakeet Flash (lowest-latency English beta), Parakeet TDT v3 (25 languages), Parakeet TDT v2 (English), Cohere Transcribe (14 languages), Apple Speech (zero-download, system languages), and Whisper Tiny through Large (99 languages, including Intel Mac support).

How does FluidVoice compare to Wispr Flow?

Both target fast macOS dictation with a global hotkey and live preview. Wispr Flow is a commercial cloud-backed product. FluidVoice is open source, local-first by default, and lets you choose on-device models or bring your own cloud API key. Fluid Intelligence adds a Wispr-like local enhancement path without sending transcripts to a vendor — though that layer is not open source.

What are Command Mode and Write Mode?

Command Mode lets you control your Mac by voice — launch apps, run shortcuts, trigger system actions, and automate workflows. Write Mode lets you dictate new text or rewrite selected text directly inside any text field across any app. Both ship in recent 1.5+ builds alongside standard dictation.

What are the system requirements?

macOS 15.0 (Sequoia) or later. Apple Silicon is required for Nemotron, Parakeet, and Cohere models. Intel Macs are supported via Whisper models (from v1.5.1+). Budget ~1 GB disk for a speech model and ~3.5 GB extra if you enable Fluid Intelligence.

FluidVoice 1.6.1: Open Source macOS Dictation With Fluid Intelligence (2026) | explainx.ai Blog

explainx.ainewsletter3.5k

workshops ↗

FluidVoice 1.6.1: Open Source macOS Dictation With Fluid Intelligence (2026) | explainx.ai Blog | explainx.ai

Paid dictation apps taught millions of Mac users that voice can be faster than typing — but most of them still route your speech through someone else's cloud.

FluidVoice takes the opposite default: open source dictation where transcription runs on your Mac, text inserts into any app via accessibility APIs, and optional enhancement can stay local too. The project crossed 5,000 GitHub stars ahead of v1.6.1 (released late June 2026), positioning itself as the fastest native Parakeet implementation on macOS and a credible local Wispr Flow alternative.

Install in one line:

bash

brew install --cask fluidvoice

Or grab the latest build from GitHub Releases.

TL;DR


What it is	GPLv3 macOS dictation app with global hotkey, live overlay, and per-app prompt configs
Latest version	v1.6.1 — builds on 1.6.0's Parakeet speed rebuild and Fluid Intelligence onboarding
Local STT	Nemotron, Parakeet (Flash / TDT v2 & v3), Cohere Transcribe, Apple Speech, Whisper
Local AI	Fluid Intelligence — optional ~3.5 GB on-device enhancement (private runtime, not GPLv3)
Cloud AI	Optional OpenAI, Groq, or custom providers — keys in macOS Keychain
Extra modes	Command Mode (voice-control Mac), Write Mode (dictate/rewrite in any text field)
Requirements	macOS 15+, Apple Silicon for most models; Intel via Whisper
Cost	Core app free and open source; sponsor the project on GitHub Sponsors

Model	Best for	Languages	Download	Hardware
Nemotron Speech 3.5	Ultra-fast streaming multilingual	~40	~670 MB	Apple Silicon
Nemotron 3.5 Multilingual	Higher-accuracy multilingual	~40	~530 MB	Apple Silicon
Parakeet Flash (Beta)	Lowest-latency live English	English	~250 MB	Apple Silicon
Parakeet TDT v3	Fast default multilingual	25	~500 MB	Apple Silicon
Parakeet TDT v2	Fastest English-only	English	~500 MB	Apple Silicon
Cohere Transcribe	High-accuracy multilingual	14	~1.4 GB	Apple Silicon
Apple Speech	Zero-download native macOS	System langs	Built-in	Apple Silicon + Intel
Whisper (Tiny→Large)	Broad compatibility	99	75 MB–2.9 GB	Apple Silicon + Intel

Tool	Focus	Local STT	Local enhancement	Open source	Platform
FluidVoice	Dictation + Command/Write modes	Yes (many models)	Fluid Intelligence (optional, private runtime)	App: GPLv3	macOS 15+
Wispr Flow	Fast dictation SaaS	Partial / cloud-backed	Cloud	No	macOS (+ expanding)
Voicebox	Voice studio (TTS + STT + MCP)	Whisper	Qwen3 local LLM	MIT	macOS, Windows, Linux
HeyClicky	Full computer control	Cloud (GPT-Realtime)	Cloud	No	macOS

FluidVoice 1.6.1: The Open Source macOS Dictation App With On-Device STT and Fluid Intelligence

TL;DR

Related posts

Silent Speech with Ultrasound: Aleph Neuro's 15.6% WER Demo Explained

Voicebox: The Free, Open Source AI Voice Studio That Replaces ElevenLabs and WisprFlow in One App

Meetily: Privacy-First AI Meeting Assistant With Local Whisper and Parakeet

What's New in 1.6.x

Fluid Intelligence: Open App, Private Runtime

Speech Models: Pick Latency vs. Language Coverage

Core Features Beyond Raw Transcription

Live preview and overlay

Global hotkey and smart typing

Command Mode

Write Mode

History, stats, and per-app configs

Updates and beta channel

Privacy and Analytics

Quick Start

How FluidVoice Compares to Other Mac Voice Tools

Community and Roadmap Hints

TL;DR

Related posts

Silent Speech with Ultrasound: Aleph Neuro's 15.6% WER Demo Explained

Voicebox: The Free, Open Source AI Voice Studio That Replaces ElevenLabs and WisprFlow in One App

Meetily: Privacy-First AI Meeting Assistant With Local Whisper and Parakeet

What's New in 1.6.x

Fluid Intelligence: Open App, Private Runtime

Speech Models: Pick Latency vs. Language Coverage

Core Features Beyond Raw Transcription

Live preview and overlay

Global hotkey and smart typing

Command Mode

Write Mode

History, stats, and per-app configs

Updates and beta channel

Privacy and Analytics

Quick Start

How FluidVoice Compares to Other Mac Voice Tools

Community and Roadmap Hints

Related Reading