FluidVoice 1.6.1: The Open Source macOS Dictation App With On-Device STT and Fluid Intelligence
FluidVoice is a GPLv3 macOS dictation app with near-instant Parakeet transcription, optional fully local AI enhancement via Fluid Intelligence, Command Mode, and Write Mode β a local-first Wispr Flow alternative. Install with Homebrew, pick from Nemotron, Parakeet, Whisper, or Apple Speech, and dictate into any app without sending voice data to the cloud.
Paid dictation apps taught millions of Mac users that voice can be faster than typing β but most of them still route your speech through someone else's cloud.
FluidVoice takes the opposite default: open source dictation where transcription runs on your Mac, text inserts into any app via accessibility APIs, and optional enhancement can stay local too. The project crossed 5,000 GitHub stars ahead of v1.6.1 (released late June 2026), positioning itself as the fastest native Parakeet implementation on macOS and a credible local Wispr Flow alternative.
Optional OpenAI, Groq, or custom providers β keys in macOS Keychain
Extra modes
Command Mode (voice-control Mac), Write Mode (dictate/rewrite in any text field)
Requirements
macOS 15+, Apple Silicon for most models; Intel via Whisper
Cost
Core app free and open source; sponsor the project on GitHub Sponsors
What's New in 1.6.x
Version 1.6.0 was the headline release; 1.6.1 shipped minor fixes three days later (version bump in Info.plist, stream-flag hardening in LLM request bodies per recent PRs).
The 1.6.0 changelog matters because it reframes what "fast dictation" feels like on Apple Silicon:
Insanely fast Parakeet β rebuilt Parakeet path with near-zero delay between speaking and seeing words on screen.
Fluid Intelligence β fully local AI model for on-device dictation enhancement: smart formatting, context-aware capitalization, post-processing. No cloud, no API keys.
Better theming β adaptive light/dark with a compact toolbar switcher.
Refreshed onboarding β language-first voice engine setup, a real dictation tryout, and optional Fluid Intelligence download in one pass.
The README's tongue-in-cheek warning is accurate: if local enhancement is good enough, you may cancel a paid dictation subscription.
Fluid Intelligence: Open App, Private Runtime
This distinction matters for open-source purists.
FluidVoice the app is GPLv3 (relicensed from Apache 2.0 effective 2026-02-23). You can clone, build, and audit the dictation shell, model integrations, overlay, and settings.
Fluid Intelligence is described as a separately maintained local AI runtime β not shipped as open source. It powers the premium on-device enhancement layer while the core dictation stack stays free. The team says keeping Fluid Intelligence private for now helps sustain free core development; that may change later.
Practically:
Without Fluid Intelligence: FluidVoice still works with any supported speech model plus optional cloud LLM cleanup (OpenAI, Groq, custom).
With Fluid Intelligence: Enhancement stays on your Mac β smart punctuation, capitalization, and post-processing without outbound API calls.
Neither path sends raw audio to FluidVoice's servers by default. Cloud providers only see text if you opt in and configure keys.
Speech Models: Pick Latency vs. Language Coverage
FluidVoice supports more on-device STT backends than most single-purpose dictation apps:
Onboarding is language-first: pick coverage and latency, download once, dictate everywhere.
Core Features Beyond Raw Transcription
Live preview and overlay
Real-time transcription appears in a notch-aware overlay on MacBook Pros with a Dynamic Island-style notch, or a standard overlay elsewhere. Sizes run from minimal pill to large preview β all optional.
Global hotkey and smart typing
Hold your configured hotkey from any app. FluidVoice captures audio, transcribes locally, and inserts text through macOS accessibility APIs β the same integration pattern commercial dictation tools use, but under your control.
Command Mode
Voice-control your Mac: launch apps, run Shortcuts, trigger system actions, automate workflows. This is closer to agentic computer use than pure dictation β think "open Slack and message the team" rather than only pasting transcript text.
Write Mode
Dictate new content or rewrite selected text in place inside any text field β email clients, Notion, VS Code, terminal prompts. Select β speak β replaced or appended text.
History, stats, and per-app configs
Optional local audio history with budget controls and ZIP export. A today-usage stats header and toolbar pill track daily dictation volume. Per-app prompt sets let Slack dictation behave differently from code comments β fully optional.
Updates and beta channel
Sparkle-style auto-updates with an optional beta channel (Settings β Automatic Updates β Beta Releases) for early builds.
Privacy and Analytics
FluidVoice is local-first:
Voice, raw audio, and transcribed text do not leave your machine unless you configure a cloud AI provider.
API keys live in macOS Keychain; the app prompts for "Always allow" on access.
Anonymous analytics are on by default (app version, macOS version, low-cardinality feature flags, approximate usage ranges, success/error outcomes). Disable anytime: Settings β Share Anonymous Analytics. The README explicitly excludes voice, transcripts, window titles, file paths, clipboard, and typed content from collection.
Quick Start
Install:brew install --cask fluidvoice or download from releases.
Permissions: Allow microphone and accessibility β both are required.
Hotkey: Pick a global shortcut in Settings.
Onboarding: Choose a speech model for your language and latency target.
Optional β Fluid Intelligence: Download during onboarding for local enhancement (~3.5 GB).
Optional β cloud AI: Add OpenAI, Groq, or a custom endpoint if you want vendor LLM cleanup instead of or alongside Fluid Intelligence.
Optional β beta: Enable beta releases for early features.
Build from source:
git clone https://github.com/altic-dev/FluidVoice.git
cd FluidVoice
open Fluid.xcodeproj
Requires Xcode; dependencies resolve via Swift Package Manager. Unsigned CI build: xcodebuild -project Fluid.xcodeproj -scheme Fluid -destination 'platform=macOS' CODE_SIGNING_ALLOWED=NO.
When FluidVoice fits best: You want dedicated dictation with model choice, minimal cloud dependency, and optional local enhancement β without running a full voice studio or a cloud agent that controls your entire desktop.
When Voicebox fits better: You also need voice cloning, TTS, and MCP so Claude speaks back in a cloned voice.
When HeyClicky fits better: You want always-on agentic Mac control powered by a cloud realtime model, not primarily transcription speed.