Gemini Omni is Google's unreleased video generation model that has been spotted in early tests on the Gemini mobile app on May 12, 2026. It allows users to remix videos, edit directly in chat, and generate video samples from simple text prompts. The model is expected to be officially announced at Google I/O 2026 on May 19-20.

What can Gemini Omni do?

According to early tests, Gemini Omni can: generate videos from text prompts (e.g., suited men dining oceanside with shifting camera angles and clinking glasses), remix and edit existing videos directly in chat, swap objects in videos (like changing elements in anime clips), and maintain strong prompt adherence with coherent motion and voice quality.

How does Gemini Omni compare to other video generation models?

Early testers have placed Gemini Omni slightly above Runway Gen-3 and comparable to leading video models. Feedback highlights strong prompt adherence, smooth motion, impressive math coherence (complex scene elements), and high-quality voice. Some noted minor motion glitches, but overall performance is described as 'one of the best video models' seen to date.

When will Gemini Omni be officially released?

While not officially confirmed, the early leaks ahead of Google I/O 2026 (May 19-20) suggest Gemini Omni will likely be announced at the event. The model is currently tied to high usage limits in the Gemini app, indicating it may be positioned as a premium feature.

What makes Gemini Omni different from other video models?

Gemini Omni appears to unify video generation with Gemini's reasoning capabilities, allowing editing directly in chat and potentially leveraging Google's multimodal foundation. The 'Omni' branding suggests it may handle multiple modalities (text, image, video, voice) in an integrated way, similar to OpenAI's GPT-4o approach.

Gemini Omni Video Model emerges in early Gemini app | explainx.ai Blog

explainx.ainewsletter3.5k

workshops ↗

Gemini Omni Video Model emerges in early Gemini app | explainx.ai Blog | explainx.ai

Model	Strengths (per early reports)	Weaknesses (per early reports)
Gemini Omni	Strong prompt adherence, smooth motion, editing in chat, object swaps, math coherence (complex scenes), voice quality	Minor motion glitches, some missing elements (e.g., "missing centerpiece" in one shot)
Runway Gen-3	High visual quality, cinematic feel	Less conversational editing, no chat interface
Pika 2.0	Fast generation, good for short clips	Less prompt adherence for complex scenes
Kling (Kuaishou)	Strong motion dynamics, longer videos	Less accessible (China-focused rollout)
OpenAI Sora	Impressive samples, strong physics	Not publicly available
Luma Dream Machine	Fast, accessible, good quality	Less control over editing

Product	Gemini integration	Video capability
Gemini Advanced	Core reasoning model	Likely gets Omni access
Google Workspace	Docs, Sheets, Slides, Gmail	Generate videos in Slides, narrate Docs
YouTube	Video understanding, summaries	Remix and edit YouTube videos directly
Google Cloud	Vertex AI, Gemini API	Enterprise video generation at scale
Android	On-device Gemini Nano	Local video editing on Pixel devices
Google Search	AI Overviews	Generate video explainers in search results

Gemini Omni Video Model emerges in early Gemini app tests: remix videos, edit in chat, and generate impressive samples ahead of Google I/O 2026

What early testers are seeing in Gemini Omni

Related posts

Gemini Omni Flash: Google's AI Video Generation Model [2026]

Seedance 2.5: ByteDance's 30-Second 4K AI Video Model

Generate AI Videos in Google Docs with Vids & Veo 3.1【2026】

Sample generation: suited men dining oceanside

Editing capabilities: object swaps in anime clips

How Gemini Omni compares to other video models

What "Omni" likely means: multimodal unification

Usage limits and pricing hints

What this leak tells us about Google I/O 2026

Practical use cases for Gemini Omni

2. Educational content

3. Product demos

4. Creative storytelling

5. A/B testing video ads

How developers should prepare for Gemini Omni

1. Explore the API early

2. Build conversational video editing workflows

3. Combine with other Gemini capabilities

4. Monitor pricing and usage limits

How Gemini Omni fits into Google's AI strategy

Bottom line