Gemini Omni Flash
by Google (coming soon)

Gemini Omni Flash is a video-first model for multimodal storytelling and world simulation. It takes text, image, video, and audio references as input, generates clips with native speech and sound effects, and lets creators refine the result through conversational editing.

Gemini Omni Flash AI video generation on Krea

What Gemini Omni Flash Can Do

01 Multimodal prompt

Prompt with Text, Images, Audio, and Video

Gemini Omni Flash is built for prompts that bring media references together, so creators can guide a shot with visual direction, audio cues, and scene context in one request.

02 Conversational editing

Revise Motion with Conversational Editing

Conversational editing is a core Gemini Omni workflow, letting teams refine generated video with natural-language instructions instead of restarting every shot.

03 Native audio

Generate Video with Speech and Sound Effects

The model is designed for native audio-video output, including generated speech and sound effects that support richer storyboards, ads, and cinematic experiments.


Built for Professional Video Creation

World simulation

Video-First World Simulation

Gemini Omni Flash is built as a video-first model for multimodal storytelling and world simulation, aimed at filmmakers, storytellers, and creative professionals.

Multimodal inputs

Reference-Aware Media Generation

Combine prompt text with image, video, and audio inputs to steer subject, environment, mood, and sound direction from the same creative brief.

Audio and video

Native Audio in the Clip

Gemini Omni Flash can generate professional video with speech and sound effects, narrowing the gap between a visual draft and a complete scene concept.

Transparency

Transparent Generated Media

Gemini Omni Flash outputs include SynthID watermarking and C2PA content credentials, giving viewers and teams a clearer signal that the media is AI-generated.

