Gemini Omni
Google has introduced Gemini Omni, a new AI model capable of creating videos from various inputs. This model allows users to edit videos using natural language and offers the ability to transform scenes creatively. The first version, Gemini Omni Flash, is now available in the Gemini app, Google Flow, and YouTube Shorts.
- ▪Gemini Omni can create videos from images, audio, video, and text inputs.
- ▪Users can edit videos through conversation, maintaining consistency in characters and scenes.
- ▪The model is designed to support future output modalities like image and audio.
Opening excerpt (first ~120 words) tap to expand
Breadcrumb Innovation & AI Models & research Gemini Models Introducing Gemini Omni { "reading_time": "[[read\u002Dtime]] min read" } Share x.com Facebook LinkedIn Mail Copy link Gemini Omni Flash is a model that can create anything from any input – starting with video. Koray Kavukcuoglu CTO, Google DeepMind and Chief AI Architect, Google Share x.com Facebook LinkedIn Mail Copy link { "@context": "https://schema.org/", "@type": "VideoObject", "name": "Introducing Gemini Omni: Create Anything from Anything", "description": "a YouTube video showing Gemini Omni", "thumbnailUrl": "https://i.ytimg.com/vi_webp/KUyRq7szZsM/maxresdefault.webp", "uploadDate": "2026-05-19T17:45:00+00:00", "contentUrl": "https://www.youtube.com/watch?v=KUyRq7szZsM", "embedUrl":…
Excerpt limited to ~120 words for fair-use compliance. The full article is at Google.