Gemini Omni

May 20, 2026 · 7:29 AM UTC ·11 min read · 0 reactions · 0 comments · 10 views

#ai #technology #video #Google #Gemini #YouTube Shorts #Koray Kavukcuoglu

⚡ TL;DR · AI summary

Google has introduced Gemini Omni, a new AI model capable of creating videos from various inputs. This model allows users to edit videos using natural language and offers the ability to transform scenes creatively. The first version, Gemini Omni Flash, is now available in the Gemini app, Google Flow, and YouTube Shorts.

Key facts

▪Gemini Omni can create videos from images, audio, video, and text inputs.
▪Users can edit videos through conversation, maintaining consistency in characters and scenes.
▪The model is designed to support future output modalities like image and audio.

Original article

Google

Read full at Google →

Opening excerpt (first ~120 words) tap to expand

Breadcrumb Innovation & AI Models & research Gemini Models Introducing Gemini Omni { "reading_time": "[[read\u002Dtime]] min read" } Share x.com Facebook LinkedIn Mail Copy link Gemini Omni Flash is a model that can create anything from any input – starting with video. Koray Kavukcuoglu CTO, Google DeepMind and Chief AI Architect, Google Share x.com Facebook LinkedIn Mail Copy link { "@context": "https://schema.org/", "@type": "VideoObject", "name": "Introducing Gemini Omni: Create Anything from Anything", "description": "a YouTube video showing Gemini Omni", "thumbnailUrl": "https://i.ytimg.com/vi_webp/KUyRq7szZsM/maxresdefault.webp", "uploadDate": "2026-05-19T17:45:00+00:00", "contentUrl": "https://www.youtube.com/watch?v=KUyRq7szZsM", "embedUrl":…

Excerpt limited to ~120 words for fair-use compliance. The full article is at Google.

Anonymous · no account needed

Discussion

0 comments

Gemini Omni

Discussion

More from Google