Google’s Gemini Omni turns images, audio, and text into video — and that’s just the start
Excerpt
Google's Gemini Omni is a new multimodal model that reasons across text, images, audio, and video to generate and edit videos through simple conversation, starting with Omni Flash.
Read at source: https://techcrunch.com/2026/05/19/googles-gemini-omni-turns-images-audio-and-text-into-video-and-thats-just-the-start/