Google’s Gemini Omni turns images, audio, and text into video — and that’s just the start

· TechCrunch AI ·

Google's Gemini Omni is a new multimodal model that reasons across text, images, audio, and video to generate and edit videos through simple conversation, starting with Omni Flash.

Categories: Model Releases