Google Deepmind just dropped Genie 3... (WOAH)

The video showcases Google DeepMind’s Genie 3, an AI-powered tool that generates interactive virtual worlds from user prompts, allowing for exploration, customization, and remixing of environments. While the host demonstrates its impressive creative capabilities and features, he also notes current limitations like laggy controls, perspective issues, and a short simulation time.

The video introduces Google DeepMind’s Genie 3, a groundbreaking interactive world model that allows users to generate and explore fully AI-created virtual environments. The host, Alex, steps in for Matt, who is unwell, and walks viewers through the Genie 3 interface. Users can select from pre-made test worlds or create their own, with each simulation lasting about a minute. Alex demonstrates by exploring a racetrack world, highlighting the AI’s ability to generate detailed, interactive environments, and notes the option to download a video of the session for easy sharing.

Alex then creates custom worlds using prompts generated by ChatGPT, such as an organic alien construct in space and a floating cloud city inspired by Skyward Sword. The process involves an initial sketch phase, powered by Nano Banana Pro, where users can modify aspects like color and perspective before the final world is generated. While the AI impressively maintains structural consistency between sketches, Alex notes some limitations, such as laggy controls and occasional issues with perspective (first-person vs. third-person views).

The video delves into the technical aspects of Genie 3, referencing DeepMind’s blog post. Genie 3 combines technologies like Nano Banana Pro and Gemini to handle prompt processing, sketch generation, and final world creation. The platform’s three core features are world sketching (customizing environments and characters), world exploration (real-time path generation as users move), and world remixing (modifying existing worlds or building on curated examples). Users can also download videos of their explorations, though the current prototype has limitations like latency, less controllable characters, and a 60-second time cap per simulation.

Alex demonstrates the remixing feature by altering an existing racetrack world, changing the car’s color and the grass, and observing how the AI adapts the environment. He also tests image-based world creation by uploading a photo and generating a Lego-themed city, noting both the creative possibilities and some quirky AI-generated results. The randomize feature is explored, revealing that it currently cycles through preset example environments and characters rather than generating entirely new combinations.

In conclusion, Alex praises Genie 3’s ability to generate immersive, interactive worlds from simple prompts, despite some early-stage performance issues. He highlights the platform’s potential for creativity and experimentation, especially as features and stability improve. The video ends with encouragement for viewers to engage with the channel and look forward to further explorations of AI-generated worlds.