In a groundbreaking move, Google has announced the launch of Gemini Live, its latest AI voice assistant designed to revolutionize conversational interactions. The new assistant is set to compete directly with OpenAI’s ChatGPT Voice Mode and promises a more natural, dynamic conversational experience for Android users.
Unlike traditional AI platforms that follow a rigid prompt-response pattern, Gemini Live introduces a more fluid, interactive dialogue. Users can seamlessly interrupt and redirect conversations, allowing for a more flexible and engaging interaction. This feature aims to enhance daily tasks such as planning meals, choosing outfits, or even co-writing speeches.
The announcement came during Google’s early showcase at the Made by Google event, where Jenny Blackburn, Vice President of User Experience, highlighted the assistant’s versatility. “A truly helpful personal AI assistant must go beyond the basics,” Blackburn stated. “It needs to engage in genuine conversation. Sometimes, talking through complex issues is the most effective approach.”
In a live demonstration, Blackburn showcased Gemini Live’s capabilities by brainstorming weekend activities for her niece and nephew. The AI assistant suggested creative projects like invisible ink art and provided real-time responses to Blackburn’s queries, illustrating its conversational prowess.
The Gemini app is free to download from the Play Store, and Gemini Live offers ten distinct voices, though none resembles Scarlett Johansson. The assistant is designed to act as a “sidekick” on your phone, with future updates promising enhanced research capabilities. For instance, it will be able to generate detailed research reports, such as a plan for opening a new café, by analyzing web data and compiling the findings into a comprehensive document.
Rick Osterloh, Senior Vice President of Google Devices and Services, revealed that Gemini Live will also integrate with various Google products, including Workspace, Chrome, YouTube, and Gmail. This integration marks the beginning of the “Gemini era,” with AI becoming central to Google’s technology stack.
Osterloh emphasized Google’s commitment to making Gemini Live universally accessible, starting with Android devices. “Our goal is to embed Gemini deeply into Android and provide breakthrough mobile experiences to billions,” he said.
The new Pixel 9 series smartphones will feature exclusive Gemini functionalities, powered by Google’s Tensor G4 chip. These include an AI-enhanced weather app, advanced image editing tools, and the AI “Call Assist” feature. Call Assist provides on-device transcripts and summaries of phone calls, ensuring privacy through Gemini Nano, an on-device AI that does not rely on cloud storage.
Gemini Live and other Gemini features began rolling out yesterday for Pixel, Samsung, and other Android devices. However, access requires a Gemini Advanced subscription, which is included in the Google One AI Premium Plan at $32.99 per month or comes with the purchase of a Pixel 9 Pro. Currently, availability in Australia is limited, with users seeking updates on discussion platforms like Reddit.
As Google continues to innovate, Gemini Live represents a significant leap in AI-assisted personal interaction, setting new standards for conversational technology.