Google Enhances Gemini Live with Multilingual Support for AI-Powered Conversations in India

  • Patrick Schuster
  • Oct 04, 2024
  • 0
Google Enhances Gemini Live with Multilingual Support for AI-Powered Conversations in India

The annual Google for India event, focusing on the country's unique needs, took place recently in New Delhi. This gathering unveiled various enhancements aimed at enriching user experience. Among the highlights was the introduction of advanced features for Google's AI chatbot, Gemini. Following the rollout of the two-way verbal communication functionality known as Gemini Live last month, updates now include support for Hindi and eight additional regional languages.

The Senior Director of Product Management announced the extension of Gemini Live's capabilities to include Hindi alongside various regional languages. This innovative AI feature enables real-time interactions with users, allowing them to engage in natural conversations. By posing questions verbally, users receive immediate, spoken replies from the AI. Initially revealed at Google I/O, this tool is a product of Google DeepMind's efforts.

Initially available to Gemini Advanced subscribers in August, Gemini Live has since been made accessible to users on the free tier of the app for Android devices. Previously available only in English, the inclusion of more languages represents a significant expansion of its functionality.

With the latest enhancements, support for Hindi, Bengali, Gujarat's, Karnataka's, Kerala's, Maharashtra's, Andhra Pradesh's, Tamil Nadu's, as well as Urdu languages is now being implemented. This allows speakers of these languages to utilize the chatbot's capabilities, enabling them to input prompts as well as receive responses in their native tongue. Users from Gadgets 360 reported successful interactions with the feature in several supported languages.

Gemini Live retains all the generative functions of the traditional text-based chatbot, permitting users to ask follow-up inquiries without the need for repeating background information. This encourages a more fluid conversational style, resembling exchanges between two individuals. However, while it produces responses in real-time, it does not possess contextual voice variation or emotional expression, which are features found in other advanced voice systems.

To engage with Gemini Live, users simply need to open the Gemini app or activate the assistant on their Android devices. A distinctive waveform icon is located adjacent to the text input area. By tapping this icon, users can access a full-screen interface, enabling them to start speaking their questions and receive almost immediate responses from the AI. If they wish to halt or terminate the interaction, users can choose from two available buttons — Hold and End call — positioned at the screen's bottom.

Share this Post: