Google is reportedly working on a major update for its AI-powered chatbot, Gemini Live, that would allow users to upload files and discuss their contents through voice conversations. Currently, the Gemini chatbot supports two-way conversations via text and voice, but the addition of file upload capabilities to the voice interface could transform Gemini Live into an even more powerful tool for handling documents on the go.
How Gemini Live Will Handle File Uploads
The upgrade, which was uncovered through an APK teardown of the Google app’s latest beta, indicates several potential new commands, including “Open Live,” “Talk about attachment,” and “Open Live with attachment.” This suggests that users will soon be able to engage in verbal conversations with Gemini about the content of their files, making it easier to extract information, ask questions, and gain insights from documents, spreadsheets, and text-heavy files without typing.
Currently, users can upload files to Gemini only for text-based interactions. The addition of voice-based interaction with files marks a significant shift, allowing a hands-free experience where users can ask questions and receive responses about file contents. This feature is expected to make it easier for professionals who rely on quick insights from complex documents while multitasking.
Availability: For Gemini Advanced Subscribers
This enhanced Gemini Live feature is expected to be available exclusively to Gemini Advanced subscribers. Google recently expanded Gemini Live access to Android users, though the file upload capability has remained a feature limited to subscribers. The Gemini Advanced subscription, available through the Google One AI Premium plan, currently costs ₹1,950 per month (approx. $23.50) and offers advanced AI features across Google’s ecosystem.
Gemini Live currently supports Hindi and eight regional Indian languages, in addition to English, making it accessible to a wider user base in multilingual settings. However, the new file-related upgrade will likely be restricted to Android devices for the time being, without immediate plans for web or iOS availability.
How This Update Could Enhance Productivity
With the planned Gemini Live updates, users can expect:
- Seamless Document Analysis: The ability to discuss text-based documents through audio could help users extract information from lengthy reports or spreadsheets more effectively.
- Multitasking: By providing hands-free document analysis, Gemini Live could enable users to review important files even while on the move, without needing to view the Gemini interface.
- Efficient Workflow for Professionals: This feature could be particularly beneficial in professional settings, helping individuals quickly access information from files during meetings, calls, or on-site inspections.
What’s Next for Gemini Live
Google’s latest updates and ongoing enhancements with Gemini demonstrate its commitment to making Gemini a versatile AI tool in both personal and professional settings. If successful, this feature could make Gemini Live a competitive offering in the AI productivity space, rivaling other AI-powered tools that aim to simplify document interaction and provide dynamic, voice-based data insights.
Final Thoughts
Google’s efforts to refine Gemini Live with conversational file support reflect the tech giant’s broader focus on creating intuitive, AI-powered tools that support real-time productivity. For Gemini Advanced subscribers, the upcoming feature promises to make interacting with files more accessible, engaging, and aligned with today’s fast-paced, multitasking workflows.