Highlights
- Google’s AI Mode now supports multimodal capabilities.
- It allows users to upload or snap images and ask detailed questions for contextual answers.
- This builds on Google’s visual search expertise and leverages AI to understand objects, their context and their relationships within images.
- The feature is rolling out to millions more Labs users in the U.S.
Caption – Google brings multimodal search to AI Mode. (Image credit – Google)
Google is making its AI Mode even smarter by adding multimodal support, which means you can now ask questions using images. Whether you upload a photo or snap one with your camera, AI Mode can understand the picture and give you detailed, helpful answers.
This upgrade is now rolling out to millions more Labs users in the U.S., bringing the experience to a much wider audience.
Ask Questions with Images in AI Mode
Google calls this a multimodal capability, and it builds on technology the company has used for years in visual search. With AI Mode, you can now take a picture of a product, an object or even a whole scene and ask questions about it.
Here’s how Robby Stein, VP of Product at Google Search, explains it: “With AI Mode’s new multimodal understanding, you can snap a photo or upload an image, ask a question about it and get a rich, comprehensive response with links to dive deeper.”
Stein adds, “AI Mode builds on our years of work on visual search and takes it a step further. With Gemini’s multimodal capabilities, AI Mode can understand the entire scene in an image, including the context of how objects relate to one another and their unique materials, colors, shapes and arrangements.”
“Drawing on our deep visual search expertise, Lens precisely identifies each object in the image. Using our query fan-out technique, AI Mode then issues multiple queries about the image as a whole and the objects within the image, accessing more breadth and depth of information than a traditional search on Google. The result is a response that’s incredibly nuanced and contextually relevant, so you can take the next step.”
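To make the fan-out idea in that quote more concrete, here is a minimal Python sketch. It is not Google’s implementation, and none of these names come from Google: DetectedObject, fan_out_queries, placeholder_search and gather_sources are all hypothetical, and the search backend is a stand-in that returns placeholder strings. The sketch only shows the general pattern of building one query for the whole scene plus one per identified object, running them in parallel and merging the results.

```python
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass


@dataclass(frozen=True)
class DetectedObject:
    # Hypothetical output of an object-identification step (e.g. "ceramic vase").
    label: str


def fan_out_queries(question, scene_description, objects):
    """Build one query about the whole scene plus one per detected object."""
    queries = [f"{question} {scene_description}"]
    for obj in objects:
        queries.append(f"{question} {obj.label}")
    # Drop duplicate queries while keeping their original order.
    return list(dict.fromkeys(queries))


def placeholder_search(query):
    """Stand-in for a real search backend (assumption): returns fake results."""
    return [f"result for: {query}"]


def gather_sources(question, scene_description, objects, search=placeholder_search):
    """Run every fanned-out query in parallel and merge results without duplicates."""
    queries = fan_out_queries(question, scene_description, objects)
    with ThreadPoolExecutor() as pool:
        # pool.map yields results in the same order as the input queries.
        result_lists = list(pool.map(search, queries))
    merged, seen = [], set()
    for results in result_lists:
        for item in results:
            if item not in seen:
                seen.add(item)
                merged.append(item)
    return merged


if __name__ == "__main__":
    shelf = [DetectedObject("book A"), DetectedObject("book B")]
    for source in gather_sources("similar titles to", "bookshelf", shelf):
        print(source)
```

In a real system the merged results would feed a model that writes the final answer; that step is left out here because the article does not describe it.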
Google recently expanded access to AI Mode, first to people without a Google One AI Premium subscription and then to another wave of users last week. Now, a third (possibly even fourth) batch of users in the U.S. is being invited as Google brings AI Mode to millions more Labs participants.
This update shows where search is heading. Instead of just typing in text, you’ll soon be able to search by showing Google what you’re curious about. So, if you haven’t tried AI Mode yet, now’s a great time to explore it and keep an eye on how it might change the way traffic flows to websites in the future.
FAQs
Q1. What is the new feature introduced in Google’s AI Mode?
Answer. Google’s AI Mode now supports multimodal functionality, allowing users to upload or capture images and ask questions based on those images for detailed responses.
Q2. How does AI Mode enhance the search experience with images?
Answer. AI Mode uses multimodal understanding to analyze objects, their context, and relationships within images, providing nuanced and contextually relevant answers.
Q3. Who can access the new AI Mode feature and when?
Answer. Google is rolling out this feature to millions of Labs users in the U.S., making it accessible to more people in batches without requiring a Google One AI Premium subscription.