Google has recently offered a first look at Magic Pointer, an AI-powered browsing experience originally designed for the new premium laptops Googlebook.
The real news, confirmed in the last hours, is that this technology will not remain an exclusive of the new operating system, but will be integrated directly into Gemini in the Chrome browser.
According to Google DeepMind, which outlined the foundational principles of this technology in a recent post on their blog, the traditional mouse pointer has not undergone significant changes for over fifty years.
The current intention is to overcome this stagnation by introducing an extremely advanced context-understanding capability, powered by Gemini’s language models.
The vision behind the project is to create an intelligent and intuitive assistant capable of accompanying users in the tools they already regularly use, without breaking the rhythm of their work.
In practice, this means saying goodbye to the hassle of copying and pasting long blocks of text or having to draft complex textual requests. Through the Chrome integration, it will be enough to move the cursor over a specific element on screen and ask for support.
The system is able to decipher not only the indicated object, but also the user’s intent. DeepMind emphasizes how this dynamic makes human-computer interaction much more natural, almost conversational.
Instead of crafting elaborate commands, you will simply indicate a point and say simple phrases such as “set up this part“, “move this element here“, or “explain this concept“, leveraging visual selection to communicate your needs directly.
One of the most technically interesting aspects presented by the company is the system’s ability to transform the simple pixels displayed on screen into entities with which it is possible to interact. Gemini can autonomously recognize objects, dates, places and snippets of code.
Practical applications range from immediate interaction with information about a restaurant appearing in a video, to processing handwritten notes. Among the examples provided by researchers, there is the possibility to select several articles on a webpage to request a comparison, or to indicate an empty space in a photo of your living room to preview the footprint of a new sofa.
Regarding the distribution of these features, the company seems to have planned a clear differentiation. The more advanced computing and interaction capabilities of Magic Pointer will primarily find space on future Googlebooks.
The Chrome browser version, however, will offer a more basic set of tools, primarily focused on operations like data comparison and visual manipulation of images.
At the moment, management has not announced a precise timeline nor has it specified which markets will receive access first. Tests currently conducted on public versions of Gemini in Chrome do not yet show traces of the new tool, suggesting that the release phase will proceed gradually and in a controlled manner over the coming months.
More and more people are turning to smartwatches powered by the Wear OS operating system…
The Chinese company has announced the opening of the beta program dedicated to Android 17:…
Over ten years after the high-profile commercial failure of the Fire Phone, Amazon seems to…
The device is characterized by a yellow color and top features: a powerful motor of…
Ahead of the annual developers' conference WWDC, which will open its doors on June 8,…
If you thought that the current memory shortages were the only factor capable of driving…