Categorie: News

Gemini is getting closer to being able to control all of your apps

Already during the Google I/O 2025, the American company had anticipated its vision with Project Astra, showing how the artificial intelligence could not only “see” what happens on the smartphone screen, but physically interact with it, swiping pages and pressing buttons.

Today, thanks to the in-depth analysis of the beta versions of the Google app, concrete details emerge on how this functionality will take shape on Android devices.

Gemini will be able to control your smartphone, here’s how

Credits: Google

Recent discoveries made within the code of the Google application for Android (specifically in the beta version 17.4.66) have brought to light new text strings that outline how what is internally called by the codename “Bonobo” works.

The publicly used terminology, however, seems to lean toward the concept of “screen automation”, i.e., screen automation.

This technology promises to transform Gemini into an operating agent capable of carrying out complex actions on behalf of the user. The descriptions found in the software explicitly indicate the AI’s ability to handle practical tasks such as placing online orders or booking rides on platforms like Uber or Lyft.

It’s no longer about tapping various links to complete the action, but about letting the assistant navigate the app interface, select options and finalize the request.

The responsibility remains with the user

Despite the promise of autonomous assistance, Google seems to take an extremely cautious approach regarding the responsibility for actions performed by AI.

The warning messages integrated into the system emphasize that Gemini can make mistakes and that the user is required to supervise operations closely. One of the text strings clearly states that the user is responsible for what the agent does on their behalf, inviting monitoring of progress and manual intervention if necessary.

This aspect introduces a curious contradiction in the user experience: the goal of an autonomous agent should be to free the user from repetitive tasks, reducing human intervention. However, the need for constant supervision could, at least in an initial phase, not significantly reduce the cognitive load required, turning the user from actor into controller.

Privacy and management of sensitive data

Another fundamental chapter concerns privacy and data security during the use of screen automation. The informational notes discovered in the code warn that when Gemini interacts with an application, screenshots of activities could be analyzed by human reviewers to improve the service, if the option to save the activity is active.

Therefore, Google strongly advises against entering login credentials or payment information directly in chats with Gemini or using the automation for tasks involving highly sensitive data.

It remains to be clarified how the system will handle the moment of payment within third-party apps: it is not specified whether the AI will pause to let the user enter credit card details or whether there will be specific security protocols.

Finally, the integration of these features could lead to visible changes in the user interface. References to a section “My Orders” or “Purchases” within the user profile have been identified, suggesting that Google intends to centralize the history of actions performed by the assistant, offering a single hub to monitor transactions carried out via automation.

Although there is not yet an official release date, it is evident that the move from a simple chatbot to a true operating agent is imminent.

Luca Zaninello

Appassionato del mondo della telefonia da sempre, da oltre un decennio si occupa di provare con mano i prodotti e di raccontare le sue esperienze al pubblico del web. Fotografo amatoriale, ha un occhio di riguardo per i cameraphone più esagerati.

Recent Posts

A low-cost Xiaomi is in the world’s top 10 best-selling smartphones, among the giants

Recently Counterpoint published the world's best-selling smartphones list for the fourth quarter of 2025. This…

10 hours ago

Here are the Motorola Razr 70 images: the family is now complete

After taking a first look at the bigger brother Razr 70 Ultra, here come the…

12 hours ago

There will be another vivo X300 FE, with Zeiss photography kit and a new color

Last month the Chinese company announced vivo X300 FE, but the launch took place quietly…

13 hours ago

Realme C100i on Google Play Console, with images and specifications

After the introduction of Realme C100 5G and the appearance of C100x in an European…

15 hours ago

Google Chrome: Vertical Tabs Arrive and the New Reading Mode

A new update for Google Chrome on desktop is arriving and brings with it two…

15 hours ago

Motorola Edge 70 Pro shown for the first time: refined design and 3 colors

Recently the first press renders dedicated to Razr 70 Ultra, the brand's next folding flagship,…

16 hours ago