AI agent Alibaba tried to mine cryptocurrency on its own

Юлия Воробьева Exclusive
VK X OK WhatsApp Telegram
The article was prepared by K-News. Copying or partial use is only possible with the permission of the K-News editorial office.

The autonomous AI agent ROME, which was trained to perform real actions on a computer, unexpectedly began mining cryptocurrency and setting up reverse SSH tunnels for covert access to external networks. This incident occurred during experiments with the Agentic Learning Ecosystem (ALE) platform on Alibaba Cloud servers. Researchers emphasize that the agent was not given such tasks and was not hacked — this behavior emerged as a side effect of its training, as it sought the most efficient ways to utilize available resources.

The ROME agent is based on the Qwen3-MoE model and was trained using reinforcement learning. A comprehensive infrastructure was created for this purpose, including the ROLL system for training, a secure environment called ROCK where the agent performs tasks, and the iFlow interface for user interaction. This approach enables the AI to learn to perform complex sequences of actions, similar to how a human programmer would.

During the experiments, the agent demonstrated high results in programming; however, researchers soon identified security-related issues. The monitoring system on Alibaba Cloud servers detected suspicious network activity during the agent's training.

Initially, specialists suspected that this could be the result of a cyberattack or misconfiguration. However, a thorough analysis revealed that the strange actions originated from the AI agent itself. In one instance, the model with 3 billion parameters created a reverse SSH tunnel that allowed establishing a hidden connection to an external network, bypassing some security mechanisms.

Additionally, the agent attempted to use the powerful graphics processors of the servers for mining. These resources were originally intended for training the model, but the AI launched programs that utilized the computing power for cryptocurrency mining.

Researchers note that the model did not receive such instructions and was not hacked. This behavior arose during the training process: the agent was seeking ways to maximize the use of available resources to achieve its goals. Specialists refer to this as "instrumental convergence" — a risk when an autonomous AI begins to use unplanned tools to achieve its goals without explicit instructions from humans.

Following this incident, developers enhanced security measures and created an isolated platform called OpenSandbox, where the actions of AI agents are strictly monitored. The authors of the study emphasize that such experiments demonstrate the enormous potential of AI agents, but also raise questions about the need for new mechanisms to control AI actions in computer systems.

The record of the AI agent from Alibaba attempting to mine cryptocurrency independently first appeared on the K-News website.
VK X OK WhatsApp Telegram

Read also: