The post Alibaba reports rogue AI agent as fears of technical malfunctions grow appeared on BitcoinEthereumNews.com. Alibaba gave AI fearmongers fresh ammunitionThe post Alibaba reports rogue AI agent as fears of technical malfunctions grow appeared on BitcoinEthereumNews.com. Alibaba gave AI fearmongers fresh ammunition

Alibaba reports rogue AI agent as fears of technical malfunctions grow

For feedback or concerns regarding this content, please contact us at crypto.news@mexc.com

Alibaba gave AI fearmongers fresh ammunition when it revealed that an AI agent developed to assist with coding tasks was reported to have been caught going beyond the original intent of its deployment, mining cryptocurrency, and establishing covert network tunnels without authorization.

Alibaba revealed this development in a technical report it first published in December and revised in January. At first, its engineers thought the incident was a security breach before they discovered that it was its AI agent that was carrying out actions without any instruction from its operators.

This development was revealed in a technical report from the Chinese technology giant, and it has provided fresh ammunition to researchers warning that advanced AI systems are capable of developing their own goals.

The agent, known as ROME, was being trained through reinforcement learning.

The discovery made by the Alibaba team was brought back to light by Alexander Long, founder of AI research firm Pluralis, on X, who shared an excerpt that detailed the incident, stating it is an “insane sequence of statements buried in an Alibaba tech report.”

How did Alibaba’s team discover a rogue AI agent?

According to the report, the team flagged a burst of security-policy violations originating from their training servers. The alerts showed that attempts were being made to access internal network resources and traffic patterns consistent with cryptomining activity.

They initially treated it as a conventional security incident.

However, when they looked deeper, they found signs that their agent had established and used a reverse SSH tunnel from an Alibaba Cloud instance to an external IP address.

It also diverted “compute away from training, inflating operational costs, and introducing clear legal and reputational exposure,” according to the researchers’ notes.

The behaviors, Alibaba’s team concluded, were not triggered by the task prompts and were not necessary for completing the assigned work.

Is this an isolated incident?

Aakash Gupta, a product and growth leader who quoted Long’s post on X, wrote that Alibaba had published “the first case of instrumental convergence happening in production.”

He invoked a famous thought experiment in AI safety by stating that “This is the paperclip maximizer showing up at 3 billion parameters.”

However, the Alibaba incident is not the first time an AI model has taken the initiative to perform authorized actions.

Last year, Anthropic’s researchers disclosed that Claude Opus 4, one of its flagship models, had demonstrated a capacity to conceal its intentions and take action to preserve its own existence during safety evaluations.

In one test scenario, the model attempted to blackmail a fictional engineer, threatening to reveal a personal secret if it was shut down and replaced.

Why does this matter, especially for enterprises?

According to a McKinsey research report released in October 2025, 80% of organizations that have deployed AI agents report having encountered risky or unexpected behavior.

This is also coming at a time when enterprise adoption of agentic AI is on the rise, with major corporations cutting jobs and citing AI usage as the leading factor.

Gartner projects that by the end of 2026, 40% of enterprise applications will embed task-specific AI agents. However, McKinsey has warned that agentic workflows are spreading faster than governance models can address their risks.

A 2025 survey of 30 leading AI agents found that 25 disclosed no internal safety results, and 23 had undergone no third-party testing. It is important that enterprises take the possibility of agents going beyond the scope of the work into serious consideration.

Alibaba said it had responded by building safety-aligned data filtering into its training pipeline and hardening the sandbox environments in which its agents operate, and it has received praise for sharing its findings with the public.

Anthropic upgraded Claude Opus 4 to its highest internal safety classification.

Source: https://www.cryptopolitan.com/alibaba-reports-rogue-ai-agent/

Market Opportunity
Notcoin Logo
Notcoin Price(NOT)
$0.0003553
$0.0003553$0.0003553
-1.71%
USD
Notcoin (NOT) Live Price Chart
Disclaimer: The articles reposted on this site are sourced from public platforms and are provided for informational purposes only. They do not necessarily reflect the views of MEXC. All rights remain with the original authors. If you believe any content infringes on third-party rights, please contact crypto.news@mexc.com for removal. MEXC makes no guarantees regarding the accuracy, completeness, or timeliness of the content and is not responsible for any actions taken based on the information provided. The content does not constitute financial, legal, or other professional advice, nor should it be considered a recommendation or endorsement by MEXC.

You May Also Like

Zcash is Predicted to Reach $215.89 By Mar 12, 2026

Zcash is Predicted to Reach $215.89 By Mar 12, 2026

The post Zcash is Predicted to Reach $215.89 By Mar 12, 2026 appeared on BitcoinEthereumNews.com. Disclaimer: This is not investment advice. The information provided
Share
BitcoinEthereumNews2026/03/08 08:09
Why Is Crypto Down in 2026? Binance Leverage Hits Exhaustion Lows as Pepeto Lines Up a Moonshot

Why Is Crypto Down in 2026? Binance Leverage Hits Exhaustion Lows as Pepeto Lines Up a Moonshot

Here is something the fear headlines are not telling you. The Binance estimated leverage ratio dropped to 0.146 in early March 2026, its lowest reading since April
Share
Techbullion2026/03/08 08:18
Headwind Helps Best Wallet Token

Headwind Helps Best Wallet Token

The post Headwind Helps Best Wallet Token appeared on BitcoinEthereumNews.com. Google has announced the launch of a new open-source protocol called Agent Payments Protocol (AP2) in partnership with Coinbase, the Ethereum Foundation, and 60 other organizations. This allows AI agents to make payments on behalf of users using various methods such as real-time bank transfers, credit and debit cards, and, most importantly, stablecoins. Let’s explore in detail what this could mean for the broader cryptocurrency markets, and also highlight a presale crypto (Best Wallet Token) that could explode as a result of this development. Google’s Push for Stablecoins Agent Payments Protocol (AP2) uses digital contracts known as ‘Intent Mandates’ and ‘Verifiable Credentials’ to ensure that AI agents undertake only those payments authorized by the user. Mandates, by the way, are cryptographically signed, tamper-proof digital contracts that act as verifiable proof of a user’s instruction. For example, let’s say you instruct an AI agent to never spend more than $200 in a single transaction. This instruction is written into an Intent Mandate, which serves as a digital contract. Now, whenever the AI agent tries to make a payment, it must present this mandate as proof of authorization, which will then be verified via the AP2 protocol. Alongside this, Google has also launched the A2A x402 extension to accelerate support for the Web3 ecosystem. This production-ready solution enables agent-based crypto payments and will help reshape the growth of cryptocurrency integration within the AP2 protocol. Google’s inclusion of stablecoins in AP2 is a massive vote of confidence in dollar-pegged cryptocurrencies and a huge step toward making them a mainstream payment option. This widens stablecoin usage beyond trading and speculation, positioning them at the center of the consumption economy. The recent enactment of the GENIUS Act in the U.S. gives stablecoins more structure and legal support. Imagine paying for things like data crawls, per-task…
Share
BitcoinEthereumNews2025/09/18 01:27