The post Alibaba reports rogue AI agent as fears of technical malfunctions grow appeared on BitcoinEthereumNews.com. Alibaba gave AI fearmongers fresh ammunitionThe post Alibaba reports rogue AI agent as fears of technical malfunctions grow appeared on BitcoinEthereumNews.com. Alibaba gave AI fearmongers fresh ammunition

Alibaba reports rogue AI agent as fears of technical malfunctions grow

For feedback or concerns regarding this content, please contact us at crypto.news@mexc.com

Alibaba gave AI fearmongers fresh ammunition when it revealed that an AI agent developed to assist with coding tasks was reported to have been caught going beyond the original intent of its deployment, mining cryptocurrency, and establishing covert network tunnels without authorization.

Alibaba revealed this development in a technical report it first published in December and revised in January. At first, its engineers thought the incident was a security breach before they discovered that it was its AI agent that was carrying out actions without any instruction from its operators.

This development was revealed in a technical report from the Chinese technology giant, and it has provided fresh ammunition to researchers warning that advanced AI systems are capable of developing their own goals.

The agent, known as ROME, was being trained through reinforcement learning.

The discovery made by the Alibaba team was brought back to light by Alexander Long, founder of AI research firm Pluralis, on X, who shared an excerpt that detailed the incident, stating it is an “insane sequence of statements buried in an Alibaba tech report.”

How did Alibaba’s team discover a rogue AI agent?

According to the report, the team flagged a burst of security-policy violations originating from their training servers. The alerts showed that attempts were being made to access internal network resources and traffic patterns consistent with cryptomining activity.

They initially treated it as a conventional security incident.

However, when they looked deeper, they found signs that their agent had established and used a reverse SSH tunnel from an Alibaba Cloud instance to an external IP address.

It also diverted “compute away from training, inflating operational costs, and introducing clear legal and reputational exposure,” according to the researchers’ notes.

The behaviors, Alibaba’s team concluded, were not triggered by the task prompts and were not necessary for completing the assigned work.

Is this an isolated incident?

Aakash Gupta, a product and growth leader who quoted Long’s post on X, wrote that Alibaba had published “the first case of instrumental convergence happening in production.”

He invoked a famous thought experiment in AI safety by stating that “This is the paperclip maximizer showing up at 3 billion parameters.”

However, the Alibaba incident is not the first time an AI model has taken the initiative to perform authorized actions.

Last year, Anthropic’s researchers disclosed that Claude Opus 4, one of its flagship models, had demonstrated a capacity to conceal its intentions and take action to preserve its own existence during safety evaluations.

In one test scenario, the model attempted to blackmail a fictional engineer, threatening to reveal a personal secret if it was shut down and replaced.

Why does this matter, especially for enterprises?

According to a McKinsey research report released in October 2025, 80% of organizations that have deployed AI agents report having encountered risky or unexpected behavior.

This is also coming at a time when enterprise adoption of agentic AI is on the rise, with major corporations cutting jobs and citing AI usage as the leading factor.

Gartner projects that by the end of 2026, 40% of enterprise applications will embed task-specific AI agents. However, McKinsey has warned that agentic workflows are spreading faster than governance models can address their risks.

A 2025 survey of 30 leading AI agents found that 25 disclosed no internal safety results, and 23 had undergone no third-party testing. It is important that enterprises take the possibility of agents going beyond the scope of the work into serious consideration.

Alibaba said it had responded by building safety-aligned data filtering into its training pipeline and hardening the sandbox environments in which its agents operate, and it has received praise for sharing its findings with the public.

Anthropic upgraded Claude Opus 4 to its highest internal safety classification.

Source: https://www.cryptopolitan.com/alibaba-reports-rogue-ai-agent/

Market Opportunity
Notcoin Logo
Notcoin Price(NOT)
$0.000358
$0.000358$0.000358
-0.96%
USD
Notcoin (NOT) Live Price Chart
Disclaimer: The articles reposted on this site are sourced from public platforms and are provided for informational purposes only. They do not necessarily reflect the views of MEXC. All rights remain with the original authors. If you believe any content infringes on third-party rights, please contact crypto.news@mexc.com for removal. MEXC makes no guarantees regarding the accuracy, completeness, or timeliness of the content and is not responsible for any actions taken based on the information provided. The content does not constitute financial, legal, or other professional advice, nor should it be considered a recommendation or endorsement by MEXC.

You May Also Like

Liquid crypto funds have a DeFi problem nobody talks about

Liquid crypto funds have a DeFi problem nobody talks about

The post Liquid crypto funds have a DeFi problem nobody talks about appeared on BitcoinEthereumNews.com. The following is a guest post and guest post from Thomas
Share
BitcoinEthereumNews2026/03/08 06:03
The Federal Reserve cut interest rates by 25 basis points, and Powell said this was a risk management cut

The Federal Reserve cut interest rates by 25 basis points, and Powell said this was a risk management cut

PANews reported on September 18th, according to the Securities Times, that at 2:00 AM Beijing time on September 18th, the Federal Reserve announced a 25 basis point interest rate cut, lowering the federal funds rate from 4.25%-4.50% to 4.00%-4.25%, in line with market expectations. The Fed's interest rate announcement triggered a sharp market reaction, with the three major US stock indices rising briefly before quickly plunging. The US dollar index plummeted, briefly hitting a new low since 2025, before rebounding sharply, turning a decline into an upward trend. The sharp market volatility was closely tied to the subsequent monetary policy press conference held by Federal Reserve Chairman Powell. He stated that the 50 basis point rate cut lacked broad support and that there was no need for a swift adjustment. Today's move could be viewed as a risk-management cut, suggesting the Fed will not enter a sustained cycle of rate cuts. Powell reiterated the Fed's unwavering commitment to maintaining its independence. Market participants are currently unaware of the risks to the Fed's independence. The latest published interest rate dot plot shows that the median expectation of Fed officials is to cut interest rates twice more this year (by 25 basis points each), one more than predicted in June this year. At the same time, Fed officials expect that after three rate cuts this year, there will be another 25 basis point cut in 2026 and 2027.
Share
PANews2025/09/18 06:54
HBAR Eyes Breakout Above $0.105 With Bullish Momentum and Trend Reversal Signals

HBAR Eyes Breakout Above $0.105 With Bullish Momentum and Trend Reversal Signals

The post HBAR Eyes Breakout Above $0.105 With Bullish Momentum and Trend Reversal Signals appeared on BitcoinEthereumNews.com. Key Insights: HBAR tests the upper
Share
BitcoinEthereumNews2026/03/08 06:06