Alibaba Group Holding Limited (BABA) stock soars as new AI pooling tech slashes Nvidia GPU use by 82%

TLDR

  • Alibaba slashes GPU usage 82% with Aegaeon, fueling AI at massive scale.
  • Aegaeon cuts AI model-switching latency by 97%, boosting performance.
  • One Nvidia H20 GPU now serves up to 7 LLMs at once in Alibaba’s AI upgrade.
  • Alibaba Cloud improves GPU efficiency with token-level auto-scaling.
  • Aegaeon powers China’s AI goals while cutting reliance on Nvidia chips.

Alibaba Group Holding Limited closed at $167.05, marking a 1.19% increase, following a major breakthrough in AI infrastructure.

The company introduced a computing pooling solution that cut Nvidia GPU usage by 82% in model-serving operations. This advance positions Alibaba Cloud ahead in the race to optimize AI deployment at scale.

Aegaeon boosts efficiency, cuts GPU dependency

Alibaba Cloud, the cloud computing arm of the Hangzhou-based firm, implemented a new system called Aegaeon to boost AI efficiency. The solution allows a single Nvidia H20 GPU to serve up to seven large language models concurrently. In testing, this reduced the number of GPUs needed from 1,192 to 213, a cut of roughly 82%.
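
As a quick check on the headline figure, the drop from 1,192 to 213 GPUs can be worked out directly. The short Python snippet below is only illustrative arithmetic on the numbers reported in this article; the variable names are our own.

```python
# GPU counts as reported in the article.
gpus_before = 1192
gpus_after = 213

# Fraction of GPUs no longer needed.
reduction = (gpus_before - gpus_after) / gpus_before

print(f"GPUs saved: {gpus_before - gpus_after}")   # GPUs saved: 979
print(f"Reduction:  {reduction:.1%}")              # Reduction:  82.1%
```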

Aegaeon works by performing auto-scaling at the token level during model inference across concurrent AI workloads. Because resources are reallocated per generated token rather than per request, the same GPU can switch between models partway through producing a response. The system also cut model-switching latency by 97%.
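
To give a rough sense of what scheduling at the token level means, the toy Python sketch below simulates one GPU that hands control back to a scheduler after every generated token, so requests for different models interleave. It is a simplified round-robin illustration and not Aegaeon's actual algorithm; the `Request` class, the `serve_token_level` function and the model names are hypothetical.

```python
from collections import deque
from dataclasses import dataclass


@dataclass
class Request:
    model: str        # hypothetical model name
    tokens_left: int  # tokens still to be generated for this request


def serve_token_level(requests):
    """Round-robin over requests, yielding the simulated GPU after each token.

    Returns the per-token order in which models occupied the GPU and the
    number of times the GPU had to switch from one model to another.
    """
    queue = deque(requests)
    timeline = []
    switches = 0
    loaded = None  # model currently resident on the simulated GPU
    while queue:
        req = queue.popleft()
        if req.model != loaded:
            if loaded is not None:
                switches += 1      # a different model must be brought in
            loaded = req.model
        timeline.append(req.model)  # "generate" exactly one token
        req.tokens_left -= 1
        if req.tokens_left > 0:
            queue.append(req)       # unfinished: go to the back of the queue
    return timeline, switches


timeline, switches = serve_token_level(
    [Request("A", 2), Request("B", 1), Request("C", 3)]
)
print(timeline)  # ['A', 'B', 'C', 'A', 'C', 'C']
print(switches)  # 4
```

In this toy run the GPU changes models four times while producing six tokens, which hints at why the cost of each switch matters so much for this style of serving.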

The solution was beta-tested for over three months in Alibaba Cloud’s Bailian marketplace. It handled dozens of models with up to 72 billion parameters without service degradation. Aegaeon has now been formally deployed in Alibaba’s model marketplace, which serves its proprietary Qwen models.

Model market insights and performance optimization

Alibaba Cloud found that only a small number of models are frequently used in real-world AI tasks. Despite this, many GPUs were allocated to rarely called models, resulting in low resource utilization. Data showed that 17.7% of GPUs served just 1.35% of total inference requests.
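
The size of that imbalance is easier to see as a ratio. The snippet below is illustrative arithmetic on the two percentages reported above; the variable names are our own and the "over-provisioning" label is just one way to read the figures.

```python
# Shares as reported in the article, in percent.
gpu_share = 17.7       # share of GPUs allocated to rarely called models
request_share = 1.35   # share of inference requests those GPUs served

# How many times larger the hardware share is than the traffic share.
overprovisioning = gpu_share / request_share

print(f"{overprovisioning:.1f}x")  # 13.1x
```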

With Aegaeon, the company addressed this imbalance by pooling GPUs across models and scaling each model's share up or down with demand. This kept GPUs busy instead of leaving them idle behind rarely used models. Alibaba reported higher throughput and improved hardware efficiency for enterprise deployments.

Researchers from Peking University and Alibaba Cloud co-authored a technical paper detailing the system, presented at the ACM Symposium on Operating Systems Principles (SOSP) 2025 in Seoul, South Korea. The study underlined that serving concurrent workloads with traditional GPU allocation methods incurred unnecessary costs. The work supports China’s goal of modernizing AI infrastructure under resource constraints.

Nvidia’s role and China’s chip strategy shift

Nvidia developed the H20 GPU specifically for AI inference in China, complying with U.S. export restrictions. However, Chinese regulators recently launched a probe into possible backdoor security vulnerabilities in the chip. This scrutiny has affected the chip’s market position and adoption within China.

Chinese firms like Huawei and Cambricon are accelerating development of domestic GPUs to reduce foreign dependency. Nvidia’s CEO stated that the company’s market share for advanced AI chips in China has fallen to zero. This trend pushes local players to innovate and localize AI hardware supply chains.

Alibaba’s new approach strengthens its market stance while aligning with national strategies for tech self-sufficiency. By reducing reliance on U.S. chips, Alibaba gains a stronger foothold in China’s evolving AI ecosystem. The stock rise reflects confidence in its technology-led cost savings and scalability.

The post Alibaba Group Holding Limited (BABA) stock soars as new AI pooling tech slashes Nvidia GPU use by 82% appeared first on CoinCentral.

