Alibaba’s Qwen AI Outsmarts Global Peers in Math Benchmarks

2025/11/06 05:21

TL;DR

  • Alibaba’s Qwen3-Max-Thinking achieved perfect scores in AIME and HMMT, marking China’s first flawless AI math performance.
  • OpenAI’s GPT-5 Pro also self-reported perfect results, setting up a new East–West rivalry in reasoning AI.
  • Verification concerns linger, as Alibaba’s results lack third-party validation or evidence of closed-book testing.
  • API access opens doors for developers and investors, with potential cost-performance advantages across Asia-Pacific markets.

Alibaba’s artificial intelligence division has unveiled Qwen3-Max-Thinking, an advanced reasoning model that stunned observers by scoring a perfect 100% in two of the world’s toughest mathematics competitions, the American Invitational Mathematics Examination (AIME) and the Harvard-MIT Mathematics Tournament (HMMT).

This marks a significant milestone for China’s AI industry. It is reportedly the first time a Chinese-developed model has matched or exceeded the scores of Western models on reasoning-heavy academic tests.

The announcement places Alibaba’s AI efforts shoulder-to-shoulder with OpenAI’s GPT-5 Pro, which also self-reported flawless results in the same contests earlier this year.

A Leap for China’s AI Ambitions

According to Alibaba, Qwen3-Max-Thinking is built atop Qwen3-Max, the company’s largest AI model boasting over one trillion parameters. Released in late September, the Qwen3-Max architecture represents Alibaba’s boldest step toward creating general-purpose reasoning models that can compete globally in complex problem-solving tasks.

The math victories are symbolic as much as technical. For years, elite competitions like the AIME and HMMT have been used as unofficial benchmarks for evaluating the reasoning depth and abstract thinking capacity of large language models (LLMs). Perfect accuracy in such events signals that Qwen3-Max-Thinking is closing the performance gap with Western-developed systems.

However, questions remain about transparency and verification. Alibaba’s claims, while headline-grabbing, lack third-party confirmation. Neither the AIME nor HMMT maintains public leaderboards for AI models, and no independent audit has yet verified whether the results were achieved under closed-book, internet-free conditions, a crucial factor in determining authenticity.

Verification Gaps Raise Skepticism

Despite the celebration, experts have urged caution. The absence of public verification means it is unclear whether Qwen3-Max-Thinking truly achieved 100% accuracy under standardized conditions. Unverified results have become a recurring issue in AI benchmarking, as companies race to claim superiority in domains like reasoning, coding, and mathematics.

Further complicating the picture, it remains unclear whether the 2025 versions of the contest problems were used, or whether the model had prior exposure to similar data during training. Without contamination controls (safeguards ensuring the model had not seen the test data before), perfect scores are difficult to validate.
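To make the idea of a contamination control concrete, the sketch below shows one common style of check: measuring how many word n-grams from a test problem also appear verbatim in a training corpus. This is an illustration only, not a description of how Alibaba or any other lab evaluates its models; the function names, the whitespace tokenization, and the default n-gram length are all assumptions.

```python
def ngrams(text, n=8):
    """Return the set of word n-grams in text (lowercased, whitespace-split)."""
    tokens = text.lower().split()
    # If the text has fewer than n tokens, the range is empty and so is the set.
    return {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}


def overlap_ratio(problem, corpus_docs, n=8):
    """Fraction of the problem's n-grams that occur in any corpus document.

    A high ratio suggests the problem text may have been seen in training.
    A ratio of 0.0 does not prove the problem is unseen, since paraphrases
    and translations are not detected by exact n-gram matching.
    """
    problem_grams = ngrams(problem, n)
    if not problem_grams:
        return 0.0
    corpus_grams = set()
    for doc in corpus_docs:
        corpus_grams |= ngrams(doc, n)
    return len(problem_grams & corpus_grams) / len(problem_grams)
```

A real audit would need access to the training data itself, which is exactly what outside reviewers lack when results are self-reported.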

While Alibaba’s announcement has sparked excitement, critics warn that without reproducibility, the victory could remain symbolic rather than scientific.

Developers and Investors Eye API Potential

Beyond benchmark bragging rights, Alibaba’s AI strategy has real commercial implications. The company recently opened API access to Qwen3-Max-Thinking, inviting developers to test its reasoning capabilities in real-world applications.

For software and data teams, this introduces new possibilities for cost-performance routing: dynamically choosing between AI providers based on pricing, accuracy, or latency. Developers in the Asia-Pacific region, particularly those seeking local AI infrastructure options, may find Qwen’s ecosystem attractive if it offers competitive pricing and reliable regional support beyond Singapore.
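As a rough illustration of what cost-performance routing could look like in practice, the sketch below picks the cheapest provider that clears an accuracy floor and a latency ceiling. Everything here is hypothetical: the `Provider` fields, the `pick_provider` helper, the provider names, and the numbers are placeholders, not real pricing or benchmark data for Qwen or any other service.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Provider:
    name: str
    cost_per_1k_tokens: float  # USD; placeholder value
    accuracy: float            # fraction correct on the team's own eval set, 0..1
    latency_ms: float          # median response time in milliseconds


def pick_provider(providers, min_accuracy, max_latency_ms):
    """Return the cheapest provider meeting both thresholds, or None if none do."""
    eligible = [
        p for p in providers
        if p.accuracy >= min_accuracy and p.latency_ms <= max_latency_ms
    ]
    if not eligible:
        return None
    return min(eligible, key=lambda p: p.cost_per_1k_tokens)


# Placeholder figures for illustration only.
PROVIDERS = [
    Provider("provider-a", cost_per_1k_tokens=0.010, accuracy=0.92, latency_ms=900),
    Provider("provider-b", cost_per_1k_tokens=0.004, accuracy=0.88, latency_ms=1500),
    Provider("provider-c", cost_per_1k_tokens=0.020, accuracy=0.95, latency_ms=600),
]
```

With these placeholder values, `pick_provider(PROVIDERS, 0.90, 1000)` selects `provider-a`, while loosening the thresholds to `0.85` and `2000` selects the cheaper `provider-b`. The accuracy figures in such a table would ideally come from a team's own evaluation set rather than from vendor-reported benchmarks.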

Investors are also watching closely. If Qwen3-Max-Thinking can handle complex reasoning tasks while maintaining affordability, Alibaba could carve out a niche among enterprise developers and AI startups looking for alternatives to U.S. providers. The success of such models could signal a new balance in global AI infrastructure, where Chinese models rival or even outperform Western ones in specific tasks.

The post Alibaba’s Qwen AI Outsmarts Global Peers in Math Benchmarks appeared first on CoinCentral.
