This post appeared on BitcoinEthereumNews.com.

Meta Launches Muse Spark, Its Most Capable AI Yet—But Gemini 3.1 Pro Still Leads the Pack

In brief

  • Meta’s new Muse Spark marks a shift to closed, natively multimodal AI with agent-based reasoning.
  • Meta reports strong benchmark gains in health and search, but still trails Gemini on core reasoning and coding.
  • Built in nine months with far less compute, the model points to a new efficiency-driven AI strategy.

Meta launched Muse Spark on Wednesday, the first model built by Meta Superintelligence Labs, the team assembled nine months ago under Chief AI Officer Alexandr Wang after Meta’s $14 billion investment in Scale AI. It is live now at meta.ai and in the Meta AI app, with a rollout to Facebook, Instagram, and WhatsApp coming in the next few weeks.

This isn’t just another chatbot upgrade or a new version of Llama. Muse Spark is natively multimodal—it processes images, text, and voice from the ground up, rather than bolting vision onto an existing text model. It comes with visual chain-of-thought, tool-use support, and something Meta is calling “Contemplating mode”: a setup that runs multiple AI agents in parallel to tackle harder problems. That’s Meta’s answer to the extended thinking modes from Google’s Gemini Deep Think and OpenAI’s GPT Pro.

“Muse Spark is the first step on our scaling ladder and the first product of a ground-up overhaul of our AI efforts,” Meta wrote in an official announcement. “To support further scaling, we are making strategic investments across the entire stack—from research and model training to infrastructure, including the Hyperion data center.”

The company worked with more than 1,000 physicians to curate training data for Muse Spark’s medical reasoning. The results on HealthBench Hard, a benchmark of open-ended health queries, are striking: Muse Spark scored 42.8, compared to 40.1 for GPT 5.4 and just 20.6 for Gemini 3.1 Pro. The lead over GPT 5.4 is narrow at 2.7 points, but the 22.2-point lead over Gemini is not a marginal difference.

On agentic search (DeepSearchQA), Muse Spark also leads with 74.8, beating Gemini (69.7) and GPT 5.4 (73.6). On CharXiv Reasoning—figure understanding from scientific papers—it scored 86.4, the highest across the models in the comparison.

For those into jailbreaking AI, the model was cracked open within minutes.

But good isn’t the same as great. The overall benchmark picture shows Gemini 3.1 Pro still ahead in most categories. The gap is widest on ARC AGI 2, the abstract reasoning puzzle benchmark, where Gemini scored 76.5 to Muse Spark’s 42.5.

On coding (LiveCodeBench Pro), Gemini’s 82.9 outpaces Meta’s 80.0. On MMMU Pro—multimodal understanding—Gemini scored 83.9 versus 80.4. Meta’s own blog acknowledges current performance gaps in long-horizon agentic systems and coding workflows.

There’s also a notable strategic shift baked into this launch. Muse Spark is a closed model—its architecture and weights won’t be made public. That’s a sharp departure from Llama, which built Meta’s reputation in open AI circles. After Llama 4’s underwhelming reception earlier this year, Meta appears to have decided the next chapter needs to be written differently.

The company says it hopes to open-source future versions of Muse, but for now the code stays inside Meta. The tech giant’s stock climbed nearly 9% on Wednesday following the announcement, and finished the trading day up 6.5% to a price of $612.42.

“Contemplating mode” uses parallel agent orchestration to push the model’s ceiling higher. In that configuration, Muse Spark hit 58% on Humanity’s Last Exam and 38% on FrontierScience Research—territory that makes it competitive with the most capable versions of Gemini and GPT, rather than their standard releases.

Meta is also rolling out a shopping assistant that compares products and links directly to purchases. The planned rollout to Facebook, Instagram, and WhatsApp in the coming weeks follows the playbook Meta has used since Llama 3 and would put Muse Spark in front of more than 3.5 billion users. A private API preview is opening to select developers.

The model, internally codenamed Avocado, was built in nine months. Meta claims that its new pretraining stack can reach the same capability level as Llama 4 Maverick with less than one-tenth of the compute.

Muse Spark is described internally as a “small and fast” first step in the Muse family. A more capable version is already in development.

Source: https://decrypt.co/363691/meta-muse-spark-most-capable-ai-gemini-pro-still-leads

