MiniMax releases M2.7, a 230B-parameter mixture-of-experts model optimized for NVIDIA GPUs with up to 2.7x throughput gains on Blackwell hardware. (Read More)MiniMax releases M2.7, a 230B-parameter mixture-of-experts model optimized for NVIDIA GPUs with up to 2.7x throughput gains on Blackwell hardware. (Read More)

MiniMax M2.7 Brings 230B-Parameter AI Model to NVIDIA Infrastructure

2026/04/12 09:37
2 min read
For feedback or concerns regarding this content, please contact us at crypto.news@mexc.com

MiniMax M2.7 Brings 230B-Parameter AI Model to NVIDIA Infrastructure

Ted Hisokawa Apr 12, 2026 01:37

MiniMax releases M2.7, a 230B-parameter mixture-of-experts model optimized for NVIDIA GPUs with up to 2.7x throughput gains on Blackwell hardware.

MiniMax M2.7 Brings 230B-Parameter AI Model to NVIDIA Infrastructure

MiniMax has released M2.7, a 230-billion parameter open-weights AI model designed specifically for autonomous agent workflows, now available across NVIDIA's inference ecosystem including the company's latest Blackwell Ultra GPUs.

The model represents a significant efficiency play in enterprise AI. Despite its massive 230B total parameters, M2.7 activates only 10B parameters per token—a 4.3% activation rate achieved through mixture-of-experts (MoE) architecture with 256 local experts. This keeps inference costs manageable while maintaining the reasoning capacity of a much larger model.

Performance Numbers on Blackwell

NVIDIA collaborated with open source communities to optimize M2.7 for production workloads. Two key optimizations—a fused QK RMS Norm kernel and FP8 MoE integration from TensorRT-LLM—delivered substantial throughput improvements on Blackwell Ultra GPUs.

Testing with a 1K/1K input/output sequence length dataset showed vLLM achieved up to 2.5x throughput improvement, while SGLang hit 2.7x gains. Both optimizations were implemented within a single month, suggesting further performance headroom exists.

Technical Architecture

M2.7 supports 200K input context length across 62 layers, using multi-head causal self-attention with Rotary Position Embeddings (RoPE). A top-k expert routing mechanism activates only 8 of the 256 experts for any given input, which is how the model maintains low inference costs despite its scale.

The architecture targets coding challenges and complex agentic tasks—workflows where AI systems need to plan, execute, and iterate autonomously rather than respond to single prompts.

Deployment Options

Developers can access M2.7 through multiple channels. NVIDIA's NemoClaw reference stack provides a one-click deployment for running autonomous agents with OpenShell runtime. The model is also available through NVIDIA NIM containerized microservices for on-premise, cloud, or hybrid deployments.

For teams wanting to customize the model, NVIDIA's NeMo AutoModel library supports fine-tuning with published recipes. Reinforcement learning workflows are available through NeMo RL with sample configurations for 8K and 16K sequence lengths.

Free GPU-accelerated endpoints on build.nvidia.com allow testing before committing to infrastructure. The open weights are also available on Hugging Face for self-hosted deployments.

The release positions MiniMax as a credible alternative to closed models from OpenAI and Anthropic for enterprises building autonomous AI systems, particularly those already invested in NVIDIA infrastructure.

Image source: Shutterstock
  • nvidia
  • minimax
  • ai infrastructure
  • machine learning
  • enterprise ai
Market Opportunity
The 7 Wanderers Logo
The 7 Wanderers Price(7)
$0.0000187
$0.0000187$0.0000187
-5.79%
USD
The 7 Wanderers (7) Live Price Chart
Disclaimer: The articles reposted on this site are sourced from public platforms and are provided for informational purposes only. They do not necessarily reflect the views of MEXC. All rights remain with the original authors. If you believe any content infringes on third-party rights, please contact crypto.news@mexc.com for removal. MEXC makes no guarantees regarding the accuracy, completeness, or timeliness of the content and is not responsible for any actions taken based on the information provided. The content does not constitute financial, legal, or other professional advice, nor should it be considered a recommendation or endorsement by MEXC.

USD1 Genesis: 0 Fees + 12% APR

USD1 Genesis: 0 Fees + 12% APRUSD1 Genesis: 0 Fees + 12% APR

New users: stake for up to 600% APR. Limited time!