TLDR DeepSeek introduced Manifold-Constrained Hyper-Connections (mHC) to improve large-model training scalability and efficiency. The mHC method was tested on 3BTLDR DeepSeek introduced Manifold-Constrained Hyper-Connections (mHC) to improve large-model training scalability and efficiency. The mHC method was tested on 3B

DeepSeek Introduces mHC Architecture to Improve Large Model Training

2026/01/02 00:43
3 min read
For feedback or concerns regarding this content, please contact us at crypto.news@mexc.com

TLDR

  • DeepSeek introduced Manifold-Constrained Hyper-Connections (mHC) to improve large-model training scalability and efficiency.
  • The mHC method was tested on 3B, 9B, and 27B parameter models, showing stable performance without added computational cost.
  • mHC builds on ByteDance’s 2024 hyper-connection architecture by adding a manifold constraint to reduce memory overhead.
  • CEO Liang Wenfeng co-authored and uploaded the paper, reaffirming his direct involvement in DeepSeek’s technical development.
  • Industry observers expect a new DeepSeek model release ahead of Spring Festival 2026, based on the company’s publication patterns.

DeepSeek has released a new AI training method, Manifold-Constrained Hyper-Connections (mHC), in a paper uploaded to arXiv by CEO Liang Wenfeng. The architecture aims to improve training scalability for large models while keeping computational costs low. Researchers tested the method on models with 3, 9, and 27 billion parameters, showing consistent training efficiency. This comes as the company is expected to launch a new model before the Spring Festival in February 2026.

DeepSeek Builds on ResNet and Hyper-Connection Foundations

According to a report by SCMP, the mHC method enhances earlier hyper-connection (HC) designs first proposed by ByteDance in 2024 as an improvement to ResNet. ResNet allows deeper neural networks by preserving signal strength across layers, but faces challenges in maintaining efficient learning at large scale. ByteDance’s HC improved signal flow but didn’t fully address memory usage in larger models.

DeepSeek introduced a manifold constraint to limit expansion and better control memory and compute costs during training. This adjustment preserved the HC benefits while making the network suitable for larger training tasks. Researchers wrote that mHC maintained performance without increasing computational overhead per unit during model training at scale.

Lead authors Zhenda Xie, Yixuan Wei, and Huanqi Cao explained that the system enables stable deep learning without collapse. They confirmed mHC works with minimal infrastructure adjustments, making it efficient for broader deployment. The architecture was tested across multiple model sizes, confirming the technique’s adaptability and reliability. DeepSeek reported that the method handled signal preservation and scalability better than previous HC-based frameworks.

Liang Wenfeng Directly Leads Technical Advancement

CEO Liang Wenfeng was listed as the final author and uploaded the paper himself, continuing his role in major DeepSeek research. He has consistently shared technical papers linked to the company’s top models, such as R1 and V3 on arXiv. Other researchers typically upload supporting studies not directly tied to product development.

His involvement in this paper signals continued leadership in the company’s core AI work. The release underscores DeepSeek’s approach of linking internal research closely with future product direction. Florian Brand, a PhD researcher at Trier University, said DeepSeek papers often indicate what models are coming next.

He noted that the R1 model followed a similar pattern of publication and then launch. Liang’s involvement has again drawn attention from analysts watching DeepSeek’s release schedule. The company has not announced a date, but its publication strategy has become predictable. DeepSeek has remained quiet on details, but research uploads suggest new systems are under development.

The post DeepSeek Introduces mHC Architecture to Improve Large Model Training appeared first on Blockonomi.

Market Opportunity
Hyperlane Logo
Hyperlane Price(HYPER)
$0.0846
$0.0846$0.0846
+2.06%
USD
Hyperlane (HYPER) Live Price Chart
Disclaimer: The articles reposted on this site are sourced from public platforms and are provided for informational purposes only. They do not necessarily reflect the views of MEXC. All rights remain with the original authors. If you believe any content infringes on third-party rights, please contact crypto.news@mexc.com for removal. MEXC makes no guarantees regarding the accuracy, completeness, or timeliness of the content and is not responsible for any actions taken based on the information provided. The content does not constitute financial, legal, or other professional advice, nor should it be considered a recommendation or endorsement by MEXC.

You May Also Like

Etsy witches can apparently turn you into a crypto millionaire for $73

Etsy witches can apparently turn you into a crypto millionaire for $73

                                                                               New snake oil? Etsy witches are hawking spells they claim can change the weather on your wedding day, help you with your love life, or fatten your crypto portfolio.                     Etsy witches have become a massive trend on social media this year — from romance spells to helping manifest fame. Did you know they can also apparently help you become a crypto millionaire? The practice of witchcraft, once punishable by death by fire (or being pushed off a cliff), has become a talking point on TikTok. Online marketplace Etsy, which allows people to sell their handmade beanies and custom dog collars, has become a hub for the spellcasters despite having a ban on “metaphysical services.” Read more
Share
Coinstats2025/10/03 10:08
Ripple CEO Reacts to BBB Rating for Ripple Prime, Lists Three Points It Validates

Ripple CEO Reacts to BBB Rating for Ripple Prime, Lists Three Points It Validates

The post Ripple CEO Reacts to BBB Rating for Ripple Prime, Lists Three Points It Validates appeared on BitcoinEthereumNews.com. Brad Garlinghouse, CEO of Ripple
Share
BitcoinEthereumNews2026/04/03 11:28
REX-Osprey DOJE ETF Launch Drives Dogecoin Surge to $0.28

REX-Osprey DOJE ETF Launch Drives Dogecoin Surge to $0.28

The post REX-Osprey DOJE ETF Launch Drives Dogecoin Surge to $0.28 appeared on BitcoinEthereumNews.com. DOJE ETF Offers Direct Spot Exposure to Dogecoin In a press release, REX-Osprey announced the launch of the first-ever publicly traded ETF to provide exposure to Dogecoin (DOGE). The latest fund is the REX-OspreyDOGE ETF (CBOE: DOJE), an innovation in the cryptocurrency market. It is a unique exchange-traded fund (ETF) that offers direct spot exposure to Dogecoin, which has gained legendary popularity due to its Shiba Inu mascot and fan base of Shiba Inu followers. The introduction of the DOJE ETF is revolutionary for several reasons. It is the first ETF in the United States that provides investors direct access to the spot price of Dogecoin, a widely known cryptocurrency, which lacks inherent utility. This provides a controlled and smooth method for people to invest into DOGE through a regular brokerage account. Using this new product, REX-Osprey remains on the edge of digital asset integration into the regulated financial frameworks. Greg King, CEO of REX Financial and Osprey Funds, expressed his pride in this achievement: “Investors look to ETFs as trading and access vehicles. The digital asset revolution is already underway, and to be able to offer exposure to some of the most popular digital assets within the protections of the U.S. ’40 Act ETF regime is something REX-Osprey™ is proud of and has worked diligently to achieve.” SSK’s Success Sets the Stage for DOGE ETF Launch The DOJE ETF follows the successful launch of REX-Osprey’s SOL + Staking ETF (SSK) in July 2025. This fund became the first-ever U.S.-listed ETF to offer spot Solana exposure alongside on-chain staking rewards. Since its launch, SSK has been a significant success, accumulating over $275 million in assets under management. REX-Osprey has now expanded its crypto offerings with the addition of both DOGE and XRP ETFs, offering investors more opportunities to diversify their…
Share
BitcoinEthereumNews2025/09/19 00:52

Trade GOLD, Share 1,000,000 USDT

Trade GOLD, Share 1,000,000 USDTTrade GOLD, Share 1,000,000 USDT

0 fees, up to 1,000x leverage, deep liquidity