Character.ai Unveils Efficient Techniques for Large-Scale Pretraining

2025/12/24 14:41


Tony Kim
Dec 23, 2025 21:56

Character.ai describes methods it developed for large-scale pretraining, including Squinch, dynamic clamping, and Gumbel Softmax, aimed at making AI model training more efficient.

Character.ai has shared details of its early work on optimizing large-scale transformer training. The company, which has since shifted its focus to open-source model foundations, originally explored several techniques to improve training efficiency and speed, according to the Character.AI Blog.

Gradient Compression: Squinch

One of the techniques highlighted is Squinch, a gradient compression algorithm developed by co-founder Noam Shazeer. It compresses gradients to 6 bits per element, which reduces the communication bandwidth needed during distributed training while maintaining model accuracy.
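
The article does not describe how Squinch works internally, so the following is not Squinch itself. It is a minimal sketch of generic 6-bit block quantization, assuming uniform levels and one shared scale per block (both assumptions, as are all function names), to show where the bandwidth saving comes from: eight 6-bit codes fit in 6 bytes, compared with 32 bytes for eight float32 values.

```python
import math

def quantize_6bit(block):
    """Map floats to signed integer codes in [-31, 31] using one shared scale.

    Illustrative only: a uniform quantizer, not the Squinch algorithm.
    """
    max_abs = max(abs(x) for x in block)
    scale = max_abs / 31 if max_abs > 0 else 1.0
    codes = [max(-31, min(31, round(x / scale))) for x in block]
    return codes, scale

def dequantize_6bit(codes, scale):
    """Recover approximate floats from the integer codes."""
    return [c * scale for c in codes]

def pack_6bit(codes):
    """Pack signed codes into bytes, using exactly 6 bits per code."""
    bits = 0
    for i, c in enumerate(codes):
        bits |= (c + 32) << (6 * i)  # shift into the unsigned range 1..63
    return bits.to_bytes(math.ceil(6 * len(codes) / 8), "little")

def unpack_6bit(data, count):
    """Inverse of pack_6bit: read `count` 6-bit codes back out."""
    bits = int.from_bytes(data, "little")
    return [((bits >> (6 * i)) & 0x3F) - 32 for i in range(count)]

# Hypothetical gradient block of eight values.
grads = [0.5, -0.25, 0.0, 1.0, -1.0, 0.125, 0.75, -0.5]
codes, scale = quantize_6bit(grads)
payload = pack_6bit(codes)
restored = dequantize_6bit(unpack_6bit(payload, len(codes)), scale)
```

In this sketch the shared scale must also be transmitted, so the effective cost is slightly above 6 bits per element; larger blocks amortize that overhead.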

Precision Regularization: Attention Z-Reg

Character.ai also developed Attention Z-Reg, a regularization method applied to attention logits to keep them numerically stable. The technique helps preserve the precision of bfloat16 representations, which matters when training large models.
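
The article gives no formula for Attention Z-Reg. As an illustration of the general idea only, the sketch below uses a z-loss-style penalty, a related but separately known technique that penalizes the squared log-sum-exp of a row of logits. The function names and the coefficient value are hypothetical.

```python
import math

def logsumexp(logits):
    """Numerically stable log(sum(exp(x))) over one row of logits."""
    m = max(logits)
    return m + math.log(sum(math.exp(x - m) for x in logits))

def softmax(logits):
    """Attention weights for one row of logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def z_penalty(logits, coeff=1e-4):
    """Auxiliary loss term that grows as the logits drift away from zero.

    Assumption: a z-loss-style term, not the formula used by Attention Z-Reg.
    """
    z = logsumexp(logits)
    return coeff * z * z

centered = [-1.0, 0.0, 1.0]
shifted = [x + 20.0 for x in centered]  # same softmax, larger magnitudes
```

Because softmax is unchanged when a constant is added to every logit, `centered` and `shifted` produce the same attention weights, but the penalty is larger for `shifted`. A term like this therefore favors the small-magnitude representation, where a low-precision format such as bfloat16 has finer absolute spacing between representable values.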

Quantization Stability: Dynamic Clamping

Dynamic Clamping is used to improve quantization stability. It calculates the clamping range dynamically from the root mean square of the input weights, which prevents small activation values from collapsing to zero and improves training stability by reducing quantization error.
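
A minimal sketch of why the clamping range matters, under stated assumptions: the symmetric 127-level quantizer, the `multiplier` of 4.0, and the sample numbers are all hypothetical and are not taken from Character.ai's implementation. Only the idea of deriving the range from the RMS of the weights comes from the article.

```python
import math

def rms(values):
    """Root mean square of a list of numbers."""
    return math.sqrt(sum(v * v for v in values) / len(values))

def dynamic_clamp(weights, multiplier=4.0):
    """Clamp range derived from the RMS of the weights (multiplier is assumed)."""
    return multiplier * rms(weights)

def fake_quantize(values, clamp, levels=127):
    """Clamp to [-clamp, clamp], then round to the nearest of the uniform levels."""
    step = clamp / levels
    out = []
    for v in values:
        v = max(-clamp, min(clamp, v))
        out.append(round(v / step) * step)
    return out

weights = [0.01, -0.02, 0.015, -0.005]
activations = [0.003, -0.004, 0.002]

# A wide fixed range makes each step coarse, so small values round to zero.
fixed = fake_quantize(activations, clamp=6.0)

# A range scaled to the weights keeps the step small enough to resolve them.
dynamic = fake_quantize(activations, clamp=dynamic_clamp(weights))
```

With these numbers, every entry of `fixed` is zero while every entry of `dynamic` is non-zero, which is the collapse the paragraph above describes.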

Efficient Attention API: Visibility Mask

The Visibility Mask is an API for representing inter-token relationships during training and inference. It manages attention ranges within batches and supports tree-structured document relationships and bidirectional attention, which has improved the efficiency of the company’s training systems.
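
The article does not show the Visibility Mask API's actual signature. The sketch below assumes a hypothetical representation in which each token carries a half-open range `[start, limit)` of positions it may attend to, and expands those ranges into a boolean mask. It covers two of the cases mentioned above: causal attention over documents packed into one sequence, and bidirectional attention within a span.

```python
def build_mask(starts, limits):
    """Expand per-token attention ranges into a square boolean mask.

    mask[q][k] is True when query position q may attend to key position k,
    that is, when starts[q] <= k < limits[q]. The range encoding is an
    assumption made for illustration.
    """
    n = len(starts)
    return [[starts[q] <= k < limits[q] for k in range(n)] for q in range(n)]

# Two documents (lengths 3 and 2) packed into one sequence, causal attention:
# each token sees its own document up to and including itself.
packed_causal = build_mask(starts=[0, 0, 0, 3, 3], limits=[1, 2, 3, 4, 5])

# One 3-token span with bidirectional attention: every token sees the whole span.
bidirectional = build_mask(starts=[0, 0, 0], limits=[3, 3, 3])
```

Storing two integers per token instead of a full mask row keeps the description linear in sequence length; whether the real API makes the same trade-off is not stated in the article.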

Distillation Optimization: Gumbel Softmax

For model distillation, Character.ai used the Gumbel Softmax technique to reduce storage and bandwidth costs while maintaining the fidelity of teacher models. The approach samples subsets of the teacher model’s outputs and preserves their soft target values, so the student model can be trained more efficiently.
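
The article does not spell out the sampling procedure. One standard way to draw a subset of outputs in proportion to their probabilities is the Gumbel-top-k trick: add independent Gumbel noise to each logit and keep the k largest. The sketch below assumes that approach; the function names, the example logits, and the choice of k are hypothetical.

```python
import math
import random

def softmax(logits):
    """Teacher probabilities for one position."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def sample_teacher_subset(logits, k, rng):
    """Pick k distinct token indices via Gumbel-top-k and keep their soft targets.

    Returns a dict mapping token index to the teacher's probability for it.
    """
    probs = softmax(logits)
    keyed = []
    for i, logit in enumerate(logits):
        u = max(rng.random(), 1e-12)      # guard against log(0)
        gumbel = -math.log(-math.log(u))  # standard Gumbel(0, 1) noise
        keyed.append((logit + gumbel, i))
    top = sorted(keyed, reverse=True)[:k]
    return {i: probs[i] for _, i in top}

# Hypothetical teacher logits over an 8-token vocabulary.
teacher_logits = [2.0, 0.5, -1.0, 3.0, 0.0, 1.5, -2.0, 0.2]
subset = sample_teacher_subset(teacher_logits, k=3, rng=random.Random(0))
```

Storing k index and probability pairs per position instead of one probability per vocabulary entry is where the storage and bandwidth saving would come from in a scheme like this, and the kept values remain soft targets rather than hard labels.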

Character.ai’s work on pretraining optimization points toward more efficient AI model training, even as the company shifts toward post-training reinforcement learning for open-source models. Techniques such as Squinch and Gumbel Softmax reflect the company’s focus on efficiency and scalability.

Source: https://blockchain.news/news/character-ai-unveils-efficient-techniques-for-large-scale-pretraining