TLDRs; DeepSeek launched V3.2-exp, an experimental AI model cutting inference costs for long-context tasks by nearly half. The model uses “Sparse Attention” and a “lightning indexer” to handle lengthy inputs more efficiently. Released as an open-weight model on Hugging Face, it allows third-party testing and benchmarking. DeepSeek faces growing competition from heavily funded Chinese tech [...] The post New DeepSeek Model Halves API Costs for Extended Contexts appeared first on CoinCentral.TLDRs; DeepSeek launched V3.2-exp, an experimental AI model cutting inference costs for long-context tasks by nearly half. The model uses “Sparse Attention” and a “lightning indexer” to handle lengthy inputs more efficiently. Released as an open-weight model on Hugging Face, it allows third-party testing and benchmarking. DeepSeek faces growing competition from heavily funded Chinese tech [...] The post New DeepSeek Model Halves API Costs for Extended Contexts appeared first on CoinCentral.

New DeepSeek Model Halves API Costs for Extended Contexts

2025/09/30 21:59
3분 읽기
이 콘텐츠에 대한 의견이나 우려 사항이 있으시면 crypto.news@mexc.com으로 연락주시기 바랍니다

TLDRs;

  • DeepSeek launched V3.2-exp, an experimental AI model cutting inference costs for long-context tasks by nearly half.
  • The model uses “Sparse Attention” and a “lightning indexer” to handle lengthy inputs more efficiently.
  • Released as an open-weight model on Hugging Face, it allows third-party testing and benchmarking.
  • DeepSeek faces growing competition from heavily funded Chinese tech giants expanding their AI portfolios.

China-based AI startup DeepSeek has unveiled its newest experimental language model, V3.2-exp, designed to cut inference costs for long-context tasks nearly in half.

The model, announced Monday,  aims to address one of the most pressing challenges in large-scale AI adoption: the expense of handling extended inputs.

V3.2-exp leverages a new system called DeepSeek Sparse Attention, which pairs a “lightning indexer” with a secondary module for fine-grained token selection.

Together, these innovations allow the model to focus on the most relevant excerpts while managing token-level detail with precision. Early internal testing suggests that the system can significantly reduce server loads, with API costs potentially dropping by 50% for long-context operations.

Open-Weight Model Now Available

Unlike many commercial AI releases that remain closed, V3.2-exp has been launched as an open-weight model. It is now accessible on Hugging Face, giving researchers, developers, and enterprises an opportunity to run independent evaluations.

This decision highlights DeepSeek’s continued push toward transparency and collaboration, especially as companies increasingly scrutinize claims about efficiency and performance.

The model’s open release also aligns with DeepSeek’s previous strategy with its R1 model earlier this year, where open benchmarking allowed the community to verify its reasoning capabilities. By adopting the same approach for V3.2-exp, DeepSeek is signaling confidence in its efficiency breakthroughs.

Building on Past Releases

The launch of V3.2-exp comes after a string of updates and experiments from DeepSeek in recent months. Earlier this September, the company introduced DeepSeek-V3.1-Terminus, a refinement aimed at improving agent performance and addressing reported issues such as illegible symbols and inconsistent language switching.

While that update delivered small improvements in benchmarks like Humanity’s Last Exam and coding tasks, some challenges remained, particularly in Chinese-language performance.

Meanwhile, industry reports have revealed that DeepSeek is working on a next-generation agent-focused model, slated for unveiling in Q4 2025. The project reflects a broader industry shift toward autonomous AI systems, capable of executing multi-step tasks with minimal human supervision. The V3.2-exp release appears to complement this trajectory by strengthening the company’s technological foundation in efficiency before more advanced agent features are rolled out.

Competitive Landscape Heats Up

DeepSeek’s innovation comes at a time when competition in the Chinese AI sector is intensifying. Rival firms such as Alibaba and Tencent are scaling up their AI investments dramatically, with Alibaba pledging over 380 billion RMB ($52.9 billion) in cloud and AI infrastructure.

While DeepSeek has been lauded for achieving cost-efficient results with comparatively modest resources, analysts warn that the company must maintain momentum to avoid being overshadowed by its cash-rich rivals.

The post New DeepSeek Model Halves API Costs for Extended Contexts appeared first on CoinCentral.

시장 기회
플러리싱 에이아이 로고
플러리싱 에이아이 가격(SLEEPLESSAI)
$0.01874
$0.01874$0.01874
+3.19%
USD
플러리싱 에이아이 (SLEEPLESSAI) 실시간 가격 차트
면책 조항: 본 사이트에 재게시된 글들은 공개 플랫폼에서 가져온 것으로 정보 제공 목적으로만 제공됩니다. 이는 반드시 MEXC의 견해를 반영하는 것은 아닙니다. 모든 권리는 원저자에게 있습니다. 제3자의 권리를 침해하는 콘텐츠가 있다고 판단될 경우, crypto.news@mexc.com으로 연락하여 삭제 요청을 해주시기 바랍니다. MEXC는 콘텐츠의 정확성, 완전성 또는 시의적절성에 대해 어떠한 보증도 하지 않으며, 제공된 정보에 기반하여 취해진 어떠한 조치에 대해서도 책임을 지지 않습니다. 본 콘텐츠는 금융, 법률 또는 기타 전문적인 조언을 구성하지 않으며, MEXC의 추천이나 보증으로 간주되어서는 안 됩니다.

$30,000 in PRL + 15,000 USDT

$30,000 in PRL + 15,000 USDT$30,000 in PRL + 15,000 USDT

Deposit & trade PRL to boost your rewards!