REDWOOD CITY, Calif., Jan. 30, 2026 /PRNewswire/ — Zilliz, the company behind the leading open-source vector database Milvus, today announced the open-source releaseREDWOOD CITY, Calif., Jan. 30, 2026 /PRNewswire/ — Zilliz, the company behind the leading open-source vector database Milvus, today announced the open-source release

Zilliz Open Sources Industry-First Bilingual “Semantic Highlighting” Model to Slash RAG Token Costs and Boost Accuracy

2026/01/31 02:31
3 min read

REDWOOD CITY, Calif., Jan. 30, 2026 /PRNewswire/ — Zilliz, the company behind the leading open-source vector database Milvus, today announced the open-source release of its Bilingual Semantic Highlighting Model, an industry-first AI model designed to dramatically reduce token usage and improve answer quality in production RAG-powered AI applications.

This highlighting model introduces sentence-level relevance filtering, enabling AI developers to remove low-signal context before sending prompts to large language models. This approach directly addresses rising inference costs and accuracy issues caused by oversized context windows in enterprise RAG and RAG-powered AI deployments.

“As RAG systems move into production, teams are running into very real cost and quality limits,” said James Luan, VP of Engineering at Zilliz. “This model gives developers a practical way to reduce prompt size and improve answer accuracy without reworking their existing pipelines.”

Key Innovations and Technical Breakthroughs

  • Bilingual relevance by design: Optimized for both English and Chinese, the model addresses cross-lingual relevance challenges common in global RAG deployments. It is built on the MiniCPM-2B architecture, enabling low-latency, production-ready performance.
  • Sentence-level context filtering: Rather than scoring entire document chunks, the model evaluates relevance at the sentence level and retains only content that directly supports a user query before sending it to the LLM.
  • Lower token usage, higher answer quality: Zilliz reports that sentence-level filtering significantly compresses prompt size while improving downstream response quality, helping teams reduce inference costs and improve generation speed in production environments.

Availability

The Bilingual Semantic Highlighting Model is available today as an open-source release. To learn more about the training methodology and performance benchmarks, visit the Zilliz Technical Blog.

Download: : zilliz/semantic-highlight-bilingual-v1

About Zilliz

Zilliz is the company behind Milvus, the world’s most widely adopted open-source vector database. Zilliz Cloud brings that performance to production with a fully managed, cloud-native platform built for scalable, low-latency vector search and hybrid retrieval. It supports billion-scale workloads with sub-10ms latency, auto-scaling, and optimized indexes for GenAI use cases like semantic search and RAG.

Zilliz is built to make AI not just possible—but practical. With a focus on performance and cost-efficiency, it helps engineering teams move from prototype to production without overprovisioning or complex infrastructure. Over 10,000 organizations worldwide rely on Zilliz to build intelligent applications at scale.

Headquartered in Redwood Shores, California, Zilliz is backed by leading investors, including Aramco’s Prosperity 7 Ventures, Temasek’s Pavilion Capital, Hillhouse Capital, 5Y Capital, Yunqi Partners, Trustbridge Partners, and others. Learn more at  Zilliz.com.

Cision View original content to download multimedia:https://www.prnewswire.com/news-releases/zilliz-open-sources-industry-first-bilingual-semantic-highlighting-model-to-slash-rag-token-costs-and-boost-accuracy-302675291.html

SOURCE Zilliz

Disclaimer: The articles reposted on this site are sourced from public platforms and are provided for informational purposes only. They do not necessarily reflect the views of MEXC. All rights remain with the original authors. If you believe any content infringes on third-party rights, please contact service@support.mexc.com for removal. MEXC makes no guarantees regarding the accuracy, completeness, or timeliness of the content and is not responsible for any actions taken based on the information provided. The content does not constitute financial, legal, or other professional advice, nor should it be considered a recommendation or endorsement by MEXC.

You May Also Like

IP Hits $11.75, HYPE Climbs to $55, BlockDAG Surpasses Both with $407M Presale Surge!

IP Hits $11.75, HYPE Climbs to $55, BlockDAG Surpasses Both with $407M Presale Surge!

The post IP Hits $11.75, HYPE Climbs to $55, BlockDAG Surpasses Both with $407M Presale Surge! appeared on BitcoinEthereumNews.com. Crypto News 17 September 2025 | 18:00 Discover why BlockDAG’s upcoming Awakening Testnet launch makes it the best crypto to buy today as Story (IP) price jumps to $11.75 and Hyperliquid hits new highs. Recent crypto market numbers show strength but also some limits. The Story (IP) price jump has been sharp, fueled by big buybacks and speculation, yet critics point out that revenue still lags far behind its valuation. The Hyperliquid (HYPE) price looks solid around the mid-$50s after a new all-time high, but questions remain about sustainability once the hype around USDH proposals cools down. So the obvious question is: why chase coins that are either stretched thin or at risk of retracing when you could back a network that’s already proving itself on the ground? That’s where BlockDAG comes in. While other chains are stuck dealing with validator congestion or outages, BlockDAG’s upcoming Awakening Testnet will be stress-testing its EVM-compatible smart chain with real miners before listing. For anyone looking for the best crypto coin to buy, the choice between waiting on fixes or joining live progress feels like an easy one. BlockDAG: Smart Chain Running Before Launch Ethereum continues to wrestle with gas congestion, and Solana is still known for network freezes, yet BlockDAG is already showing a different picture. Its upcoming Awakening Testnet, set to launch on September 25, isn’t just a demo; it’s a live rollout where the chain’s base protocols are being stress-tested with miners connected globally. EVM compatibility is active, account abstraction is built in, and tools like updated vesting contracts and Stratum integration are already functional. Instead of waiting for fixes like other networks, BlockDAG is proving its infrastructure in real time. What makes this even more important is that the technology is operational before the coin even hits exchanges. That…
Share
BitcoinEthereumNews2025/09/18 00:32
Ondo Finance launches USDY yieldcoin on Stellar network

Ondo Finance launches USDY yieldcoin on Stellar network

The post Ondo Finance launches USDY yieldcoin on Stellar network appeared on BitcoinEthereumNews.com. Key Takeaways Ondo Finance has launched its USDY yieldcoin on the Stellar blockchain network. USDY is Ondo’s flagship yieldcoin focused on real-world asset expansion. Ondo Finance launched its USDY yieldcoin on the Stellar blockchain network today. USDY is described as Ondo’s flagship yieldcoin and represents the company’s expansion of real-world assets onto the Stellar platform. The launch aims to provide yield access across global economies through Stellar’s international network infrastructure. The deployment connects traditional finance with blockchain-based solutions by bringing real-world asset exposure to Stellar’s ecosystem. Ondo Finance positions the move as part of efforts to broaden access to yield-generating opportunities worldwide. Source: https://cryptobriefing.com/ondo-finance-usdy-yieldcoin-stellar-launch/
Share
BitcoinEthereumNews2025/09/18 03:58
Rap Star Drake Uses Stake to Wager $1M in Bitcoin on Patriots Despite Super Bowl LX Odds

Rap Star Drake Uses Stake to Wager $1M in Bitcoin on Patriots Despite Super Bowl LX Odds

Drake has never been shy about betting big, but on the eve of Super Bowl LX, the global music star took it up another notch by placing a $1 million wager on the
Share
Coinstats2026/02/09 04:00