Two new Jina reranker models deliver low-latency, production-ready relevance for hybrid search and RAG workloads
SAN FRANCISCO–(BUSINESS WIRE)–Elastic (NYSE: ESTC), the Search AI Company, today made two Jina Rerankers available on Elastic Inference Service (EIS), a GPU-accelerated inference-as-a-service that makes it easy to run fast, high-quality inference without complex setup or hosting. These rerankers bring low-latency, high-precision multilingual reranking to the Elastic ecosystem.
As generative AI prototypes move into production-ready search and RAG systems, users run into relevance and inference latency limits, particularly for multilingual use cases. Rerankers improve search quality by reordering results based on semantic relevance, helping surface the most accurate matches for a query. They improve relevance across aggregated, multi-query results, without reindexing or pipeline changes. This makes them especially valuable for hybrid search, RAG, and context-engineering workflows where better context boosts downstream accuracy.
By delivering GPU-accelerated Jina rerankers as a managed service, Elastic enables teams to improve search and RAG accuracy without managing model infrastructure.
“Search relevance is foundational to AI-driven experiences,” said Steve Kearns, general manager, Search at Elastic. “By bringing these Jina reranker models to Elastic Inference Service, we are enabling teams to deliver fast and accurate multilingual search, RAG, and agentic AI experiences, available out of the box with minimal setup.”
The two new Jina reranker models are optimized for different production needs:
Jina Reranker v2 (jina-reranker-v2-base-multilingual)
Built for scalable, agentic workflows.
Jina Reranker v3 (jina-reranker-v3)
Optimized for high-precision shortlist reranking.
These models extend Elastic’s growing catalogue of ready-to-use models available on EIS, which includes the open source multilingual and multimodal embeddings, rerankers, and small language models built by Jina and acquired by Elastic last year. EIS has an expanding catalogue of ready-to-use models on managed GPUs, with additional models expected to be added over time.
Availability
All Elastic Cloud trials have access to the Elastic Inference Service. Try it now on Elastic Cloud Serverless and Elastic Cloud Hosted.
Additional Resources
About Elastic
Elastic (NYSE: ESTC), the Search AI Company, integrates its deep expertise in search technology with artificial intelligence to help everyone transform all of their data into answers, actions, and outcomes. Elastic’s Search AI Platform — the foundation for its search, observability, and security solutions — is used by thousands of companies, including more than 50% of the Fortune 500. Learn more at elastic.co.
Elastic and associated marks are trademarks or registered trademarks of elasticsearch BV and its subsidiaries. All other company and product names may be trademarks of their respective owners.
Contacts
Media Contact
Elastic PR
PR-team@elastic.co


