The post Revolutionizing Data Analytics: GPU-Native Velox and NVIDIA cuDF Integration appeared on BitcoinEthereumNews.com. Rongchai Wang Oct 06, 2025 06:01 NVIDIA and IBM collaborate to integrate GPU-native Velox with NVIDIA cuDF, enhancing data analytics performance on platforms like Presto and Apache Spark. As data-driven demands grow, NVIDIA and IBM have partnered to enhance data analytics capabilities by integrating GPU-native Velox with NVIDIA cuDF. This collaboration aims to deliver significant performance improvements over traditional CPU-based systems by leveraging the high memory bandwidth and thread count of GPUs, according to NVIDIA. These enhancements are particularly beneficial for compute-heavy workloads involving multiple joins, complex aggregations, and string processing. Velox and cuDF: A Powerful Combination The integration of NVIDIA cuDF into the Velox execution engine allows for GPU-native query execution on widely-used platforms like Presto and Apache Spark. This open project aims to address performance bottlenecks, enabling real-time insights from massive datasets. Velox acts as an intermediary, translating query plans from systems like Presto and Spark into executable GPU pipelines powered by cuDF. Accelerating Presto with GPU Power By moving the entire Presto query plan to GPU, the integration aims to boost execution speed significantly. Enhancements to GPU operators such as TableScan, HashJoin, and HashAggregations in Velox enable end-to-end GPU execution in Presto. Initial benchmarks show impressive runtime reductions, with Presto on NVIDIA GPUs achieving runtimes significantly lower than CPU counterparts. Multi-GPU Execution for Enhanced Performance The collaboration introduces a UCX-based Exchange operator, which supports the entire execution pipeline on GPUs, leveraging high bandwidth NVLink and RoCE or InfiniBand for connectivity. This setup allows for substantial performance gains, with Presto on GPU showcasing more than a sixfold speedup in data exchange processes. Hybrid Execution in Apache Spark In Apache Spark, the integration with Apache Gluten and cuDF focuses on offloading compute-intensive query stages to GPUs, optimizing resource use in hybrid… The post Revolutionizing Data Analytics: GPU-Native Velox and NVIDIA cuDF Integration appeared on BitcoinEthereumNews.com. Rongchai Wang Oct 06, 2025 06:01 NVIDIA and IBM collaborate to integrate GPU-native Velox with NVIDIA cuDF, enhancing data analytics performance on platforms like Presto and Apache Spark. As data-driven demands grow, NVIDIA and IBM have partnered to enhance data analytics capabilities by integrating GPU-native Velox with NVIDIA cuDF. This collaboration aims to deliver significant performance improvements over traditional CPU-based systems by leveraging the high memory bandwidth and thread count of GPUs, according to NVIDIA. These enhancements are particularly beneficial for compute-heavy workloads involving multiple joins, complex aggregations, and string processing. Velox and cuDF: A Powerful Combination The integration of NVIDIA cuDF into the Velox execution engine allows for GPU-native query execution on widely-used platforms like Presto and Apache Spark. This open project aims to address performance bottlenecks, enabling real-time insights from massive datasets. Velox acts as an intermediary, translating query plans from systems like Presto and Spark into executable GPU pipelines powered by cuDF. Accelerating Presto with GPU Power By moving the entire Presto query plan to GPU, the integration aims to boost execution speed significantly. Enhancements to GPU operators such as TableScan, HashJoin, and HashAggregations in Velox enable end-to-end GPU execution in Presto. Initial benchmarks show impressive runtime reductions, with Presto on NVIDIA GPUs achieving runtimes significantly lower than CPU counterparts. Multi-GPU Execution for Enhanced Performance The collaboration introduces a UCX-based Exchange operator, which supports the entire execution pipeline on GPUs, leveraging high bandwidth NVLink and RoCE or InfiniBand for connectivity. This setup allows for substantial performance gains, with Presto on GPU showcasing more than a sixfold speedup in data exchange processes. Hybrid Execution in Apache Spark In Apache Spark, the integration with Apache Gluten and cuDF focuses on offloading compute-intensive query stages to GPUs, optimizing resource use in hybrid…

Revolutionizing Data Analytics: GPU-Native Velox and NVIDIA cuDF Integration

For feedback or concerns regarding this content, please contact us at crypto.news@mexc.com


Rongchai Wang
Oct 06, 2025 06:01

NVIDIA and IBM collaborate to integrate GPU-native Velox with NVIDIA cuDF, enhancing data analytics performance on platforms like Presto and Apache Spark.





As data-driven demands grow, NVIDIA and IBM have partnered to enhance data analytics capabilities by integrating GPU-native Velox with NVIDIA cuDF. This collaboration aims to deliver significant performance improvements over traditional CPU-based systems by leveraging the high memory bandwidth and thread count of GPUs, according to NVIDIA. These enhancements are particularly beneficial for compute-heavy workloads involving multiple joins, complex aggregations, and string processing.

Velox and cuDF: A Powerful Combination

The integration of NVIDIA cuDF into the Velox execution engine allows for GPU-native query execution on widely-used platforms like Presto and Apache Spark. This open project aims to address performance bottlenecks, enabling real-time insights from massive datasets. Velox acts as an intermediary, translating query plans from systems like Presto and Spark into executable GPU pipelines powered by cuDF.

Accelerating Presto with GPU Power

By moving the entire Presto query plan to GPU, the integration aims to boost execution speed significantly. Enhancements to GPU operators such as TableScan, HashJoin, and HashAggregations in Velox enable end-to-end GPU execution in Presto. Initial benchmarks show impressive runtime reductions, with Presto on NVIDIA GPUs achieving runtimes significantly lower than CPU counterparts.

Multi-GPU Execution for Enhanced Performance

The collaboration introduces a UCX-based Exchange operator, which supports the entire execution pipeline on GPUs, leveraging high bandwidth NVLink and RoCE or InfiniBand for connectivity. This setup allows for substantial performance gains, with Presto on GPU showcasing more than a sixfold speedup in data exchange processes.

Hybrid Execution in Apache Spark

In Apache Spark, the integration with Apache Gluten and cuDF focuses on offloading compute-intensive query stages to GPUs, optimizing resource use in hybrid clusters. This strategy allows for efficient use of GPU resources while maintaining CPU availability for other tasks, resulting in significant performance improvements.

Community Involvement and Future Prospects

The open-source nature of this project encourages community involvement, aiming to drive further innovations across the data processing ecosystem. By implementing reusable GPU operators in Velox, the collaboration seeks to reduce duplication and simplify maintenance while accelerating various systems.

Image source: Shutterstock


Source: https://blockchain.news/news/revolutionizing-data-analytics-gpu-native-velox-nvidia-cudf-integration

Market Opportunity
NodeAI Logo
NodeAI Price(GPU)
$0.02735
$0.02735$0.02735
-1.40%
USD
NodeAI (GPU) Live Price Chart
Disclaimer: The articles reposted on this site are sourced from public platforms and are provided for informational purposes only. They do not necessarily reflect the views of MEXC. All rights remain with the original authors. If you believe any content infringes on third-party rights, please contact crypto.news@mexc.com for removal. MEXC makes no guarantees regarding the accuracy, completeness, or timeliness of the content and is not responsible for any actions taken based on the information provided. The content does not constitute financial, legal, or other professional advice, nor should it be considered a recommendation or endorsement by MEXC.

You May Also Like

White House Publishes Trump’s New Strategy Against Cybercrimes

White House Publishes Trump’s New Strategy Against Cybercrimes

Key Takeaways: An executive order that was signed by Donald Trump instructed U.S. agencies to step up efforts to counter network-based frauds and crypto scams in
Share
Crypto Ninjas2026/03/08 00:43
How to earn from cloud mining: IeByte’s upgraded auto-cloud mining platform unlocks genuine passive earnings

How to earn from cloud mining: IeByte’s upgraded auto-cloud mining platform unlocks genuine passive earnings

The post How to earn from cloud mining: IeByte’s upgraded auto-cloud mining platform unlocks genuine passive earnings appeared on BitcoinEthereumNews.com. contributor Posted: September 17, 2025 As digital assets continue to reshape global finance, cloud mining has become one of the most effective ways for investors to generate stable passive income. Addressing the growing demand for simplicity, security, and profitability, IeByte has officially upgraded its fully automated cloud mining platform, empowering both beginners and experienced investors to earn Bitcoin, Dogecoin, and other mainstream cryptocurrencies without the need for hardware or technical expertise. Why cloud mining in 2025? Traditional crypto mining requires expensive hardware, high electricity costs, and constant maintenance. In 2025, with blockchain networks becoming more competitive, these barriers have grown even higher. Cloud mining solves this by allowing users to lease professional mining power remotely, eliminating the upfront costs and complexity. IeByte stands at the forefront of this transformation, offering investors a transparent and seamless path to daily earnings. IeByte’s upgraded auto-cloud mining platform With its latest upgrade, IeByte introduces: Full Automation: Mining contracts can be activated in just one click, with all processes handled by IeByte’s servers. Enhanced Security: Bank-grade encryption, cold wallets, and real-time monitoring protect every transaction. Scalable Options: From starter packages to high-level investment contracts, investors can choose the plan that matches their goals. Global Reach: Already trusted by users in over 100 countries. Mining contracts for 2025 IeByte offers a wide range of contracts tailored for every investor level. From entry-level plans with daily returns to premium high-yield packages, the platform ensures maximum accessibility. Contract Type Duration Price Daily Reward Total Earnings (Principal + Profit) Starter Contract 1 Day $200 $6 $200 + $6 + $10 bonus Bronze Basic Contract 2 Days $500 $13.5 $500 + $27 Bronze Basic Contract 3 Days $1,200 $36 $1,200 + $108 Silver Advanced Contract 1 Day $5,000 $175 $5,000 + $175 Silver Advanced Contract 2 Days $8,000 $320 $8,000 + $640 Silver…
Share
BitcoinEthereumNews2025/09/17 23:48
Taiko Makes Chainlink Data Streams Its Official Oracle

Taiko Makes Chainlink Data Streams Its Official Oracle

The post Taiko Makes Chainlink Data Streams Its Official Oracle appeared on BitcoinEthereumNews.com. Key Notes Taiko has officially integrated Chainlink Data Streams for its Layer 2 network. The integration provides developers with high-speed market data to build advanced DeFi applications. The move aims to improve security and attract institutional adoption by using Chainlink’s established infrastructure. Taiko, an Ethereum-based ETH $4 514 24h volatility: 0.4% Market cap: $545.57 B Vol. 24h: $28.23 B Layer 2 rollup, has announced the integration of Chainlink LINK $23.26 24h volatility: 1.7% Market cap: $15.75 B Vol. 24h: $787.15 M Data Streams. The development comes as the underlying Ethereum network continues to see significant on-chain activity, including large sales from ETH whales. The partnership establishes Chainlink as the official oracle infrastructure for the network. It is designed to provide developers on the Taiko platform with reliable and high-speed market data, essential for building a wide range of decentralized finance (DeFi) applications, from complex derivatives platforms to more niche projects involving unique token governance models. According to the project’s official announcement on Sept. 17, the integration enables the creation of more advanced on-chain products that require high-quality, tamper-proof data to function securely. Taiko operates as a “based rollup,” which means it leverages Ethereum validators for transaction sequencing for strong decentralization. Boosting DeFi and Institutional Interest Oracles are fundamental services in the blockchain industry. They act as secure bridges that feed external, off-chain information to on-chain smart contracts. DeFi protocols, in particular, rely on oracles for accurate, real-time price feeds. Taiko leadership stated that using Chainlink’s infrastructure aligns with its goals. The team hopes the partnership will help attract institutional crypto investment and support the development of real-world applications, a goal that aligns with Chainlink’s broader mission to bring global data on-chain. Integrating real-world economic information is part of a broader industry trend. Just last week, Chainlink partnered with the Sei…
Share
BitcoinEthereumNews2025/09/18 03:34