The post NVIDIA Jetson AGX Thor Enhances Edge AI Models with 7x Performance Boost appeared on BitcoinEthereumNews.com. Caroline Bishop Oct 16, 2025 01:14 NVIDIA’s Jetson AGX Thor achieves a 7x performance increase in generative AI, optimizing edge computing through continuous software advancements and support for cutting-edge AI models. NVIDIA has unveiled significant advancements in its Jetson AGX Thor platform, promising a remarkable 7x increase in generative AI performance since its launch in August 2025. This enhancement underscores NVIDIA’s commitment to continuous optimization across its software ecosystem, according to NVIDIA’s blog. Enhanced Performance Through Software Updates Initially launched with a 5x boost over previous models, the Jetson AGX Thor has seen its capabilities further expanded through regular software updates. These updates have enabled developers to leverage substantial improvements on AI models such as Llama and DeepSeek. NVIDIA’s approach includes supporting leading models soon after their release, allowing developers to experiment with the latest AI technologies swiftly. Advanced AI Techniques and Support The Jetson Thor platform accommodates major quantization formats, including the new NVFP4 from NVIDIA’s Blackwell GPU architecture. This helps optimize inference, a crucial component of edge computing. New techniques like speculative decoding are now supported, significantly accelerating generative AI workloads at the edge. Speculative decoding, in particular, has shown to boost the output tokens per second by 7x, as demonstrated in benchmarks with the Llama 3.3 70B model. Continuous Optimization and Benchmarks Recent updates, such as the vLLM container release, have further enhanced Jetson Thor’s performance. For instance, the platform now delivers up to 3.5x greater performance on the same model and quantization compared to its initial launch performance. This is evidenced by benchmarks showing increased output tokens per second on models like Llama 3.3 70B and DeepSeek R1 70B. Day 0 Support and Future Prospects Developers can take advantage of day 0 support for new models on Jetson… The post NVIDIA Jetson AGX Thor Enhances Edge AI Models with 7x Performance Boost appeared on BitcoinEthereumNews.com. Caroline Bishop Oct 16, 2025 01:14 NVIDIA’s Jetson AGX Thor achieves a 7x performance increase in generative AI, optimizing edge computing through continuous software advancements and support for cutting-edge AI models. NVIDIA has unveiled significant advancements in its Jetson AGX Thor platform, promising a remarkable 7x increase in generative AI performance since its launch in August 2025. This enhancement underscores NVIDIA’s commitment to continuous optimization across its software ecosystem, according to NVIDIA’s blog. Enhanced Performance Through Software Updates Initially launched with a 5x boost over previous models, the Jetson AGX Thor has seen its capabilities further expanded through regular software updates. These updates have enabled developers to leverage substantial improvements on AI models such as Llama and DeepSeek. NVIDIA’s approach includes supporting leading models soon after their release, allowing developers to experiment with the latest AI technologies swiftly. Advanced AI Techniques and Support The Jetson Thor platform accommodates major quantization formats, including the new NVFP4 from NVIDIA’s Blackwell GPU architecture. This helps optimize inference, a crucial component of edge computing. New techniques like speculative decoding are now supported, significantly accelerating generative AI workloads at the edge. Speculative decoding, in particular, has shown to boost the output tokens per second by 7x, as demonstrated in benchmarks with the Llama 3.3 70B model. Continuous Optimization and Benchmarks Recent updates, such as the vLLM container release, have further enhanced Jetson Thor’s performance. For instance, the platform now delivers up to 3.5x greater performance on the same model and quantization compared to its initial launch performance. This is evidenced by benchmarks showing increased output tokens per second on models like Llama 3.3 70B and DeepSeek R1 70B. Day 0 Support and Future Prospects Developers can take advantage of day 0 support for new models on Jetson…

NVIDIA Jetson AGX Thor Enhances Edge AI Models with 7x Performance Boost

For feedback or concerns regarding this content, please contact us at crypto.news@mexc.com


Caroline Bishop
Oct 16, 2025 01:14

NVIDIA’s Jetson AGX Thor achieves a 7x performance increase in generative AI, optimizing edge computing through continuous software advancements and support for cutting-edge AI models.





NVIDIA has unveiled significant advancements in its Jetson AGX Thor platform, promising a remarkable 7x increase in generative AI performance since its launch in August 2025. This enhancement underscores NVIDIA’s commitment to continuous optimization across its software ecosystem, according to NVIDIA’s blog.

Enhanced Performance Through Software Updates

Initially launched with a 5x boost over previous models, the Jetson AGX Thor has seen its capabilities further expanded through regular software updates. These updates have enabled developers to leverage substantial improvements on AI models such as Llama and DeepSeek. NVIDIA’s approach includes supporting leading models soon after their release, allowing developers to experiment with the latest AI technologies swiftly.

Advanced AI Techniques and Support

The Jetson Thor platform accommodates major quantization formats, including the new NVFP4 from NVIDIA’s Blackwell GPU architecture. This helps optimize inference, a crucial component of edge computing. New techniques like speculative decoding are now supported, significantly accelerating generative AI workloads at the edge. Speculative decoding, in particular, has shown to boost the output tokens per second by 7x, as demonstrated in benchmarks with the Llama 3.3 70B model.

Continuous Optimization and Benchmarks

Recent updates, such as the vLLM container release, have further enhanced Jetson Thor’s performance. For instance, the platform now delivers up to 3.5x greater performance on the same model and quantization compared to its initial launch performance. This is evidenced by benchmarks showing increased output tokens per second on models like Llama 3.3 70B and DeepSeek R1 70B.

Day 0 Support and Future Prospects

Developers can take advantage of day 0 support for new models on Jetson Thor, exemplified by the early support for gpt-oss on platforms like llamacpp/ollama. This ensures that developers can run the latest generative AI models at the edge without delay. NVIDIA also provides week zero support for numerous NVIDIA Nemotron models, further enhancing the platform’s versatility.

Optimizing AI Performance

To fully exploit Jetson Thor’s potential, NVIDIA recommends employing techniques such as quantization and speculative decoding. Quantization, which reduces the numerical precision of a model’s data, allows for a smaller memory footprint and faster memory access, crucial for edge applications. Speculative decoding enhances performance by using a draft-verification approach, significantly reducing latency.

Combining these techniques with NVIDIA’s vLLM and EAGLE-3 support, developers can achieve substantial performance improvements for large language models on the Jetson Thor platform. This makes it a compelling choice for those seeking to deploy advanced AI applications at the edge.

Image source: Shutterstock


Source: https://blockchain.news/news/nvidia-jetson-agx-thor-enhances-edge-ai-models-7x-performance-boost

Disclaimer: The articles reposted on this site are sourced from public platforms and are provided for informational purposes only. They do not necessarily reflect the views of MEXC. All rights remain with the original authors. If you believe any content infringes on third-party rights, please contact crypto.news@mexc.com for removal. MEXC makes no guarantees regarding the accuracy, completeness, or timeliness of the content and is not responsible for any actions taken based on the information provided. The content does not constitute financial, legal, or other professional advice, nor should it be considered a recommendation or endorsement by MEXC.

You May Also Like

WORLD3 and PlaysOut Unite to Advance Web3 Mini-Game Ecosystem

WORLD3 and PlaysOut Unite to Advance Web3 Mini-Game Ecosystem

WORLD3, a project known for combining Web3 technology with autonomous agents and artificial intelligence, has entered into a strategic collaboration with PlaysOut
Share
CoinTrust2026/03/10 15:08
TrendX Taps Trusta AI to Develop Safer and Smarter Web3 Network

TrendX Taps Trusta AI to Develop Safer and Smarter Web3 Network

The purpose of collaboration is to advance the Web3 landscape by combining the decentralized infrastructure of TrendX with AI-led capabilities of Trusta AI.
Share
Blockchainreporter2025/09/18 01:07
UK crypto holders brace for FCA’s expanded regulatory reach

UK crypto holders brace for FCA’s expanded regulatory reach

The post UK crypto holders brace for FCA’s expanded regulatory reach appeared on BitcoinEthereumNews.com. British crypto holders may soon face a very different landscape as the Financial Conduct Authority (FCA) moves to expand its regulatory reach in the industry. A new consultation paper outlines how the watchdog intends to apply its rulebook to crypto firms, shaping everything from asset safeguarding to trading platform operation. According to the financial regulator, these proposals would translate into clearer protections for retail investors and stricter oversight of crypto firms. UK FCA plans Until now, UK crypto users mostly encountered the FCA through rules on promotions and anti-money laundering checks. The consultation paper goes much further. It proposes direct oversight of stablecoin issuers, custodians, and crypto-asset trading platforms (CATPs). For investors, that means the wallets, exchanges, and coins they rely on could soon be subject to the same governance and resilience standards as traditional financial institutions. The regulator has also clarified that firms need official authorization before serving customers. This condition should, in theory, reduce the risk of sudden platform failures or unclear accountability. David Geale, the FCA’s executive director of payments and digital finance, said the proposals are designed to strike a balance between innovation and protection. He explained: “We want to develop a sustainable and competitive crypto sector – balancing innovation, market integrity and trust.” Geale noted that while the rules will not eliminate investment risks, they will create consistent standards, helping consumers understand what to expect from registered firms. Why does this matter for crypto holders? The UK regulatory framework shift would provide safer custody of assets, better disclosure of risks, and clearer recourse if something goes wrong. However, the regulator was also frank in its submission, arguing that no rulebook can eliminate the volatility or inherent risks of holding digital assets. Instead, the focus is on ensuring that when consumers choose to invest, they do…
Share
BitcoinEthereumNews2025/09/17 23:52