The post NVIDIA Jetson AGX Thor Enhances Edge AI Models with 7x Performance Boost appeared on BitcoinEthereumNews.com. Caroline Bishop Oct 16, 2025 01:14 NVIDIA’s Jetson AGX Thor achieves a 7x performance increase in generative AI, optimizing edge computing through continuous software advancements and support for cutting-edge AI models. NVIDIA has unveiled significant advancements in its Jetson AGX Thor platform, promising a remarkable 7x increase in generative AI performance since its launch in August 2025. This enhancement underscores NVIDIA’s commitment to continuous optimization across its software ecosystem, according to NVIDIA’s blog. Enhanced Performance Through Software Updates Initially launched with a 5x boost over previous models, the Jetson AGX Thor has seen its capabilities further expanded through regular software updates. These updates have enabled developers to leverage substantial improvements on AI models such as Llama and DeepSeek. NVIDIA’s approach includes supporting leading models soon after their release, allowing developers to experiment with the latest AI technologies swiftly. Advanced AI Techniques and Support The Jetson Thor platform accommodates major quantization formats, including the new NVFP4 from NVIDIA’s Blackwell GPU architecture. This helps optimize inference, a crucial component of edge computing. New techniques like speculative decoding are now supported, significantly accelerating generative AI workloads at the edge. Speculative decoding, in particular, has shown to boost the output tokens per second by 7x, as demonstrated in benchmarks with the Llama 3.3 70B model. Continuous Optimization and Benchmarks Recent updates, such as the vLLM container release, have further enhanced Jetson Thor’s performance. For instance, the platform now delivers up to 3.5x greater performance on the same model and quantization compared to its initial launch performance. This is evidenced by benchmarks showing increased output tokens per second on models like Llama 3.3 70B and DeepSeek R1 70B. Day 0 Support and Future Prospects Developers can take advantage of day 0 support for new models on Jetson… The post NVIDIA Jetson AGX Thor Enhances Edge AI Models with 7x Performance Boost appeared on BitcoinEthereumNews.com. Caroline Bishop Oct 16, 2025 01:14 NVIDIA’s Jetson AGX Thor achieves a 7x performance increase in generative AI, optimizing edge computing through continuous software advancements and support for cutting-edge AI models. NVIDIA has unveiled significant advancements in its Jetson AGX Thor platform, promising a remarkable 7x increase in generative AI performance since its launch in August 2025. This enhancement underscores NVIDIA’s commitment to continuous optimization across its software ecosystem, according to NVIDIA’s blog. Enhanced Performance Through Software Updates Initially launched with a 5x boost over previous models, the Jetson AGX Thor has seen its capabilities further expanded through regular software updates. These updates have enabled developers to leverage substantial improvements on AI models such as Llama and DeepSeek. NVIDIA’s approach includes supporting leading models soon after their release, allowing developers to experiment with the latest AI technologies swiftly. Advanced AI Techniques and Support The Jetson Thor platform accommodates major quantization formats, including the new NVFP4 from NVIDIA’s Blackwell GPU architecture. This helps optimize inference, a crucial component of edge computing. New techniques like speculative decoding are now supported, significantly accelerating generative AI workloads at the edge. Speculative decoding, in particular, has shown to boost the output tokens per second by 7x, as demonstrated in benchmarks with the Llama 3.3 70B model. Continuous Optimization and Benchmarks Recent updates, such as the vLLM container release, have further enhanced Jetson Thor’s performance. For instance, the platform now delivers up to 3.5x greater performance on the same model and quantization compared to its initial launch performance. This is evidenced by benchmarks showing increased output tokens per second on models like Llama 3.3 70B and DeepSeek R1 70B. Day 0 Support and Future Prospects Developers can take advantage of day 0 support for new models on Jetson…

NVIDIA Jetson AGX Thor Enhances Edge AI Models with 7x Performance Boost

For feedback or concerns regarding this content, please contact us at crypto.news@mexc.com


Caroline Bishop
Oct 16, 2025 01:14

NVIDIA’s Jetson AGX Thor achieves a 7x performance increase in generative AI, optimizing edge computing through continuous software advancements and support for cutting-edge AI models.





NVIDIA has unveiled significant advancements in its Jetson AGX Thor platform, promising a remarkable 7x increase in generative AI performance since its launch in August 2025. This enhancement underscores NVIDIA’s commitment to continuous optimization across its software ecosystem, according to NVIDIA’s blog.

Enhanced Performance Through Software Updates

Initially launched with a 5x boost over previous models, the Jetson AGX Thor has seen its capabilities further expanded through regular software updates. These updates have enabled developers to leverage substantial improvements on AI models such as Llama and DeepSeek. NVIDIA’s approach includes supporting leading models soon after their release, allowing developers to experiment with the latest AI technologies swiftly.

Advanced AI Techniques and Support

The Jetson Thor platform accommodates major quantization formats, including the new NVFP4 from NVIDIA’s Blackwell GPU architecture. This helps optimize inference, a crucial component of edge computing. New techniques like speculative decoding are now supported, significantly accelerating generative AI workloads at the edge. Speculative decoding, in particular, has shown to boost the output tokens per second by 7x, as demonstrated in benchmarks with the Llama 3.3 70B model.

Continuous Optimization and Benchmarks

Recent updates, such as the vLLM container release, have further enhanced Jetson Thor’s performance. For instance, the platform now delivers up to 3.5x greater performance on the same model and quantization compared to its initial launch performance. This is evidenced by benchmarks showing increased output tokens per second on models like Llama 3.3 70B and DeepSeek R1 70B.

Day 0 Support and Future Prospects

Developers can take advantage of day 0 support for new models on Jetson Thor, exemplified by the early support for gpt-oss on platforms like llamacpp/ollama. This ensures that developers can run the latest generative AI models at the edge without delay. NVIDIA also provides week zero support for numerous NVIDIA Nemotron models, further enhancing the platform’s versatility.

Optimizing AI Performance

To fully exploit Jetson Thor’s potential, NVIDIA recommends employing techniques such as quantization and speculative decoding. Quantization, which reduces the numerical precision of a model’s data, allows for a smaller memory footprint and faster memory access, crucial for edge applications. Speculative decoding enhances performance by using a draft-verification approach, significantly reducing latency.

Combining these techniques with NVIDIA’s vLLM and EAGLE-3 support, developers can achieve substantial performance improvements for large language models on the Jetson Thor platform. This makes it a compelling choice for those seeking to deploy advanced AI applications at the edge.

Image source: Shutterstock


Source: https://blockchain.news/news/nvidia-jetson-agx-thor-enhances-edge-ai-models-7x-performance-boost

Market Opportunity
Edge Logo
Edge Price(EDGE)
$0,16204
$0,16204$0,16204
+6,38%
USD
Edge (EDGE) Live Price Chart
Disclaimer: The articles reposted on this site are sourced from public platforms and are provided for informational purposes only. They do not necessarily reflect the views of MEXC. All rights remain with the original authors. If you believe any content infringes on third-party rights, please contact crypto.news@mexc.com for removal. MEXC makes no guarantees regarding the accuracy, completeness, or timeliness of the content and is not responsible for any actions taken based on the information provided. The content does not constitute financial, legal, or other professional advice, nor should it be considered a recommendation or endorsement by MEXC.

You May Also Like

CEO Sandeep Nailwal Shared Highlights About RWA on Polygon

CEO Sandeep Nailwal Shared Highlights About RWA on Polygon

The post CEO Sandeep Nailwal Shared Highlights About RWA on Polygon appeared on BitcoinEthereumNews.com. Polygon CEO Sandeep Nailwal highlighted Polygon’s lead in global bonds, Spiko US T-Bill, and Spiko Euro T-Bill. Polygon published an X post to share that its roadmap to GigaGas was still scaling. Sentiments around POL price were last seen to be bearish. Polygon CEO Sandeep Nailwal shared key pointers from the Dune and RWA.xyz report. These pertain to highlights about RWA on Polygon. Simultaneously, Polygon underlined its roadmap towards GigaGas. Sentiments around POL price were last seen fumbling under bearish emotions. Polygon CEO Sandeep Nailwal on Polygon RWA CEO Sandeep Nailwal highlighted three key points from the Dune and RWA.xyz report. The Chief Executive of Polygon maintained that Polygon PoS was hosting RWA TVL worth $1.13 billion across 269 assets plus 2,900 holders. Nailwal confirmed from the report that RWA was happening on Polygon. The Dune and https://t.co/W6WSFlHoQF report on RWA is out and it shows that RWA is happening on Polygon. Here are a few highlights: – Leading in Global Bonds: Polygon holds 62% share of tokenized global bonds (driven by Spiko’s euro MMF and Cashlink euro issues) – Spiko U.S.… — Sandeep | CEO, Polygon Foundation (※,※) (@sandeepnailwal) September 17, 2025 The X post published by Polygon CEO Sandeep Nailwal underlined that the ecosystem was leading in global bonds by holding a 62% share of tokenized global bonds. He further highlighted that Polygon was leading with Spiko US T-Bill at approximately 29% share of TVL along with Ethereum, adding that the ecosystem had more than 50% share in the number of holders. Finally, Sandeep highlighted from the report that there was a strong adoption for Spiko Euro T-Bill with 38% share of TVL. He added that 68% of returns were on Polygon across all the chains. Polygon Roadmap to GigaGas In a different update from Polygon, the community…
Share
BitcoinEthereumNews2025/09/18 01:10
👨🏿‍🚀TechCabal Daily – Folded by a paper cut

👨🏿‍🚀TechCabal Daily – Folded by a paper cut

In today's edition: Mpact’s paper mill is shutting down || An e-commerce play for SA’s Post Office || Kenya’s traffic cop
Share
Techcabal2026/03/10 14:05
MTN Plans Starlink Launch in Zambia

MTN Plans Starlink Launch in Zambia

MTN’s Starlink launch plan in Zambia signals a new phase for satellite internet expansion, aiming to accelerate rural connectivity and support the country’s digital
Share
Furtherafrica2026/03/10 14:00