NVIDIA’s Jetson platform has long shaped what’s possible for AI at the edge, powering physical AI, drones, and autonomous systems across industries. With the introduction of Jetson AGX Thor, built on the new Blackwell GPU architecture, developers now have another option alongside the established Jetson AGX Orin, based on Ampere. On paper, Thor delivers a major step up, with up to 7.5 times the AI compute and about 3.5 times the energy efficiency of Orin.
While the performance leap is undeniable, the real measure lies in how it impacts engineering decisions. For some teams, Orin’s track record and field-proven stability may make it the dependable choice. For others, Thor’s leap in compute and efficiency could unlock entirely new levels of capability. The question is not just how powerful Thor is, but how the strengths of each module align with different project needs.
This comparison sets the record straight. We’ll start with a detailed spec table, then explain how each metric plays out in practice. We’ll also look at migration considerations and the software ecosystem. By the end, you’ll see where each platform stands and where the real trade-offs lie.
To ground the discussion, the table below compares key specifications of NVIDIA Jetson AGX Thor versus NVIDIA Jetson AGX Orin (Ampere generation). This focuses on the flagship modules of each generation (Thor’s Jetson T5000 module vs. Orin’s 64GB AGX module) for an apples-to-apples comparison.
| Component | Jetson AGX Orin (Ampere Gen) | Jetson AGX Thor (Blackwell Gen) |
| CPU | 12-core Arm Cortex-A78AE v8.2 @ ~2.2 GHz (64-bit, 3 MB L2 + 6 MB L3) | 14-core Arm Neoverse-V3AE @ up to 2.6 GHz (64-bit, 1 MB L2 per core, 16 MB L3) |
| GPU | NVIDIA Ampere GPU, 2048 CUDA cores + 64 Tensor Cores; no MIG (single-instance GPU) | NVIDIA Blackwell GPU, 2560 CUDA cores + 96 Tensor Cores (5th-gen); supports Multi-Instance GPU (MIG) |
| AI Features | INT8: up to ~275 TOPS (marketing figure). GPU INT8: ~170 sparse TOPS; DLAs add up to ~105 sparse TOPS | FP4: 2070 (sparse) / 1035 (dense) TFLOPs. FP8: 1035 (sparse) / 517 (dense) TFLOPs. FP32 (CUDA): 8.064 / 7.096 TFLOPs. (NVIDIA note: FP8 TFLOPs ≈ INT8 TOPS) |
|
Memory (Type & Bandwidth) |
Up to 64 GB LPDDR5 @ 256-bit, ~205 GB/s bandwidth (64 GB module) | 128 GB LPDDR5X @ 256-bit, ~273 GB/s bandwidth |
| Power (TDP) | Configurable 15 W to 60 W (max). Lower power envelope suits battery-powered or thermally constrained devices. | Typical operating range 75–120 W for full performance. Requires robust cooling/power |
Jetson Thor can handle more CPU-intensive tasks or background processes without saturating, which is beneficial for complex robots that need to run real-time control loops, perception, and higher-level reasoning concurrently. The Neoverse V3 cores bring PC/server-grade processing to the edge, which helps for things like physics simulation, path planning, or running traditional algorithms alongside neural networks. By contrast, Jetson Orin’s CPU – while no slouch – may become a bottleneck if an application has heavy non-GPU workloads (for example, CPU-bound sensor processing or extended ROS 2 computations). Orin’s 12 Arm Cortex-A78AE cores were a step up from Xavier’s 8 Carmel cores, but Thor’s CPU is another substantial jump: NVIDIA even describes Thor’s CPU as “robust”, emphasizing its role in low-latency, real-time processing alongside the GPU.
For developers, one consideration is software thread distribution. On Orin, offloading work to the GPU and using accelerators was often essential to meet real-time constraints, because the CPU, though multi-core, could be taxed by complex workloads. With Thor, the beefier CPU may handle more on its own – for example, high-frequency control tasks or large dataset pre-processing – potentially simplifying software architectures. It also implies that moving from Orin to Thor could involve re-balancing CPU/GPU usage to take advantage of the new CPU headroom.
For AI inference and vision processing, Jetson Thor’s GPU is significantly more powerful. It delivers up to 1035 FP8 TFLOPs (sparse) or 2070 FP4 TFLOPs (sparse), depending on model precision, while Orin 64GB tops out at 170 INT8 Sparse TOPS on the GPU (275 TOPS marketing figure includes platform accelerators). That headroom lets you run bigger models or more concurrent models at real-time rates.
The addition of MIG on Thor is very relevant for robotics systems or physical AI that have mixed-criticality workloads. For instance, one MIG partition could be reserved for real-time perception (e.g. obstacle detection camera DNNs that must run at a steady 30 FPS), while another partition handles a less critical task like an NLP model or UI rendering. On Orin, all tasks share the GPU and one heavy model could momentarily starve others of
GPU cycles, whereas Thor can hard-partition GPU slices to guarantee performance isolation. This is a big plus for designing safety-conscious systems.
For developers, existing INT8 and FP16 models run on Thor without modification, and FP8 can be enabled with NVIDIA’s Transformer Engine APIs for additional speed. This backward compatibility protects current work while offering clear paths to optimization.
Many AI models today can run at reduced precision without significant loss in accuracy, especially if quantization-aware training or fine-tuning is used. NVIDIA’s Transformer Engine in Thor dynamically chooses between FP4 and FP8 for different parts of transformer models to maximize speed while preserving accuracy. This allows models that might need FP16 on Orin to run in FP8 on Thor, effectively doubling throughput. FP4 support also hints at research in ultra-low precision, which could be useful for certain generative AI tasks or for ensemble models where memory is a constraint.
For a robotics engineer, the raw numbers translate to capability. With Orin’s ~275 TOPS, a robot might budget 50 TOPS for object detection, 50 for segmentation, 20 for pose estimation, etc., and still could run several tasks concurrently at real-time. With Thor’s up to 1035 FP8 TFLOPs (sparse) ≈ ~1035 ‘equivalent’ INT8 TOPS per NVIDIA’s note, the same robot could add transformer-based vision-language models or reinforcement learning policies without leaving the edge. This enables more autonomous functionality without cloud offloading. For instance, Thor can run large vision-language-action models (like NVIDIA’s new Isaac GR00T family) in real time at the edge, whereas Orin might struggle or not fit those models in memory/performance budget.
Jetson Thor’s memory upgrade is a crucial enabler for advanced AI at the edge. It complements the GPU’s capabilities: you can actually load those giant models and data that the GPU can crunch. Jetson Orin’s memory, while large by embedded standards, might limit ultra-large-model applications (like > billions of params or high-res sensor fusion). For many current robotics tasks, 32 GB suffices, and Orin’s memory speed is not a bottleneck. But Thor is built for the next generation where robots could be running simultaneous multi-modal AI workloads and maintaining rich world models in memory. Essentially, Thor’s memory subsystem turns it into a “supercomputer on a module” – bridging a gap where previously one might need a connected server to handle such scale.
For end users, process node is not something visible, but it is experienced as performance or power consumption. Jetson Thor can fit server-class capability in nearly the same footprint as Orin at about 100 × 87 mm. Thor’s GPU runs at 1.57 GHz while maintaining efficiency per core.
Engineers should note that newer nodes can bring different operating requirements. Thor’s dense circuitry may be more sensitive to voltage drop and require cleaner power delivery on the carrier board, though NVIDIA has accounted for this in the module design. Newer nodes can also mean higher wafer costs and tighter supply early in production. Orin’s platform is more mature in market availability, which can translate to broader near-term module and carrier options.
From a system cost perspective, more power requires stricter regulators, larger batteries, and bigger coolers, which add weight and volume. Orin can scale down to around 15 W, making it suitable for lightweight robots or wearable devices that need low idle power. Thor’s minimum of about 40 W sets a higher floor, even with always-on cores handling light tasks, so it is less practical in those scenarios.
Thermal and power management remain important for both, but Thor raises the bar. Orin could throttle under heavy load in warm conditions, and Thor will require systems built to dissipate over 100 W reliably. This makes Orin the better fit for power- or heat-constrained applications, while Thor is designed for robots that demand maximum performance and can accommodate a larger power envelope.
Migrating from Jetson Orin to Jetson Thor is straightforward. Both share the same NVIDIA AI software stack, with Thor introducing JetPack 7 on Ubuntu 24.04 LTS (with Linux kernel 6.8) and CUDA 13. API compatibility ensures applications built with CUDA, TensorRT, OpenCV, and ROS on Orin run on Thor with little change, often faster out of the box. Developers can also re-optimize for FP8 to gain additional speed while continuing to use frameworks like PyTorch and TensorFlow. Isaac SDK and Isaac ROS migrate smoothly, and Thor’s larger compute budget with MIG support allows heavier models, higher resolutions, and split workloads to run reliably.
Hardware and ecosystem form the main differences. Thor is not pin-compatible with Orin, so new carrier boards are required, though vendors are already preparing updated solutions with BSP support. Orin benefits from years of maturity and community knowledge, while Thor enters as the newer platform with fewer references at launch.
Ultimately, the choice between Jetson Thor and Jetson Orin should be driven by your project’s requirements and constraints. Below is a neutral guide to help decide which module is appropriate given various considerations:
In essence, choose Jetson Thor for the most demanding, cutting-edge applications where its strengths (extreme performance and advanced features like MIG and lockstep cores) can be fully utilized. Choose Jetson Orin for applications that fall within the current state-of-the-art in edge AI and where efficiency, lower power, and a well-trodden path are priorities. In many cases, Orin can handle what you need today, and Thor is what you’d consider for what you might need tomorrow – it’s about aligning the tool to the job.
ACROSSER is a proud product partner with EverFocus in developing NVIDIA-powered industrial PC lineups. These systems integrate Jetson technology to bring edge AI computing to industries such as physical AI, surveillance, transportation, and automation.
Curious about how NVIDIA Jetson Thor performs in real-world applications? Apply for a 90-day free trial and experience the power of edge AI firsthand.
Monthly insights and updates. No spam. Just what matters.
We partner with tech leaders to turn ideas into reality, from concept to deployment.