Rent NVIDIA H200 NVL Cloud Instances
📊 Pricing at a Glance
NVIDIA H200 NVL rental pricing ranges from $3.62/GPU/hr to $5.25/GPU/hr across 24 instances from 4 providers (updated June 2026).
Looking for a specific provider? See Massed Compute NVIDIA H200 NVL, Vast.ai NVIDIA H200 NVL, or RunPod NVIDIA H200 NVL.
Available Offers
Compare the five cheapest offers from 4 providers.
| Provider | GPU Model | VRAM (per GPU) | Host Specs | Region | Price | Status |
|---|---|---|---|---|---|---|
| Massed Compute | NVIDIA H200 NVL | 141GB | 16 vCPU, 180GB RAM, 750GB Storage | Virginia | $3.62/GPU/hr | Available |
| Massed Compute | 2× NVIDIA H200 NVL | 141GB | 30 vCPU, 360GB RAM, 1500GB Storage | Virginia | $3.62/GPU/hr ($7.24/hr total) | Available |
| Vast.ai | 2× NVIDIA H200 NVL | 141GB | 512 vCPU, 567GB RAM, 3177GB Storage | Czechia | $3.69/GPU/hr ($7.38/hr total) | Available |
| Massed Compute | 4× NVIDIA H200 NVL | 141GB | 62 vCPU, 720GB RAM, 3000GB Storage | Virginia | $3.62/GPU/hr ($14.48/hr total) | Available |
| RunPod | NVIDIA H200 NVL | 141GB | 0 vCPU, 0GB RAM | Global | $3.79/GPU/hr | |
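The per-instance totals in the table are the per-GPU rate multiplied by the GPU count. The short Python sketch below reproduces that arithmetic and extends it to a rental period. The function names and the 24-hour duration are illustrative assumptions, and the sketch ignores storage, egress, and any minimum-billing rules a provider may apply.

```python
def total_hourly_cost(price_per_gpu_hr: float, gpu_count: int) -> float:
    """Hourly price of an instance: per-GPU rate times number of GPUs."""
    return round(price_per_gpu_hr * gpu_count, 2)


def rental_cost(price_per_gpu_hr: float, gpu_count: int, hours: float) -> float:
    """Total cost of renting the instance for a given number of hours."""
    return round(price_per_gpu_hr * gpu_count * hours, 2)


# Offers from the table: (provider, GPU count, price per GPU-hour in USD).
offers = [
    ("Massed Compute", 1, 3.62),
    ("Massed Compute", 2, 3.62),
    ("Vast.ai", 2, 3.69),
    ("Massed Compute", 4, 3.62),
    ("RunPod", 1, 3.79),
]

for provider, gpus, rate in offers:
    hourly = total_hourly_cost(rate, gpus)
    day = rental_cost(rate, gpus, 24)  # hypothetical 24-hour job
    print(f"{provider} {gpus}x: ${hourly:.2f}/hr, ${day:.2f} for 24 hours")
```

Comparing on the per-GPU rate rather than the instance total keeps multi-GPU and single-GPU offers on the same footing.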



QuantaCloud
H-series supply is constrained.
Most providers are sold out through Q3 2026. If you need 16+ GPU reserved or cluster capacity in the next 90 days, we can quote B300 or remaining Hopper inventory within 24 hours.
Technical Specifications
Strengths & Limitations
Strengths:
- Exceptional memory capacity and bandwidth for large datasets.
- Optimized for large language models and generative AI.
- High FP8 performance for accelerated AI training and inference.
- NVL configuration provides increased computational throughput.
- Leverages the advanced features of the Hopper architecture.

Limitations:
- Higher cost compared to previous generation GPUs.
- Power consumption can be significant under heavy load.
- NVL configuration may require specific software optimizations.
- Availability may be limited due to high demand.
- May be overkill for smaller or less memory-intensive workloads.
Top Use Cases
- Large language model training: the H200 NVL's large memory capacity and high bandwidth are ideal for training massive language models with billions of parameters.
- Recommendation systems: it accelerates the training and inference of complex recommendation models that require processing large user and item datasets.
- Generative AI: it enables the creation of high-quality images, videos, and other content using generative AI models.
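To get a rough sense of what "billions of parameters" means against 141GB of VRAM per GPU, the sketch below estimates the memory taken by model weights alone. It is a back-of-the-envelope calculation under stated assumptions: the bytes-per-parameter values are the standard sizes of each data type, GB is taken as 10^9 bytes, and activations, KV cache, gradients, and optimizer state are all ignored, so real requirements (especially for training) are considerably higher. The function names are hypothetical.

```python
import math

H200_NVL_VRAM_GB = 141  # per-GPU VRAM listed in the offer table

# Storage size of one parameter in each numeric format.
BYTES_PER_PARAM = {"fp32": 4, "fp16": 2, "bf16": 2, "fp8": 1}


def weights_gb(params_billions: float, dtype: str) -> float:
    """Memory for weights only, in GB (1 GB = 1e9 bytes).

    1e9 parameters at N bytes each is N GB, so the result is simply
    billions of parameters times bytes per parameter.
    """
    return params_billions * BYTES_PER_PARAM[dtype]


def min_gpus_for_weights(params_billions: float, dtype: str,
                         vram_gb: float = H200_NVL_VRAM_GB) -> int:
    """Lower bound on GPU count needed just to hold the weights."""
    return math.ceil(weights_gb(params_billions, dtype) / vram_gb)


# Illustrative model sizes (assumptions, not benchmarks).
for size_b, dtype in [(70, "fp16"), (70, "fp8"), (405, "bf16")]:
    gb = weights_gb(size_b, dtype)
    gpus = min_gpus_for_weights(size_b, dtype)
    print(f"{size_b}B params in {dtype}: {gb:.0f} GB of weights, at least {gpus} GPU(s)")
```

The result is a floor, not a sizing recommendation: a model whose weights nearly fill one GPU leaves almost no room for runtime memory and will in practice need more GPUs or a smaller data type.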
Real-World Benchmark
Market Analysis
The NVIDIA H200 NVL competes in the high-end GPU market, targeting organizations with demanding AI and HPC workloads. Its current rental price of $3.62/GPU/hr to $5.25/GPU/hr reflects its advanced capabilities and limited availability. It is positioned above GPUs like the NVIDIA A100 ($0.12/hr - $0.78/hr) and H100 ($0.74/hr - $1.87/hr) in terms of performance and memory capacity, but below the B200 ($5.19/hr - $42.00/hr). Its primary competitors include other high-end GPUs from NVIDIA and AMD, as well as custom-designed AI accelerators.
Frequently Asked Questions
What is the main advantage of the H200 NVL over the H100?
The H200 NVL offers significantly increased memory capacity and bandwidth compared to the H100, making it better suited for large language models and other memory-intensive applications.
What type of workloads benefit most from the H200 NVL?
Workloads that require processing large datasets and performing complex computations, such as large language model training, recommendation systems, and generative AI, benefit most from the H200 NVL.
Is the H200 NVL suitable for gaming?
No. The H200 NVL is a powerful GPU, but it is designed for AI and HPC workloads, not gaming, and its cost is prohibitive compared to gaming-focused GPUs.
Alternative GPUs
Journalists, bloggers, and researchers: you are welcome to cite our data in your articles with attribution. Our pricing database is updated in real time from 4+ cloud providers.
