Specifications Compared
| Spec | H100 | L40 |
|---|---|---|
| TDP | 700W | 300W |
| VRAM | 80-94 GB | 48 GB |
| CUDA Cores | 16,896 | 18,176 |
| Memory Type | HBM3 | GDDR6 |
| Architecture | Hopper | Ada Lovelace |
| Form Factors | SXM5, PCIe, NVL | PCIe |
| Interconnect | NVLink, PCIe 5.0, InfiniBand | |
| Tensor Cores | 528 | 568 |
| FP8 Performance | 3,958 TFLOPS | |
| FP16 Performance | 1,979 TFLOPS | 90.5 TFLOPS |
| FP32 Performance | 67 TFLOPS | 90.5 TFLOPS |
| FP64 Performance | 34 TFLOPS | |
| INT8 Performance | 3,958 TOPS | 724 TOPS |
| Memory Bandwidth | 3,350 GB/s | 864 GB/s |
Performance Analysis
The H100 dominates in tensor-heavy AI tasks: its FP16 performance hits 1979 TFLOPS, over 21 times the L40's 90.5 TFLOPS, accelerating model training where half-precision dominates. For FP32 workloads, the L40 edges ahead at 90.5 TFLOPS versus H100's 67 TFLOPS, benefiting scientific simulations or graphics rendering that rely on single-precision floats. The H100's FP8 capability at 3958 TFLOPS further enhances low-precision inference, enabling quantized large language models at scale.
Memory bandwidth reveals stark differences: H100's 3350 GB/s supports larger batch sizes and faster data movement compared to L40's 864 GB/s, reducing bottlenecks in training loops with high-resolution datasets. VRAM capacity amplifies this: 80 to 94 GB on H100 accommodates models exceeding 48 GB on L40, preventing out-of-memory errors in fine-tuning or inference of massive transformers.
Power dynamics matter for deployments: H100's 700W TDP demands robust cooling versus L40's efficient 300W, influencing cloud instance density and operational costs.
Live Cloud Pricing
Real-time prices from 25+ providers. Updated every 60 seconds.
H100
| Provider | GPU Model | VRAM | Host Specs | Region | Price | Status | Action | |
|---|---|---|---|---|---|---|---|---|
![]() Hyperstack | 4×NVIDIA H100 PCIe 80GB VRAM | 80GB | 124 vCPU 720GB RAM 3300GB Storage | Canada | $1.90/GPU/hr $7.60/hr total (4×) | Available | ||
![]() Hyperstack | 2×NVIDIA H100 PCIe 80GB VRAM | 80GB | 60 vCPU 360GB RAM 1600GB Storage | Canada | $1.90/GPU/hr $3.80/hr total (2×) | Available | ||
![]() Hyperstack | 8×NVIDIA H100 PCIe 80GB VRAM | 80GB | 252 vCPU 1440GB RAM 6600GB Storage | Canada | $1.90/GPU/hr $15.20/hr total (8×) | Available | ||
![]() Hyperstack | NVIDIA H100 PCIe 80GB VRAM | 80GB | 28 vCPU 180GB RAM 850GB Storage | Canada | $1.90/GPU/hr | Available | ||
![]() Hyperstack | 8×NVIDIA H100 PCIe 80GB VRAM | 80GB | 252 vCPU 1440GB RAM 6600GB Storage | Canada | $1.95/GPU/hr $15.60/hr total (8×) | Available |
L40
| Provider | GPU Model | VRAM | Host Specs | Region | Price | Status | Action | |
|---|---|---|---|---|---|---|---|---|
![]() TensorDock | NVIDIA L40S 48GB VRAM | 48GB | 0 vCPU 0GB RAM | Wolverhampton | $0.55/GPU/hr | Available | ||
![]() RunPod | NVIDIA L40 48GB VRAM | 48GB | 8 vCPU 94GB RAM | 🌍global | $0.82/GPU/hr | |||
![]() RunPod | NVIDIA L40S 48GB VRAM | 48GB | 16 vCPU 94GB RAM | 🌍global | $0.86/GPU/hr | |||
![]() Massed Compute | NVIDIA L40 48GB VRAM | 48GB | 14 vCPU 72GB RAM 625GB Storage | Iowa | $0.86/GPU/hr | Available | ||
![]() Massed Compute | 2×NVIDIA L40 48GB VRAM | 48GB | 26 vCPU 144GB RAM 1250GB Storage | Iowa | $0.86/GPU/hr $1.72/hr total (2×) | Available |
When to Choose the H100
Opt for the H100 in large-scale LLM training or fine-tuning: its 80 to 94 GB HBM3 VRAM handles models beyond 48 GB, while 1979 TFLOPS FP16 and 3350 GB/s bandwidth enable rapid iterations on datasets with billion-parameter architectures. Scenarios demanding NVLink interconnects or PCIe 5.0 for multi-GPU clusters favor H100, despite higher average pricing of $3.19 per hour.
When to Choose the L40
Select the L40 for cost-sensitive inference or graphics workloads: at $0.67 per hour minimum and 300W TDP, it delivers 90.5 TFLOPS FP16 within 48 GB GDDR6 VRAM for Stable Diffusion or lighter LLMs. PCIe form factor simplifies single-node setups without H100's power overhead.
Use Cases
H100's 80 to 94 GB HBM3 VRAM and 1979 TFLOPS FP16 support massive models and large batches unattainable on L40's 48 GB GDDR6.
H100's FP8 at 3958 TFLOPS and 3350 GB/s bandwidth excel in high-throughput quantized inference; L40 suffices for smaller models.
H100 accommodates parameter-heavy fine-tuning with superior 1979 TFLOPS FP16 versus L40's 90.5 TFLOPS.
L40's 90.5 TFLOPS FP32 and 48 GB VRAM handle image generation efficiently at lower $0.67 per hour cost.
L40 matches or exceeds H100 in FP32 at 90.5 TFLOPS with 300W TDP, ideal for simulations without AI-specific overhead.
Frequently Asked Questions
What is the VRAM difference between H100 and L40?▾
H100 offers 80 to 94 GB HBM3 VRAM, far exceeding L40's 48 GB GDDR6. This enables H100 for larger models in training. L40 fits mid-sized workloads efficiently.
How do cloud prices compare for H100 vs L40?▾
H100 starts at $0.80 per hour, averaging $3.19 across 55 offers. L40 begins at $0.67 per hour, averaging $0.87 across 12 offers. L40 provides better value for lighter tasks.
Which has higher FP16 performance?▾
H100 achieves 1979 TFLOPS FP16, over 21 times L40's 90.5 TFLOPS. This gap favors H100 in AI training. L40 competes better in FP32 at 90.5 TFLOPS.
What are the power requirements?▾
H100 consumes 700W TDP, requiring advanced cooling. L40 uses 300W TDP for higher density. Choose based on infrastructure limits.
Does H100 support NVLink?▾
H100 includes NVLink, PCIe 5.0, and InfiniBand interconnects for multi-GPU scaling. L40 relies on PCIe alone. H100 suits clustered training.
Which is newer, Hopper or Ada Lovelace?▾
L40 uses Ada Lovelace from 2023; H100 employs Hopper from 2022. Architecture differences prioritize AI in Hopper versus versatility in Ada.
Which is cheaper to rent, the H100 or the L40?▾
Cloud rental prices for both the H100 and L40 vary by provider, configuration, and availability. This page shows live pricing from 25+ providers updated every 60 seconds. Scroll to the Live Cloud Pricing section to compare current rates.
How much VRAM does the H100 have compared to the L40?▾
The H100 has 80 to 94 GB of HBM3 memory. The L40 has 48 GB of GDDR6 memory.
Can I find H100 and L40 GPUs available to rent right now?▾
Yes. This page shows real-time availability across 25+ cloud GPU providers. The Live Cloud Pricing section displays only in-stock offers with current pricing.
What is the main difference between the H100 and the L40?▾
The H100 uses the Hopper architecture (2022) while the L40 uses Ada Lovelace (2023). The H100 delivers 21.9x the FP16 throughput and 3.9x the memory bandwidth of the L40.



