Specifications Compared
| Spec | L40 | V100 |
|---|---|---|
| TDP | 300W | 300W |
| VRAM | 48 GB | 16-32 GB |
| CUDA Cores | 18,176 | 5,120 |
| Memory Type | GDDR6 | HBM2 |
| Architecture | Ada Lovelace | Volta |
| Form Factors | PCIe | SXM2, PCIe |
| Interconnect | NVLink, PCIe 3.0 | |
| Tensor Cores | 568 | 640 |
| FP16 Performance | 90.5 TFLOPS | 125 TFLOPS |
| FP32 Performance | 90.5 TFLOPS | 15.7 TFLOPS |
| INT8 Performance | 724 TOPS | |
| Memory Bandwidth | 864 GB/s | 900 GB/s |
Performance Analysis
FP32 performance marks a clear upgrade in the L40: 90.5 TFLOPS versus V100's 15.7 TFLOPS, accelerating training workflows that depend on single-precision arithmetic. In contrast, V100's 125 TFLOPS FP16 exceeds L40's 90.5 TFLOPS, suiting inference tasks optimized for half-precision tensor operations. This delta means L40 excels in balanced training and inference pipelines, while V100 favors FP16-dominant inference. The L40's 48 GB VRAM surpasses V100's 32 GB, enabling larger batch sizes and handling bigger models without swapping, which reduces latency in memory-bound scenarios. Memory bandwidth remains close at 864 GB/s for L40 and 900 GB/s for V100, but L40's GDDR6 pairs with higher capacity for sustained throughput in extended runs. Overall, these specs translate to L40's edge in diverse real-world AI applications requiring versatility.
Live Cloud Pricing
Real-time prices from 25+ providers. Updated every 60 seconds.
L40
| Provider | GPU Model | VRAM | Host Specs | Region | Price | Status | Action | |
|---|---|---|---|---|---|---|---|---|
![]() TensorDock | NVIDIA L40S 48GB VRAM | 48GB | 0 vCPU 0GB RAM | Wolverhampton | $0.55/GPU/hr | Available | ||
![]() RunPod | NVIDIA L40 48GB VRAM | 48GB | 8 vCPU 94GB RAM | 🌍global | $0.82/GPU/hr | |||
![]() RunPod | NVIDIA L40S 48GB VRAM | 48GB | 16 vCPU 94GB RAM | 🌍global | $0.86/GPU/hr | |||
![]() Massed Compute | NVIDIA L40 48GB VRAM | 48GB | 14 vCPU 72GB RAM 625GB Storage | Iowa | $0.86/GPU/hr | Available | ||
![]() Massed Compute | 2×NVIDIA L40 48GB VRAM | 48GB | 26 vCPU 144GB RAM 1250GB Storage | Iowa | $0.86/GPU/hr $1.72/hr total (2×) | Available |
Tesla V100 32GB
| Provider | GPU Model | VRAM | Host Specs | Region | Price | Status | Action | |
|---|---|---|---|---|---|---|---|---|
![]() TensorDock | NVIDIA Tesla V100 16GB 16GB VRAM | 16GB | 0 vCPU 0GB RAM | Texas | $0.19/GPU/hr | Available | ||
![]() TensorDock | NVIDIA Tesla V100 16GB 16GB VRAM | 16GB | 0 vCPU 0GB RAM | New York City | $0.19/GPU/hr | Available | ||
![]() TensorDock | NVIDIA Tesla V100 32GB 32GB VRAM | 32GB | 0 vCPU 0GB RAM | Texas | $0.29/GPU/hr | Available | ||
![]() TensorDock | NVIDIA Tesla V100 32GB 32GB VRAM | 32GB | 0 vCPU 0GB RAM | New York City | $0.29/GPU/hr | Available | ||
![]() Lambda Labs | 8×NVIDIA Tesla V100 16GB 16GB VRAM | 16GB | 88 vCPU 448GB RAM 6041GB Storage | Texas | $0.79/GPU/hr $6.32/hr total (8×) | Available |
When to Choose the L40
Opt for the NVIDIA L40 in scenarios demanding high VRAM and balanced compute: its 48 GB GDDR6 supports large-scale LLM training or fine-tuning with batch sizes infeasible on V100's 32 GB HBM2. The 90.5 TFLOPS FP32 performance halves training times compared to V100's 15.7 TFLOPS for FP32-heavy tasks. Newer Ada Lovelace architecture ensures compatibility with latest software stacks and optimizations.
When to Choose the Tesla V100 32GB
Select the NVIDIA Tesla V100 32GB for cost-sensitive FP16 inference workloads: starting at $0.29 per hour, it leverages 125 TFLOPS FP16 to outperform L40's 90.5 TFLOPS in half-precision operations. Legacy applications tuned for Volta architecture or NVLink interconnects benefit from its maturity across 46 cloud offers. Lower entry pricing suits experimentation or short bursts where 32 GB HBM2 suffices.
Use Cases
L40's 48 GB VRAM and 90.5 TFLOPS FP32 support larger models and faster training iterations than V100's 32 GB and 15.7 TFLOPS FP32.
Higher 48 GB VRAM on L40 accommodates bigger batches for production inference, offsetting V100's 125 TFLOPS FP16 advantage.
Balanced 90.5 TFLOPS FP16/FP32 and 48 GB VRAM on L40 optimize fine-tuning of large models, surpassing V100's FP32 bottleneck.
L40's 48 GB VRAM handles high-resolution image generation without memory constraints, paired with Ada Lovelace efficiencies.
V100's 125 TFLOPS FP16 and NVLink interconnect suit HPC simulations optimized for Volta, at lower starting price of $0.29 per hour.
Frequently Asked Questions
What is the VRAM capacity of L40 versus V100 32GB?▾
The L40 provides 48 GB GDDR6 VRAM, exceeding the V100 32GB's 32 GB HBM2. This difference allows L40 to manage larger datasets or models. Bandwidth stands at 864 GB/s for L40 and 900 GB/s for V100.
Which GPU has better FP32 performance?▾
L40 delivers 90.5 TFLOPS FP32, far surpassing V100's 15.7 TFLOPS. This gap benefits training tasks reliant on single-precision. FP16 favors V100 at 125 TFLOPS over L40's 90.5 TFLOPS.
How do cloud prices compare for these GPUs?▾
L40 pricing starts at $0.67 per hour with an average of $0.89 across 14 offers. V100 32GB begins at $0.29 per hour averaging $1.01 across 46 offers. Entry-level costs favor V100 for budget runs.
Do both GPUs have the same TDP?▾
Yes, both L40 and V100 operate at 300W TDP. This equality simplifies power planning in cloud instances. Form factors differ: L40 uses PCIe, V100 supports SXM2 and PCIe.
What architectures power these GPUs?▾
L40 employs Ada Lovelace from 2023, while V100 uses Volta from 2017. Ada offers modern features for AI. V100 includes NVLink for multi-GPU scaling.
Is L40 better for large model training?▾
Yes, L40's 48 GB VRAM and 90.5 TFLOPS FP32 outperform V100's 32 GB and 15.7 TFLOPS FP32. This supports bigger batches in LLM training. V100 suits FP16 inference instead.
Which is cheaper to rent, the L40 or the V100?▾
Cloud rental prices for both the L40 and V100 vary by provider, configuration, and availability. This page shows live pricing from 25+ providers updated every 60 seconds. Scroll to the Live Cloud Pricing section to compare current rates.
How much VRAM does the L40 have compared to the V100?▾
The L40 has 48 GB of GDDR6 memory. The V100 has 16 to 32 GB of HBM2 memory.
Can I find L40 and V100 GPUs available to rent right now?▾
Yes. This page shows real-time availability across 25+ cloud GPU providers. The Live Cloud Pricing section displays only in-stock offers with current pricing.
What is the main difference between the L40 and the V100?▾
The L40 uses the Ada Lovelace architecture (2023) while the V100 uses Volta (2017). The V100 delivers 1.4x the FP16 throughput and 1.0x the memory bandwidth of the L40.



