Specifications Compared
| Spec | L40S | RTX-A6000 |
|---|---|---|
| TDP | 350W | 300W |
| VRAM | 48 GB | 48 GB |
| CUDA Cores | 18,176 | 10,752 |
| Memory Type | GDDR6X | GDDR6 |
| Architecture | Ada Lovelace | Ampere |
| Form Factors | PCIe | PCIe |
| Interconnect | PCIe 4.0 | NVLink |
| Tensor Cores | 568 | 336 |
| FP8 Performance | 724 TFLOPS | |
| FP16 Performance | 362 TFLOPS | 38.7 TFLOPS |
| FP32 Performance | 91 TFLOPS | 38.7 TFLOPS |
| FP64 Performance | 1.4 TFLOPS | 0.6 TFLOPS |
| INT8 Performance | 724 TOPS | |
| Memory Bandwidth | 864 GB/s | 768 GB/s |
Performance Analysis
Superior compute performance defines the L40S: its 362 TFLOPS FP16 capability vastly exceeds the RTX A6000's 38.7 TFLOPS, enabling faster neural network training where half-precision computations dominate. The L40S FP32 performance at 91 TFLOPS also doubles the A6000's 38.7 TFLOPS, benefiting scientific simulations and rendering tasks requiring single-precision arithmetic. This generational leap from Ampere to Ada Lovelace translates to reduced training times for large models.
For inference, the L40S introduces 724 TFLOPS FP8 support, absent on the A6000, allowing quantized models to process more tokens per second. Higher memory bandwidth of 864 GB/s on the L40S versus 768 GB/s supports larger batch sizes, minimizing data loading bottlenecks in deep learning pipelines. Although the L40S draws 350W TDP compared to 300W on the A6000, its efficiency per watt improves for high-throughput workloads.
Interconnect options differ: PCIe 4.0 on the L40S versus NVLink on the A6000, impacting multi-GPU scaling in specific setups.
Live Cloud Pricing
Real-time prices from 25+ providers. Updated every 60 seconds.
L40S
| Provider | GPU Model | VRAM | Host Specs | Region | Price | Status | Action | |
|---|---|---|---|---|---|---|---|---|
![]() TensorDock | NVIDIA L40S 48GB VRAM | 48GB | 0 vCPU 0GB RAM | Wolverhampton | $0.55/GPU/hr | Available | ||
![]() RunPod | NVIDIA L40S 48GB VRAM | 48GB | 16 vCPU 94GB RAM | 🌍global | $0.86/GPU/hr | |||
![]() Massed Compute | NVIDIA L40S 48GB VRAM | 48GB | 12 vCPU 72GB RAM 625GB Storage | Iowa | $0.88/GPU/hr | Available | ||
![]() Massed Compute | 2×NVIDIA L40S 48GB VRAM | 48GB | 24 vCPU 144GB RAM 1250GB Storage | Iowa | $0.88/GPU/hr $1.76/hr total (2×) | Available | ||
![]() Massed Compute | NVIDIA L40S 48GB VRAM | 48GB | 12 vCPU 72GB RAM 625GB Storage | Iowa | $0.88/GPU/hr | Available |
RTX A6000
| Provider | GPU Model | VRAM | Host Specs | Region | Price | Status | Action | |
|---|---|---|---|---|---|---|---|---|
![]() TensorDock | NVIDIA RTX A6000 48GB VRAM | 48GB | 0 vCPU 0GB RAM | Chubbuck, Idaho | $0.40/GPU/hr | Available | ||
![]() RunPod | NVIDIA RTX A6000 48GB VRAM | 48GB | 9 vCPU 50GB RAM | 🌍global | $0.49/GPU/hr | |||
![]() Hyperstack | NVIDIA RTX A6000 48GB VRAM | 48GB | 28 vCPU 58GB RAM 100GB Storage | Canada | $0.50/GPU/hr | Available | ||
![]() Hyperstack | 2×NVIDIA RTX A6000 48GB VRAM | 48GB | 60 vCPU 116GB RAM 300GB Storage | Canada | $0.50/GPU/hr $1.00/hr total (2×) | Available | ||
![]() Massed Compute | NVIDIA RTX A6000 48GB VRAM | 48GB | 6 vCPU 32GB RAM 256GB Storage | Iowa | $0.55/GPU/hr | Available |
When to Choose the L40S
Select the L40S for modern AI workloads demanding peak performance. Its 362 TFLOPS FP16 and 724 TFLOPS FP8 excel in LLM training and inference, where the RTX A6000's 38.7 TFLOPS falls short. The 864 GB/s bandwidth handles large batches efficiently, ideal for datacenter-scale deployments despite the 350W TDP.
When to Choose the RTX A6000
Opt for the RTX A6000 in budget-conscious scenarios or legacy applications optimized for Ampere. Starting at $0.25 per hour with 62 live offers, it provides value at 48 GB VRAM and 768 GB/s bandwidth. NVLink support aids multi-GPU setups where Ada compatibility is unnecessary, and 300W TDP suits power-limited environments.
Use Cases
The L40S's 362 TFLOPS FP16 and 91 TFLOPS FP32 enable significantly faster training iterations than the A6000's 38.7 TFLOPS in both precisions.
724 TFLOPS FP8 on the L40S accelerates quantized inference, while 864 GB/s bandwidth supports larger batches compared to the A6000's limitations.
Higher FP16 performance at 362 TFLOPS on the L40S speeds up fine-tuning of large models over the A6000's 38.7 TFLOPS.
The L40S's Ada architecture and 864 GB/s bandwidth generate images faster than the A6000, leveraging 48 GB VRAM effectively.
91 TFLOPS FP32 on the L40S outperforms the A6000's 38.7 TFLOPS for simulations, with higher bandwidth aiding data-heavy computations.
Frequently Asked Questions
Which GPU has more VRAM, L40S or RTX A6000?▾
Both the L40S and RTX A6000 feature 48 GB of VRAM. The L40S uses GDDR6X, while the A6000 employs GDDR6.
How does L40S FP16 performance compare to RTX A6000?▾
The L40S delivers 362 TFLOPS FP16, over 9 times the RTX A6000's 38.7 TFLOPS. This gap accelerates AI training significantly.
What is the memory bandwidth difference?▾
L40S offers 864 GB/s bandwidth versus 768 GB/s on the RTX A6000. Higher bandwidth on L40S supports larger batch sizes.
Which is cheaper in the cloud?▾
RTX A6000 starts at $0.25 per hour with average $1.03 across 62 offers, compared to L40S at $0.40 per hour average $1.10 across 18 offers.
Does L40S support FP8?▾
Yes, the L40S provides 724 TFLOPS FP8 for efficient inference. The RTX A6000 lacks this capability.
What are the TDPs?▾
L40S has a 350W TDP, while RTX A6000 is 300W. Both are PCIe form factors suitable for datacenters.
Which is cheaper to rent, the L40S or the RTX A6000?▾
Cloud rental prices for both the L40S and RTX A6000 vary by provider, configuration, and availability. This page shows live pricing from 25+ providers updated every 60 seconds. Scroll to the Live Cloud Pricing section to compare current rates.
How much VRAM does the L40S have compared to the RTX A6000?▾
The L40S has 48 GB of GDDR6X memory. The RTX A6000 has 48 GB of GDDR6 memory.
Can I find L40S and RTX A6000 GPUs available to rent right now?▾
Yes. This page shows real-time availability across 25+ cloud GPU providers. The Live Cloud Pricing section displays only in-stock offers with current pricing.
What is the main difference between the L40S and the RTX A6000?▾
The L40S uses the Ada Lovelace architecture (2023) while the RTX A6000 uses Ampere (2020). The L40S delivers 9.4x the FP16 throughput and 1.1x the memory bandwidth of the RTX A6000.



