NVIDIAUnknown Architecture

Rent NVIDIA L40S Cloud Instances

The NVIDIA L40S GPU is a data center GPU designed for demanding visualization, compute, and AI workloads. It offers a balance of performance and features, making it suitable for a wide range of professional applications.

📊 Pricing at a Glance

Cheapest Provider
TensorDock
$0.55/GPU/hr
Most Expensive
CoreWeave
$2.25/GPU/hr
Median Price
$1.20/GPU/hr
Total Instances
88
Providers
16
Last Updated
June 5, 2026

NVIDIA L40S rental pricing ranges from $0.55/GPU/hr to $2.25/GPU/hr across 88 instances from 16 providers (updated June 2026).

Looking for a specific provider? See TensorDock NVIDIA L40S, Massed Compute NVIDIA L40S, or VERDA NVIDIA L40S.

Available Offers

Compare the top 5 cheapest offers from 16 providers.

18 instances available
NVIDIA L40S
48GB VRAM
0 vCPU
0GB RAM
$0.55/GPU/hr
NVIDIA L40S
48GB VRAM
12 vCPU
72GB RAM
625GB Storage
$0.88/GPU/hr
NVIDIA L40S
48GB VRAM
12 vCPU
72GB RAM
625GB Storage
$0.88/GPU/hr
VERDA
VERDA
Helsinki
Available
NVIDIA L40S
48GB VRAM
20 vCPU
60GB RAM
$1.37/GPU/hr
Ori
Ori
Lille
Available
NVIDIA L40S
48GB VRAM
15 vCPU
90GB RAM
400GB Storage
$1.55/GPU/hr

QuantaCloud

Need GPUs at scale?

Building out an inference fleet or training cluster? QuantaCloud brokers reserved capacity across multiple data center partners. 16+ GPUs, flexible terms, custom quote in 24 hours.

No waitlist24hr quote turnaroundInfiniBand fabric

Technical Specifications

CUDA cores
18176
Memory type
48 GB GDDR6
Tensor cores
568 (3rd Generation)
FP8 performance
365 TFLOPS
Memory bandwidth
864 GB/s

Strengths & Limitations

Advantages
  • High memory capacity (48GB) ideal for large datasets and complex models.
  • Excellent performance for both visualization and compute workloads.
  • Supports NVIDIA virtual GPU (vGPU) software for improved resource utilization and management.
  • Optimized for AI inference and training.
  • Strong FP32 and Tensor Core performance.
Limitations
  • Higher power consumption compared to lower-end GPUs.
  • May be overkill for less demanding tasks.
  • Price point is higher than entry-level data center GPUs.

Top Use Cases

AI Inference

The L40S excels at accelerating AI inference workloads, enabling real-time insights and decision-making. Its Tensor Cores provide significant performance gains for deep learning models.

Data Science and Analytics

With its large memory capacity and powerful compute capabilities, the L40S is well-suited for data science tasks such as data analysis, model training, and simulation.

Professional Visualization

The L40S delivers exceptional performance for professional visualization applications, including CAD, CAE, and digital content creation. It supports high-resolution displays and complex 3D models.

Real-World Benchmark

AI Inference Performance (ResNet-50)
The NVIDIA L40S demonstrates strong performance in AI inference benchmarks, such as ResNet-50, achieving high throughput and low latency. This makes it suitable for real-time AI applications.
Est. Cost$0.22/hr

Market Analysis

The NVIDIA L40S occupies a mid-range position in the data center GPU market, offering a compelling balance of performance and features for a variety of workloads. It competes with other GPUs in the same price range, such as the A30 ($0.22/hr), and offers a step up in performance from the L4 ($0.20/hr). Its large memory capacity makes it a strong contender for memory-intensive applications.

Frequently Asked Questions

What is the typical power consumption of the NVIDIA L40S?â–¾

The typical power consumption of the NVIDIA L40S is around 300W.

Does the NVIDIA L40S support NVIDIA vGPU?â–¾

Yes, the NVIDIA L40S supports NVIDIA vGPU software, allowing for virtualization and sharing of GPU resources across multiple virtual machines.

What type of workloads is the NVIDIA L40S best suited for?â–¾

The NVIDIA L40S is well-suited for a wide range of workloads, including AI inference, data science, professional visualization, and virtual workstations.

Alternative GPUs

NVIDIA A30
$0.22/hr

Similar price point, but may offer different performance characteristics depending on the specific workload. The A30 has less memory (24GB) but may have better performance per dollar for certain compute tasks.

NVIDIA L4
$0.20/hr

A lower-cost alternative for less demanding workloads. The L4 has significantly lower memory (24GB) and compute performance, but is more power-efficient.

NVIDIA A10
$0.16/hr

A slightly less expensive option that still provides good performance for visualization and some AI tasks, although with less memory (24GB) and lower overall compute power.

Cite This Data
This pricing data is updated daily and free to cite with attribution.
Source: GPUPerHour.com — NVIDIA L40S GPU Rental Pricing Comparison (June 2026)

Journalists, bloggers, and researchers: You're welcome to cite our data in your articles with attribution. Our pricing database is updated in real-time from 16+ cloud providers.

L40S Pricing: What It Costs in 2026

â–¾

L40S cloud GPU pricing ranges from $0.55/hr on TensorDock to $2.25/hr on CoreWeave, based on 88 offers tracked by GPUPerHour across 16 providers. 18 instances are currently in stock across 22 regions.

Running a L40S continuously for one month at the cheapest available rate costs approximately $396. Most providers bill per second or per minute, so shorter jobs cost proportionally less. Prices on GPUPerHour update every 60 seconds, reflecting real-time changes in provider pricing and availability.

Renting L40S: Which Provider to Choose

â–¾

GPUPerHour tracks L40S offers from 16 providers. The cheapest option is TensorDock at $0.55/hr, followed by Massed Compute at $0.88/hr and VERDA at $1.37/hr.

Price is not the only factor when choosing a provider. Availability matters: 18 of 88 instances are in stock right now. Billing increments, region coverage, and security certifications also vary between providers. Use the pricing tool to filter by region, availability, and provider features.

How L40S Compares

â–¾

Compared to alternatives, the NVIDIA A30 is available from $0.22/hr. Similar price point, but may offer different performance characteristics depending on the specific workload. The A30 has less memory (24GB) but may have better performance per dollar for certain compute tasks. The NVIDIA L4 is available from $0.20/hr. A lower-cost alternative for less demanding workloads. The L4 has significantly lower memory (24GB) and compute performance, but is more power-efficient. The NVIDIA A10 is available from $0.16/hr. A slightly less expensive option that still provides good performance for visualization and some AI tasks, although with less memory (24GB) and lower overall compute power.

For detailed head-to-head analysis, see: a10 vs l40s, a100 pcie 40gb vs l40s, a100 pcie 80gb vs l40s.

L40S Pricing FAQ

How much does L40S cost per hour?â–¾

L40S cloud rental pricing starts at $0.55/hr on TensorDock and goes up to $2.25/hr. Running a L40S continuously for one month at the cheapest rate costs approximately $396. GPUPerHour tracks pricing from 16 providers with prices updated every 60 seconds.

Which is the cheapest provider for L40S?â–¾

The cheapest L40S is available on TensorDock at $0.55/hr, Massed Compute at $0.88/hr, VERDA at $1.37/hr. 18 instances are currently in stock across 16 providers.

What are the alternatives to L40S?â–¾

Alternatives to the L40S include NVIDIA A30 ($0.22/hr), NVIDIA L4 ($0.20/hr), NVIDIA A10 ($0.16/hr). Similar price point, but may offer different performance characteristics depending on the specific workload. The A30 has less memory (24GB) but may have better performance per dollar for certain compute tasks.

NVIDIA L40S Price: $0.47/hr on TensorDock | GPUPerHour