How much VRAM does the L40 have?

The L40 has 48 GB of GDDR6 memory.

When was the L40 released?

The L40 was released in 2022 by NVIDIA, based on the Ada Lovelace architecture in the PCIe form factor.

How much does it cost to rent the L40?

The L40 rents for $0.82/hr at the cheapest marketplace, with a typical listing-weighted median of $0.86/hr across 5 marketplace partners. Updated daily.

Is the L40 good for AI training or inference?

The L40 delivers 181 FP16 TFLOPS (dense, no sparsity) with 48 GB of VRAM. Suited for large-model training and high-throughput inference.

Home/GPU Prices/L40/Specifications

NVIDIA · Ada Lovelace · 2022

L40
AIMC Specifications

Name: L40
Brand: NVIDIA
Availability: InStock

Complete technical reference: architecture, memory, performance, and live rental pricing.

Memory

48 GB

GDDR6

Form Factor

PCIe

Datacenter

FP16 Compute

181

TFLOPS (dense)

Open Cost Calculator

Live Rental Pricing

Current market pricing across all authorized partners, updated daily.

Cheapest

$0.82/hr

Typical (median)

$0.86/hr

Marketplaces

5

See full marketplace breakdown for L40

Full Specifications

Factual specifications from manufacturer datasheets.

Manufacturer	NVIDIA
Architecture	Ada Lovelace
Memory Capacity	48 GB
Memory Type	GDDR6
Form Factor	PCIe
Release Year	2022
GPU Class	Datacenter
FP16 TFLOPS (dense)	181
VRAM (compute)	48 GB

Architecture & Use Cases

Technical overview of the L40.

The NVIDIA L40 is an Ada Lovelace architecture datacenter GPU designed for visual computing, rendering, and AI workloads, announced in September 2022. It represents the successor to the A40, bringing Ada Lovelace architecture improvements to datacenter visualization and inference applications.

The L40 uses the AD102 die with 48GB of GDDR6 ECC memory providing 864 GB/s bandwidth. It includes 18,176 CUDA cores, 568 fourth-generation Tensor Cores, and 142 third-generation RT cores for hardware-accelerated ray tracing. The chip is manufactured on TSMC's custom 4N process.

Unlike the L40S (inference-optimized variant), the L40 emphasizes graphics capabilities including full RT core enablement for professional rendering workloads. The Tensor Cores support standard Ada precision modes (FP16, BF16, TF32, INT8) but not the enhanced FP8 Transformer Engine found in L40S.

The dual-slot PCIe Gen4 x16 form factor has a 300W TDP, enabling air-cooled deployment in standard servers. Hardware video capabilities include 8th-generation NVENC with AV1 encode support and 5th-generation NVDEC. Primary deployment scenarios include professional visualization servers, rendering farms, VDI infrastructure, and cloud gaming platforms.