How much VRAM does the L4 have?

The L4 has 24 GB of GDDR6 memory.

When was the L4 released?

NVIDIA announced the L4 in March 2023. It is based on the Ada Lovelace architecture and ships as a PCIe card.

How much does it cost to rent the L4?

The L4 rents for as little as $0.39/hr on the cheapest marketplace, with a listing-weighted median of $0.86/hr across 7 marketplace partners. Prices are updated daily.

Is the L4 good for AI training or inference?

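The L4 delivers 121 FP16 TFLOPS (dense, no sparsity) with 24 GB of VRAM and 300 GB/s of memory bandwidth. It is designed primarily for inference (edge AI, video analytics, and dense, power-efficient inference servers) rather than large-model training.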

NVIDIA · Ada Lovelace · 2023

L4
AIMC Specifications

Complete technical reference: architecture, memory, performance, and live rental pricing.

Memory: 24 GB GDDR6
Form Factor: PCIe (Datacenter)
FP16 Compute: 121 TFLOPS (dense)

Live Rental Pricing

Current market pricing across all authorized partners, updated daily.

Cheapest: $0.39/hr
Typical (median): $0.86/hr
Marketplaces: 7
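
To put the hourly rates in context, the sketch below converts them into daily and monthly costs. It assumes continuous use at a flat rate and a 30-day month. The function name `rental_cost` is illustrative rather than part of any marketplace API, and real bills may add storage, data transfer, or minimum-commitment charges.

```python
# Hourly rates quoted in this section (USD per GPU-hour).
CHEAPEST_PER_HOUR = 0.39
MEDIAN_PER_HOUR = 0.86


def rental_cost(rate_per_hour: float, hours: float, gpus: int = 1) -> float:
    """Return the rental cost in USD, rounded to cents.

    Assumes a flat hourly rate with no extra fees (a simplification).
    """
    return round(rate_per_hour * hours * gpus, 2)


if __name__ == "__main__":
    hours_per_month = 24 * 30  # assumption: 30-day month, always on
    rates = [("cheapest", CHEAPEST_PER_HOUR), ("median", MEDIAN_PER_HOUR)]
    for label, rate in rates:
        per_day = rental_cost(rate, 24)
        per_month = rental_cost(rate, hours_per_month)
        print(f"{label}: ${per_day:.2f}/day, ${per_month:.2f}/month")
```

Under these assumptions, one always-on L4 costs roughly $281 per month at the cheapest rate and roughly $619 per month at the median rate.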

Full Specifications

Factual specifications from manufacturer datasheets.

Manufacturer	NVIDIA
Architecture	Ada Lovelace
Memory Capacity	24 GB
Memory Type	GDDR6
Form Factor	PCIe
Release Year	2023
GPU Class	Datacenter
FP16 TFLOPS (dense)	121
VRAM (compute)	24 GB

Architecture & Use Cases

Technical overview of the L4.

The NVIDIA L4 is a compact accelerator based on the Ada Lovelace architecture, optimized for AI inference at the edge and in mainstream servers. NVIDIA announced it in March 2023. It provides strong inference performance in a low-power, single-slot form factor designed for dense deployment and power-constrained environments.

The L4 uses the AD104 die with 24 GB of GDDR6 memory providing 300 GB/s of bandwidth. It includes 7,424 CUDA cores, 232 fourth-generation Tensor Cores, and 58 third-generation RT cores. The chip is manufactured on TSMC's custom 4N process, the same node used across the Ada Lovelace family.
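
The ratio of peak compute to memory bandwidth gives a rough sense of which workloads the L4 can keep busy. The sketch below applies a simple roofline model to the 300 GB/s bandwidth and the 121 dense FP16 TFLOPS listed in the specifications. The function names are illustrative, and the model ignores caches, clocks, and kernel efficiency, so treat the results as upper bounds rather than measurements.

```python
PEAK_FP16_FLOPS = 121e12  # dense FP16 Tensor Core peak, FLOP/s
MEM_BANDWIDTH = 300e9     # GDDR6 bandwidth, bytes/s


def ridge_point() -> float:
    """Arithmetic intensity (FLOP/byte) where compute and bandwidth limits meet."""
    return PEAK_FP16_FLOPS / MEM_BANDWIDTH


def attainable_tflops(intensity: float) -> float:
    """Roofline upper bound in TFLOPS for a kernel with the given FLOP/byte."""
    return min(PEAK_FP16_FLOPS, intensity * MEM_BANDWIDTH) / 1e12


if __name__ == "__main__":
    print(f"ridge point: {ridge_point():.0f} FLOP/byte")
    for intensity in (2, 50, 1000):
        bound = attainable_tflops(intensity)
        print(f"{intensity} FLOP/byte: at most {bound:.1f} TFLOPS")
```

The ridge point works out to about 403 FLOP/byte. Kernels below that intensity are limited by memory bandwidth in this model, not by the Tensor Cores.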

A key feature is the low 72 W TDP in a low-profile, single-slot PCIe Gen4 x16 form factor. Because 72 W is within the 75 W that a PCIe x16 slot can supply, the card needs no auxiliary power connector. This enables deployment in standard servers and in edge computing environments with thermal constraints. Up to 8 L4 GPUs can fit in a standard 4U server.
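
The density figures can be turned into a quick per-server budget. The sketch below multiplies the per-card numbers from this page by the 8-GPU maximum. `GpuSpec` and `server_totals` are hypothetical names introduced for illustration, and the power total covers GPU TDP only, not CPUs, fans, storage, or power-supply losses.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class GpuSpec:
    tdp_watts: float
    vram_gb: int
    fp16_tflops: float  # dense, no sparsity


# Per-card figures taken from the specifications on this page.
L4 = GpuSpec(tdp_watts=72, vram_gb=24, fp16_tflops=121)


def server_totals(gpu: GpuSpec, count: int) -> dict:
    """Aggregate GPU-only power, memory, and peak compute for one server."""
    return {
        "gpu_power_watts": gpu.tdp_watts * count,
        "vram_gb": gpu.vram_gb * count,
        "fp16_tflops": gpu.fp16_tflops * count,
    }


if __name__ == "__main__":
    totals = server_totals(L4, count=8)
    print(totals)
    print(f"{L4.fp16_tflops / L4.tdp_watts:.2f} peak FP16 TFLOPS per watt")
```

For a fully populated 8-GPU server this gives 576 W of GPU power, 192 GB of total VRAM, and 968 peak dense FP16 TFLOPS, or about 1.68 TFLOPS per watt per card.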

Fourth-generation Tensor Cores support FP8, FP16, BF16, TF32, and INT8 operations optimized for inference. Hardware video capabilities include 8th-generation NVENC with AV1 encode and 5th-generation NVDEC. Primary deployment scenarios include edge AI inference, video analytics, and dense inference servers where power efficiency is critical.
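
Precision choice largely determines which models fit in the L4's 24 GB. The sketch below estimates the memory needed for model weights alone at each supported precision. The 20% headroom default and the 7-billion and 13-billion parameter counts are assumptions chosen for illustration. Real deployments also need room for the KV cache, activations, and framework overhead.

```python
# Storage size per parameter, in bytes. TF32 is a Tensor Core math mode;
# tensors are still stored in memory as 32-bit floats.
BYTES_PER_PARAM = {"FP8": 1, "INT8": 1, "FP16": 2, "BF16": 2, "TF32": 4}
VRAM_GB = 24  # L4 memory capacity


def weights_gb(num_params: float, precision: str) -> float:
    """Memory for model weights only, in GB (1 GB = 1e9 bytes)."""
    return num_params * BYTES_PER_PARAM[precision] / 1e9


def fits_in_vram(num_params: float, precision: str, headroom: float = 0.2) -> bool:
    """True if the weights fit while leaving `headroom` of VRAM free.

    The 20% default is an assumed rule of thumb, not a measured value.
    """
    return weights_gb(num_params, precision) <= VRAM_GB * (1 - headroom)


if __name__ == "__main__":
    for params in (7e9, 13e9):  # hypothetical model sizes
        for precision in ("FP16", "FP8"):
            size = weights_gb(params, precision)
            ok = fits_in_vram(params, precision)
            print(f"{params / 1e9:.0f}B {precision}: {size:.0f} GB, fits={ok}")
```

Under these assumptions, a 7-billion-parameter model fits at FP16 (14 GB of weights), while a 13-billion-parameter model needs FP8 or INT8 (13 GB) because its FP16 weights alone take 26 GB.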