How much VRAM does the A40 have?

The A40 has 48 GB of GDDR6 memory.

When was the A40 released?

NVIDIA released the A40 in 2020. It is based on the Ampere architecture and ships in a PCIe form factor.

How much does it cost to rent the A40?

The A40 rents for $0.44/hr on the cheapest marketplace, with a listing-weighted median of $0.54/hr across 4 marketplace partners. Prices are updated daily.

Is the A40 good for AI training or inference?

The A40 delivers 150 FP16 TFLOPS (dense, without sparsity) and has 48 GB of VRAM. It is suited to high-throughput inference and to training models that fit within 48 GB.

NVIDIA · Ampere · 2020

A40
AIMC Specifications

Complete technical reference: architecture, memory, performance, and live rental pricing.

Memory	48 GB GDDR6
Form Factor	PCIe (Datacenter)
FP16 Compute	150 TFLOPS (dense)

Live Rental Pricing

Current market pricing across all authorized partners, updated daily.

Cheapest	$0.44/hr
Typical (median)	$0.54/hr
Marketplaces	4
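
As a rough illustration of what these hourly rates mean over a longer rental, the sketch below multiplies hours, rate, and GPU count. The function name `rental_cost` and the 30-day always-on scenario are assumptions made for this example; real invoices may add storage, egress, or minimum-billing charges that are not modeled here.

```python
# Hourly rates quoted on this page (USD per GPU-hour).
CHEAPEST_RATE = 0.44
MEDIAN_RATE = 0.54


def rental_cost(hours: float, rate_per_hour: float, gpus: int = 1) -> float:
    """Return the total rental cost in USD, rounded to cents.

    Hypothetical helper: assumes flat per-hour billing with no extra fees.
    """
    if hours < 0 or gpus < 1:
        raise ValueError("hours must be non-negative and gpus must be at least 1")
    return round(hours * rate_per_hour * gpus, 2)


if __name__ == "__main__":
    hours_in_30_days = 24 * 30  # assumed scenario: one GPU running continuously
    print(rental_cost(hours_in_30_days, CHEAPEST_RATE))  # 316.8
    print(rental_cost(hours_in_30_days, MEDIAN_RATE))    # 388.8
```

Under these assumptions, the gap between the cheapest and median listing is about $72 per GPU over 30 days of continuous use.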

Full Specifications

Factual specifications from manufacturer datasheets.

Manufacturer	NVIDIA
Architecture	Ampere
Memory Capacity	48 GB
Memory Type	GDDR6
Form Factor	PCIe
Release Year	2020
GPU Class	Datacenter
FP16 TFLOPS (dense)	150
VRAM (compute)	48 GB

Architecture & Use Cases

Technical overview of the A40.

The NVIDIA A40 is an Ampere architecture datacenter GPU optimized for visual computing, rendering, and AI inference workloads. Announced in October 2020, it uses the fully enabled GA102 die (shared with the RTX A6000; the consumer RTX 3090 uses a partially disabled GA102), configured for datacenter deployment with enterprise features.

The A40 features 48 GB of GDDR6 ECC memory with 696 GB/s bandwidth, providing substantial capacity for large visualization datasets and AI models. The GA102 die includes 10,752 CUDA cores, 336 Tensor Cores, and 84 RT (Ray Tracing) cores, enabling hardware-accelerated ray tracing for professional rendering.
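
To make the capacity figure concrete, here is a minimal sketch that estimates whether a model's weights alone fit in 48 GB. The bytes-per-parameter table, the function names, and the example model sizes are assumptions for illustration. The estimate treats 1 GB as 10^9 bytes and ignores activations, KV cache, optimizer state, and framework overhead, so real headroom is smaller.

```python
A40_VRAM_GB = 48  # memory capacity from the specifications on this page

# Assumed storage cost per parameter for common numeric formats.
BYTES_PER_PARAM = {"fp32": 4, "fp16": 2, "int8": 1}


def weight_footprint_gb(num_params: float, dtype: str = "fp16") -> float:
    """Approximate memory for model weights alone, in GB (10**9 bytes)."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


def weights_fit(num_params: float, dtype: str = "fp16",
                vram_gb: float = A40_VRAM_GB) -> bool:
    """True if the weights alone are smaller than the card's VRAM."""
    return weight_footprint_gb(num_params, dtype) < vram_gb


if __name__ == "__main__":
    # Hypothetical model sizes, chosen only to show both outcomes.
    for params, dtype in [(13e9, "fp16"), (30e9, "fp16"), (30e9, "int8")]:
        size = weight_footprint_gb(params, dtype)
        print(f"{params / 1e9:.0f}B params in {dtype}: {size:.0f} GB, "
              f"fits={weights_fit(params, dtype)}")
```

In this simplified view, a 13-billion-parameter model in FP16 needs about 26 GB for weights and fits, while a 30-billion-parameter model needs about 60 GB in FP16 and would only fit after quantization to 8 bits.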

Unlike the HBM-equipped A100, the A40 uses GDDR6 memory, which costs less per GB but delivers lower bandwidth, making the A40 cost-effective for workloads that don't require HBM bandwidth. The dual-slot PCIe Gen4 x16 card has a 300 W TDP and is passively cooled, so it relies on airflow from a properly ventilated server chassis.
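
One way to judge whether a workload "requires HBM bandwidth" is a simple roofline estimate, sketched below using the 150 TFLOPS and 696 GB/s figures from this page. The roofline model is a simplification introduced here, not something the datasheet prescribes, and the function names are hypothetical. It assumes peak numbers are achievable, which real kernels rarely manage.

```python
PEAK_FP16_FLOPS = 150e12     # 150 TFLOPS dense FP16, from this page
MEM_BANDWIDTH_BYTES = 696e9  # 696 GB/s, from this page


def ridge_point(peak_flops: float = PEAK_FP16_FLOPS,
                bandwidth: float = MEM_BANDWIDTH_BYTES) -> float:
    """Arithmetic intensity (FLOPs per byte) where compute and memory limits meet."""
    return peak_flops / bandwidth


def attainable_flops(intensity: float,
                     peak_flops: float = PEAK_FP16_FLOPS,
                     bandwidth: float = MEM_BANDWIDTH_BYTES) -> float:
    """Roofline upper bound: the lower of peak compute and bandwidth * intensity."""
    return min(peak_flops, bandwidth * intensity)


if __name__ == "__main__":
    print(f"ridge point: {ridge_point():.1f} FLOPs/byte")
    # Kernels below the ridge point are limited by memory bandwidth,
    # kernels above it are limited by compute.
    for intensity in (1.0, 50.0, 1000.0):
        bound = attainable_flops(intensity) / 1e12
        print(f"intensity {intensity:>6.1f} FLOPs/byte -> at most {bound:.1f} TFLOPS")
```

Under this model the ridge point is roughly 215 FLOPs per byte. Workloads with lower arithmetic intensity are bounded by the 696 GB/s memory system rather than by the Tensor Cores, and those are the cases where an HBM card would help most.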

The A40 supports NVIDIA Virtual GPU (vGPU) software for virtualized deployments, enabling multiple virtual machines to share a single GPU. Hardware video encode/decode engines (NVENC/NVDEC) support up to 8 simultaneous 4K video streams, making the A40 suitable for video transcoding, streaming, and video analytics applications.