How much VRAM does the H100 PCIe have?

The H100 PCIe has 80 GB of HBM2e memory.

When was the H100 PCIe released?

The H100 PCIe was released in 2022 by NVIDIA, based on the Hopper architecture in the PCIe form factor.

How much does it cost to rent the H100 PCIe?

The H100 PCIe rents for $1.00/hr at the cheapest marketplace, with a typical listing-weighted median of $3.19/hr across 9 marketplace partners. Updated daily.

Is the H100 PCIe good for AI training or inference?

The H100 PCIe delivers 756 FP16 TFLOPS (dense, no sparsity) with 80 GB of VRAM. Suited for large-model training and high-throughput inference.

Home/GPU Prices/H100 PCIe/Specifications

NVIDIA · Hopper · 2022

H100 PCIe
AIMC Specifications

Name: H100 PCIe
Brand: NVIDIA
Availability: InStock

Complete technical reference: architecture, memory, performance, and live rental pricing.

Memory

80 GB

HBM2e

Form Factor

PCIe

Datacenter

FP16 Compute

756

TFLOPS (dense)

Open Cost Calculator

Live Rental Pricing

Current market pricing across all authorized partners, updated daily.

Cheapest

$1.00/hr

Typical (median)

$3.19/hr

Marketplaces

9

See full marketplace breakdown for H100 PCIe

Full Specifications

Factual specifications from manufacturer datasheets.

Manufacturer	NVIDIA
Architecture	Hopper
Memory Capacity	80 GB
Memory Type	HBM2e
Form Factor	PCIe
Release Year	2022
GPU Class	Datacenter
FP16 TFLOPS (dense)	756
VRAM (compute)	80 GB

Architecture & Use Cases

Technical overview of the H100 PCIe.

The NVIDIA H100 PCIe is the PCIe form factor variant of the Hopper architecture flagship, announced at GTC March 2022 alongside the H100 SXM. It brings Hopper capabilities to standard server infrastructure without requiring specialized SXM baseboards, making it accessible to a broader range of enterprise deployments.

The H100 PCIe uses a cut-down version of the GH100 die with 80GB of HBM2e memory (not HBM3) providing 2 TB/s of bandwidth. While lower than the SXM variant's 3.35 TB/s, this is still a significant upgrade from A100 PCIe. The chip includes 14,592 CUDA cores and 456 Tensor Cores, slightly reduced from the SXM variant.

The dual-slot PCIe Gen5 x16 form factor is compatible with standard server platforms without specialized baseboards. TDP is 350W, exactly half the SXM variant, enabling air-cooled deployments. For multi-GPU configurations, it supports NVLink bridge connecting two cards with 600 GB/s bandwidth, or PCIe peer-to-peer communication.

The H100 PCIe includes all Hopper architectural features: the Transformer Engine with FP8 precision, 4th-generation Tensor Cores, DPX instructions, and MIG support for up to 7 instances. It's particularly suited for inference workloads, smaller-scale training, and deployments where SXM infrastructure is not available or practical.