Can Vast.ai's H200 NVL handle LLM Inference?

Yes. The H200 NVL on Vast.ai scores 100/100 for LLM Inference — excellent fit. It meets the 12 GB VRAM minimum with 141 GB available.

How much does Vast.ai charge for the H200 NVL?

The H200 NVL on Vast.ai has a listing-weighted median of $18.612/hr across 9 observed listings. Across 4 marketplaces tracking this GPU, prices vary; see alternatives below.

Where else can I rent the H200 NVL?

The H200 NVL is also available at RunPod ($2.145/hr), Massed Compute ($2.830/hr), and AIME ($4.870/hr). In total, 4 marketplaces track this GPU.

AIMC Fit Analysis · AI

H200 NVL on Vast.ai for LLM Inference

Serving large language models for chat, completion, and agentic workloads. AIMC scores this specific combination 100/100 — excellent fit.

Fit Score

100/100

Excellent fit

Hourly Rate

$18.61

9 listings on Vast.ai

VRAM vs Required

141 / 12 GB

11.8× the minimum

Is the H200 NVL on Vast.ai good for LLM Inference?

Excellent fit. AIMC's fit score combines VRAM headroom, GPU class match, and FP16 compute against the workload's requirements — independent of pricing.

Datacenter class is well-suited for LLM Inference
141 GB VRAM provides ample headroom (11.8× the minimum)
989 FP16 TFLOPS substantially exceeds the 50 TFLOPS threshold
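
The three checks listed above can be reproduced with simple arithmetic. The sketch below is illustrative only: the page does not state how AIMC weights these inputs into the 100/100 score, so the code reports the individual checks rather than a score. The names `FitInputs` and `fit_checks` are hypothetical; the numbers are the ones quoted on this page.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FitInputs:
    """Inputs quoted on this page (field names are hypothetical)."""
    vram_gb: float
    min_vram_gb: float
    fp16_tflops: float
    min_fp16_tflops: float
    gpu_class: str
    accepted_classes: tuple


def fit_checks(x: FitInputs) -> dict:
    """Return the three raw checks; this is NOT AIMC's scoring formula."""
    return {
        "vram_headroom": x.vram_gb / x.min_vram_gb,      # ratio of available to required VRAM
        "vram_ok": x.vram_gb >= x.min_vram_gb,
        "compute_ok": x.fp16_tflops >= x.min_fp16_tflops,
        "class_ok": x.gpu_class in x.accepted_classes,
    }


h200_nvl = FitInputs(
    vram_gb=141,
    min_vram_gb=12,
    fp16_tflops=989,
    min_fp16_tflops=50,
    gpu_class="Datacenter",
    accepted_classes=("Datacenter", "Workstation", "Consumer"),
)

checks = fit_checks(h200_nvl)
print(f"VRAM headroom: {checks['vram_headroom']:.2f}x the minimum")
print({k: v for k, v in checks.items() if k != "vram_headroom"})
```

The headroom ratio is 141 / 12 = 11.75, which the page displays rounded as 11.8×.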

What this costs on Vast.ai

The hourly rate shown is the listing-weighted median across 9 observed H200 NVL listings at Vast.ai. The same GPU is tracked at 4 marketplaces in total.

At Vast.ai

$18.61/hr

9 listings

Cheapest Alt

$2.15/hr

RunPod (-88.5%)

Marketplaces

4

tracking this GPU
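
The percentage next to the cheapest alternative can be checked from the rates quoted on this page. This is a minimal sketch, assuming the figure is the relative difference between the alternative's rate and the Vast.ai median; `percent_difference` is a hypothetical helper, not part of any AIMC tool.

```python
def percent_difference(alt_rate: float, base_rate: float) -> float:
    """Relative difference of alt_rate versus base_rate, in percent."""
    return (alt_rate - base_rate) / base_rate * 100.0


# Hourly rates as quoted on this page (USD/hr).
VAST_RATE = 18.612
ALTERNATIVES = {"RunPod": 2.145, "Massed Compute": 2.830, "AIME": 4.870}

# Pick the cheapest alternative and compare it with the Vast.ai median.
cheapest_name, cheapest_rate = min(ALTERNATIVES.items(), key=lambda kv: kv[1])
delta = percent_difference(cheapest_rate, VAST_RATE)

print(f"{cheapest_name}: ${cheapest_rate:.3f}/hr ({delta:+.1f}% vs Vast.ai)")
```

With these inputs the cheapest alternative is RunPod, and the difference works out to about -88.5%, matching the figure shown above.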

Same H200 NVL at other marketplaces

Top 3 alternative providers for the same GPU, sorted by price ascending.

RunPod · $2.145/hr

Massed Compute · $2.830/hr

AIME · $4.870/hr

Other Vast.ai GPUs for LLM Inference

Alternative high-fit options at the same provider, sorted by fit score.

B200 SXM · 100/100

$16.67/hr · Datacenter GPUs

H100 SXM · 100/100

$8.40/hr · Datacenter GPUs

A100 SXM 80GB · 100/100

$4.27/hr · Datacenter GPUs

A100 PCIe 80GB · 100/100

$2.14/hr · Datacenter GPUs

About LLM Inference

Serving large language models for chat, completion, and agentic workloads. LLM Inference requires at least 12 GB VRAM and benefits from Datacenter-, Workstation-, or Consumer-class compute.

Track Vast.ai's H200 NVL pricing

Get alerts when Vast.ai adjusts pricing on the H200 NVL, which is useful for sustained LLM Inference workloads.
