NVIDIA A40

Name: NVIDIA A40
Brand: NVIDIA
Availability: InStock

10,752

CUDA Cores

48GB

VRAM

696

GB/s

Data Center

Technical Specifications

10,752

CUDA Cores

1305

Base MHz

1740

Boost MHz

48GB GDDR6

384-bit bus

Performance

37.4

FP32 TFLOPS

74.8

FP16 TFLOPS

300W

TDP

Cloud Availability

3

Available Instances

$0.44/hr

Starting Price

Detailed Specifications

Architecture	Ampere (Unknown)
Release Date	2020-10-05
Launch Price	$6,000.00
Process	8nm
Transistors	28.3B

AI Features

Gen 3

Tensor Cores

Disabled

Transformer Engine

Not Supported

Flash Attention

Physical Specifications

Dimensions

10.5in

Length

4.4in

Width

2-slot

Height

About A40 GPU

The NVIDIA A40 is a powerful GPU designed for AI/ML workloads, offering exceptional performance for both training and inference tasks. With 48GB of VRAM and 10,752 CUDA cores, it provides the memory capacity and computational power needed for modern deep learning models.

Released in 2020, the A40 features Ampere architecture with advanced AI accelerators including Tensor Cores and Transformer Engine support. This makes it ideal for large language models, computer vision tasks, and generative AI applications.

Learn more: GPU Cloud Provider Comparison →

When considering cloud rental options for the A40, pricing starts at $0.44/hour from various providers. This GPU offers excellent price-to-performance for AI training workloads, with its high memory bandwidth of 696 GB/s enabling fast data transfer for large datasets.

The A40 features CUDA compute capability 8.6 and is compatible with all major deep learning frameworks including PyTorch, TensorFlow, and JAX. Its 8nm manufacturing process ensures efficient power consumption relative to performance output.

Learn more: GPU Cost Calculator →

Rent A40 from Our Partners

Get started quickly with these trusted GPU cloud providers. We may earn a commission when you sign up.

Thunder Compute

Starting from $0.44/hr

Per-second billing, great for testing

Sign Up & Get $10 →

RunPod

Starting from $0.44/hr

Serverless with fast cold starts

Start on RunPod →

Vast.ai

Starting from $0.44/hr

Lowest prices on the market

Browse Vast.ai →

Related GPUs

A6000

48GB VRAM • NVIDIA

RTX 6000 Ada

48GB VRAM • NVIDIA

L40S

48GB VRAM • NVIDIA

L40

48GB VRAM • NVIDIA

Related Tools

GPU Comparison

Compare specs across all GPUs

Cost Estimator

Calculate rental costs

Model Size Calculator

Check GPU memory requirements

Performance Calculator

Estimate training/inference speed

External Resources

Learn more about GPUs from these authoritative sources:

NVIDIA CUDA Documentation →

Official CUDA programming guide

NVIDIA GPU Specifications →

Official NVIDIA GPU specs

TechPowerUp GPU Database →

Comprehensive GPU specifications

CUDA Compute Capability Guide →

GPU compute capability reference

What You Need to Know About the A40

Complete Specifications for the NVIDIA A40

Get detailed technical specifications for the NVIDIA A40 including VRAM capacity of , CUDA core count, Tensor Core count, memory bandwidth, and CUDA compute capability of . This GPU is designed for demanding AI training, inference, and high-performance computing workloads. Understanding these specifications helps you determine whether it is the right fit for PyTorch, TensorFlow, or custom CUDA-based applications.

Compare NVIDIA A40 Cloud Rental Prices per Hour

Find the best cloud rental prices for the NVIDIA A40 across providers like RunPod, Vast.ai, Lambda Labs, and CoreWeave. GPUvec aggregates real-time pricing data so you can compare costs per hour, find available instances, and choose the most cost-effective provider. GPU cloud pricing for this model varies by region and instance type, so comparing multiple options can save significantly on compute costs.

Is the NVIDIA A40 the Right GPU for Your AI Workload?

Learn whether the NVIDIA A40 is the right choice for your specific AI and ML workloads. We cover use cases including large language model training, fine-tuning, inference serving, computer vision, scientific computing, and rendering. Compare its specifications and pricing against other GPUs like the H100, A100, and RTX 5090 to make an informed decision for your infrastructure needs.

Top GPUs for Training and Inference

Category	Rank 1	Rank 2	Rank 3
Best for Training	NVIDIA H200	NVIDIA H100	NVIDIA B200
Best for Inference	NVIDIA A40	NVIDIA A100	NVIDIA A10

Compare GPU specifications and cloud instances to find the best GPU for your workload.