NVIDIA A16

5,120

CUDA Cores

64GB

VRAM

888

GB/s

Data Center
Updated April 21, 2026 • 2026 Edition
A16 GPU Specifications

Technical Specifications

5,120

CUDA Cores

900

Base MHz

1400

Boost MHz

64GB GDDR6

512-bit bus

Performance

15

FP32 TFLOPS

60

FP16 TFLOPS

250W

TDP

Cloud Availability

5

Available Instances

$0.47/hr

Starting Price

Detailed Specifications

Architecture Ampere (Unknown)
Release Date 2020-10-05
Launch Price $2,500.00
Process 8nm
Transistors 34.8B

AI Features

Gen 3

Tensor Cores

Disabled

Transformer Engine

Not Supported

Flash Attention

Physical Specifications

Dimensions

10.5in

Length

4.4in

Width

2-slot

Height

About A16 GPU

The NVIDIA A16 is a powerful GPU designed for AI/ML workloads, offering exceptional performance for both training and inference tasks. With 64GB of VRAM and 5,120 CUDA cores, it provides the memory capacity and computational power needed for modern deep learning models.

Released in 2020, the A16 features Ampere architecture with advanced AI accelerators including Tensor Cores and Transformer Engine support. This makes it ideal for large language models, computer vision tasks, and generative AI applications.

When considering cloud rental options for the A16, pricing starts at $0.47/hour from various providers. This GPU offers excellent price-to-performance for AI training workloads, with its high memory bandwidth of 888 GB/s enabling fast data transfer for large datasets.

The A16 features CUDA compute capability 8.6 and is compatible with all major deep learning frameworks including PyTorch, TensorFlow, and JAX. Its 8nm manufacturing process ensures efficient power consumption relative to performance output.

Rent A16 from Our Partners

Get started quickly with these trusted GPU cloud providers. We may earn a commission when you sign up.

Thunder Compute

Starting from $0.47/hr

Per-second billing, great for testing

Sign Up & Get $10 →

RunPod

Starting from $0.47/hr

Serverless with fast cold starts

Start on RunPod →

Vast.ai

Starting from $0.47/hr

Lowest prices on the market

Browse Vast.ai →

External Resources

Learn more about GPUs from these authoritative sources:

NVIDIA CUDA Documentation →

Official CUDA programming guide

NVIDIA GPU Specifications →

Official NVIDIA GPU specs

TechPowerUp GPU Database →

Comprehensive GPU specifications

CUDA Compute Capability Guide →

GPU compute capability reference

Top GPUs for Training and Inference

Category Rank 1 Rank 2 Rank 3
Best for Training NVIDIA H200 NVIDIA H100 NVIDIA B200
Best for Inference NVIDIA A40 NVIDIA A100 NVIDIA A10

Compare GPU specifications and cloud instances to find the best GPU for your workload.