NVIDIA A100 SXM4 40 GB vs NVIDIA RTX A5000

Comparative analysis of the NVIDIA A100 SXM4 40 GB and NVIDIA RTX A5000 video cards across all known characteristics in the following categories: Essentials, Technical info, Video outputs and ports, Compatibility, dimensions and requirements, API support, and Memory. Benchmark results are listed for Geekbench - OpenCL, PassMark - G2D Mark, and PassMark - G3D Mark.

 

Differences

Reasons to consider the NVIDIA A100 SXM4 40 GB

  • Around 40% higher texture fill rate: 609.1 GTexel/s vs 433.9 GTexel/s
  • A more advanced manufacturing process: 7 nm vs 8 nm
  • Around 67% higher maximum memory size: 40 GB vs 24 GB

Reasons to consider the NVIDIA RTX A5000

  • Newer video card: launched about 11 months later (12 Apr 2021 vs 14 May 2020)
  • Around 7% higher core clock speed: 1170 MHz vs 1095 MHz
  • Around 20% higher boost clock speed: 1695 MHz vs 1410 MHz
  • Around 19% more pipelines: 8192 vs 6912
  • Around 43% lower typical power consumption: 230 Watt vs 400 Watt
  • Around 65% higher memory clock speed: 2000 MHz (16 Gbps effective) vs 1215 MHz (2.4 Gbps effective)
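The percentage figures in the two lists above can be re-derived from the raw values. The following is a minimal sketch; the helper names `percent_higher` and `percent_lower` are illustrative and not part of any benchmark tool. A "higher" figure is measured relative to the smaller value and a "lower" figure relative to the larger one, so 230 Watt is 42.5% lower than 400 Watt, while 400 Watt is about 74% higher than 230 Watt.

```python
def percent_higher(larger: float, smaller: float) -> float:
    """How much `larger` exceeds `smaller`, as a percentage of `smaller`."""
    return (larger - smaller) / smaller * 100.0


def percent_lower(smaller: float, larger: float) -> float:
    """How much `smaller` falls below `larger`, as a percentage of `larger`."""
    return (larger - smaller) / larger * 100.0


# Values copied from the comparison lists above.
texture_fill = percent_higher(609.1, 433.9)   # A100 vs RTX A5000, GTexel/s
memory_size = percent_higher(40, 24)          # A100 vs RTX A5000, GB
boost_clock = percent_higher(1695, 1410)      # RTX A5000 vs A100, MHz
pipelines = percent_higher(8192, 6912)        # RTX A5000 vs A100
tdp_lower = percent_lower(230, 400)           # RTX A5000 vs A100, Watt
tdp_higher = percent_higher(400, 230)         # A100 vs RTX A5000, Watt

if __name__ == "__main__":
    print(f"Texture fill rate: {texture_fill:.1f}% higher")
    print(f"Memory size:       {memory_size:.1f}% higher")
    print(f"Boost clock:       {boost_clock:.1f}% higher")
    print(f"Pipelines:         {pipelines:.1f}% more")
    print(f"TDP:               {tdp_lower:.1f}% lower")
```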

Compare benchmarks

GPU 1: NVIDIA A100 SXM4 40 GB
GPU 2: NVIDIA RTX A5000

Name NVIDIA A100 SXM4 40 GB NVIDIA RTX A5000
Geekbench - OpenCL 191749
PassMark - G2D Mark 884
PassMark - G3D Mark 24150

Compare specifications (specs)

NVIDIA A100 SXM4 40 GB NVIDIA RTX A5000

Essentials

Architecture Ampere Ampere
Code name GA100 GA102
Launch date 14 May 2020 12 Apr 2021
Place in performance rating 3 1

Technical info

Boost clock speed 1410 MHz 1695 MHz
Core clock speed 1095 MHz 1170 MHz
Manufacturing process technology 7 nm 8 nm
Peak Double Precision (FP64) Performance 9.746 TFLOPS (1:2) 867.8 GFLOPS (1:32)
Peak Half Precision (FP16) Performance 77.97 TFLOPS (4:1) 27.77 TFLOPS (1:1)
Peak Single Precision (FP32) Performance 19.49 TFLOPS 27.77 TFLOPS
Pipelines 6912 8192
Pixel fill rate 225.6 GPixel/s 162.7 GPixel/s
Texture fill rate 609.1 GTexel/s 433.9 GTexel/s
Thermal Design Power (TDP) 400 Watt 230 Watt
Transistor count 54200 million 28300 million
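The peak throughput figures in this table follow from the pipeline count and boost clock. The sketch below assumes the usual convention of 2 floating-point operations per pipeline per clock (one fused multiply-add) and applies the FP64 and FP16 ratios shown in parentheses in the table. The function name `peak_fp32_tflops` is illustrative only.

```python
def peak_fp32_tflops(pipelines: int, boost_mhz: float) -> float:
    """Theoretical peak FP32 rate in TFLOPS.

    Assumes 2 FLOPs per pipeline per clock (one fused multiply-add).
    """
    return 2 * pipelines * boost_mhz * 1e6 / 1e12


# Pipelines and boost clocks taken from the table above.
a100_fp32 = peak_fp32_tflops(6912, 1410)
a5000_fp32 = peak_fp32_tflops(8192, 1695)

# Ratios relative to FP32, as listed in the table.
a100_fp64 = a100_fp32 / 2      # 1:2
a5000_fp64 = a5000_fp32 / 32   # 1:32
a100_fp16 = a100_fp32 * 4      # 4:1
a5000_fp16 = a5000_fp32 * 1    # 1:1

if __name__ == "__main__":
    print(f"A100  FP32 {a100_fp32:.2f}  FP64 {a100_fp64:.3f}  FP16 {a100_fp16:.2f} TFLOPS")
    print(f"A5000 FP32 {a5000_fp32:.2f}  FP64 {a5000_fp64:.4f}  FP16 {a5000_fp16:.2f} TFLOPS")
```

These are theoretical peaks. Real workloads depend on memory bandwidth, occupancy, and sustained clocks, so measured results will usually be lower.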

Video outputs and ports

Display Connectors No outputs 4x DisplayPort

Compatibility, dimensions and requirements

Form factor SXM4 module Dual-slot
Interface PCIe 4.0 x16 PCIe 4.0 x16
Recommended system power (PSU) 800 Watt 550 Watt
Supplementary power connectors None 1x 8-pin
Length 267 mm (10.5 inches)
Width 112 mm (4.4 inches)

API support

OpenCL 3.0 3.0
DirectX 12.2
OpenGL 4.6
Shader Model 6.6
Vulkan

Memory

High bandwidth memory (HBM)
Maximum RAM amount 40 GB 24 GB
Memory bandwidth 1555 GB/s 768 GB/s
Memory bus width 5120 bit 384 bit
Memory clock speed 1215 MHz (2.4 Gbps effective) 2000 MHz (16 Gbps effective)
Memory type HBM2e GDDR6
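The memory bandwidth figures can be cross-checked from the bus width and the effective per-pin data rate. This is a rough sketch: the function name `bandwidth_gb_s` is illustrative, and the A100 rate of 2.43 Gbps is an assumption derived from the listed 1215 MHz memory clock at double data rate (the table rounds it to 2.4 Gbps).

```python
def bandwidth_gb_s(bus_width_bits: int, effective_gbps: float) -> float:
    """Peak memory bandwidth in GB/s: bytes per transfer times transfers per second."""
    return bus_width_bits / 8 * effective_gbps


# A100: 1215 MHz at double data rate gives 2.43 Gbps per pin (assumption, see above).
a100_rate_gbps = 1215 * 2 / 1000
a100_bw = bandwidth_gb_s(5120, a100_rate_gbps)

# RTX A5000: 16 Gbps effective, as listed in the table.
a5000_bw = bandwidth_gb_s(384, 16.0)

if __name__ == "__main__":
    print(f"A100 SXM4 40 GB: {a100_bw:.1f} GB/s")
    print(f"RTX A5000:       {a5000_bw:.1f} GB/s")
```

The A100's much wider bus more than offsets its lower memory clock, which is why it has roughly twice the bandwidth despite the RTX A5000's higher memory clock speed.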