NVIDIA A100 SXM4 40 GB vs NVIDIA TITAN RTX

Comparative analysis of the NVIDIA A100 SXM4 40 GB and NVIDIA TITAN RTX graphics cards across all known characteristics in the following categories: Essentials, Technical info, Video outputs and ports, Compatibility, dimensions and requirements, API support, Memory. Benchmark performance analysis covers: Geekbench - OpenCL, GFXBench 4.0 - Car Chase Offscreen (Frames), GFXBench 4.0 - Car Chase Offscreen (Fps), GFXBench 4.0 - Manhattan (Frames), GFXBench 4.0 - Manhattan (Fps), GFXBench 4.0 - T-Rex (Frames), GFXBench 4.0 - T-Rex (Fps), PassMark - G3D Mark, PassMark - G2D Mark, 3DMark Fire Strike - Graphics Score.

Differences

Reasons to consider the NVIDIA A100 SXM4 40 GB

  • Newer graphics card: launched about 1 year and 4 months later
  • 50% more pipelines: 6912 vs 4608
  • Newer manufacturing process: 7 nm vs 12 nm
  • Around 45% better performance in Geekbench - OpenCL: 200738 vs 138072
  • 7.5x better performance in GFXBench 4.0 - Manhattan (Frames): 27823 vs 3707
  • 7.5x better performance in GFXBench 4.0 - Manhattan (Fps): 27823 vs 3707
  • 15.5x better performance in GFXBench 4.0 - T-Rex (Frames): 51880 vs 3353
  • 15.5x better performance in GFXBench 4.0 - T-Rex (Fps): 51880 vs 3353

Reasons to consider the NVIDIA TITAN RTX

  • Around 23% higher core clock speed: 1350 MHz vs 1095 MHz
  • Around 26% higher boost clock speed: 1770 MHz vs 1410 MHz
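  • Around 30% lower typical power consumption: 280 Watt vs 400 Watt
  • 11.5x higher listed memory clock speed: 14000 MHz vs 1215 MHz (2.4 Gbps effective); the TITAN RTX figure is an effective GDDR6 data rate, so the per-pin gap is closer to 5.8x (14 Gbps vs 2.4 Gbps)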
  • Around 23% better performance in GFXBench 4.0 - Car Chase Offscreen (Frames): 25820 vs 21006
  • Around 23% better performance in GFXBench 4.0 - Car Chase Offscreen (Fps): 25820 vs 21006
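The "Around X% higher" and "N x better" figures in the lists above are simple ratios of the two cards' values. A minimal sketch of that arithmetic follows; the function names are illustrative and not taken from any benchmark tool. It also shows why the direction of a comparison matters: 280 Watt is 30% lower than 400 Watt, while 400 Watt is about 43% higher than 280 Watt.

```python
def percent_higher(a: float, b: float) -> float:
    """Return how much larger a is than b, as a percentage of b."""
    return (a - b) / b * 100.0


def percent_lower(a: float, b: float) -> float:
    """Return how much smaller a is than b, as a percentage of b."""
    return (b - a) / b * 100.0


def times_higher(a: float, b: float) -> float:
    """Return the plain ratio a / b, used for the 'N x better' figures."""
    return a / b


# Values copied from the comparison lists above.
print(f"Pipelines:         {percent_higher(6912, 4608):.0f}% higher")
print(f"Geekbench OpenCL:  {percent_higher(200738, 138072):.0f}% higher")
print(f"Manhattan:         {times_higher(27823, 3707):.1f}x")
print(f"TDP, 280 vs 400 W: {percent_lower(280, 400):.0f}% lower")
print(f"TDP, 400 vs 280 W: {percent_higher(400, 280):.0f}% higher")
```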

Compare benchmarks

Benchmark | NVIDIA A100 SXM4 40 GB | NVIDIA TITAN RTX
Geekbench - OpenCL | 200738 | 138072
GFXBench 4.0 - Car Chase Offscreen (Frames) | 21006 | 25820
GFXBench 4.0 - Car Chase Offscreen (Fps) | 21006 | 25820
GFXBench 4.0 - Manhattan (Frames) | 27823 | 3707
GFXBench 4.0 - Manhattan (Fps) | 27823 | 3707
GFXBench 4.0 - T-Rex (Frames) | 51880 | 3353
GFXBench 4.0 - T-Rex (Fps) | 51880 | 3353
PassMark - G3D Mark | - | 20362
PassMark - G2D Mark | - | 840
3DMark Fire Strike - Graphics Score | - | 3794

Compare specifications (specs)

NVIDIA A100 SXM4 40 GB vs NVIDIA TITAN RTX

Essentials

Specification | NVIDIA A100 SXM4 40 GB | NVIDIA TITAN RTX
Architecture | Ampere | Turing
Code name | GA100 | TU102
Launch date | 14 May 2020 | 18 December 2018
Place in performance rating | 7 | 109
Launch price (MSRP) | - | $2,499
Type | - | Desktop

Technical info

Specification | NVIDIA A100 SXM4 40 GB | NVIDIA TITAN RTX
Boost clock speed | 1410 MHz | 1770 MHz
Core clock speed | 1095 MHz | 1350 MHz
Manufacturing process technology | 7 nm | 12 nm
Peak Double Precision (FP64) Performance | 9.746 TFLOPS (1:2) | -
Peak Half Precision (FP16) Performance | 77.97 TFLOPS (4:1) | -
Peak Single Precision (FP32) Performance | 19.49 TFLOPS | -
Pipelines | 6912 | 4608
Pixel fill rate | 225.6 GPixel/s | -
Texture fill rate | 609.1 GTexel/s | -
Thermal Design Power (TDP) | 400 Watt | 280 Watt
Transistor count | 54,200 million | 18,600 million
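The peak FP32 figure listed for the A100 can be reproduced from the pipeline count and boost clock in this section. The sketch below assumes each pipeline retires one fused multiply-add (2 floating-point operations) per clock, which is the usual convention for theoretical peak numbers. The function name is illustrative. The TITAN RTX result is derived here from its listed pipelines and boost clock and is not a value given in the source data.

```python
def peak_fp32_tflops(pipelines: int, boost_mhz: float) -> float:
    """Theoretical peak FP32 throughput in TFLOPS.

    Assumes 2 floating-point operations (one fused multiply-add)
    per pipeline per clock cycle.
    """
    flops_per_second = 2 * pipelines * boost_mhz * 1e6
    return flops_per_second / 1e12


a100_fp32 = peak_fp32_tflops(6912, 1410)
titan_fp32 = peak_fp32_tflops(4608, 1770)  # derived, not listed in the source

print(f"A100 FP32:      {a100_fp32:.2f} TFLOPS")
print(f"A100 FP64 1:2:  {a100_fp32 / 2:.3f} TFLOPS")
print(f"A100 FP16 4:1:  {a100_fp32 * 4:.2f} TFLOPS")
print(f"TITAN RTX FP32: {titan_fp32:.2f} TFLOPS (derived)")
```

The A100 results match the listed 19.49, 9.746 and 77.97 TFLOPS, which also confirms that the FP64 (1:2) and FP16 (4:1) ratios are relative to FP32.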

Video outputs and ports

Specification | NVIDIA A100 SXM4 40 GB | NVIDIA TITAN RTX
Display Connectors | No outputs | 1x HDMI, 3x DisplayPort, 1x USB Type-C
DisplayPort count | - | 3
DisplayPort support | - | Yes
HDMI | - | Yes

Compatibility, dimensions and requirements

Specification | NVIDIA A100 SXM4 40 GB | NVIDIA TITAN RTX
Form factor | IGP | -
Interface | PCIe 4.0 x16 | PCIe 3.0 x16
Recommended system power (PSU) | 800 Watt | -
Supplementary power connectors | None | 2x 8-pin
Length | - | 267 mm

API support

Specification | NVIDIA A100 SXM4 40 GB | NVIDIA TITAN RTX
OpenCL | 3.0 | -
DirectX | - | 12.0
OpenGL | - | 4.6

Memory

Specification | NVIDIA A100 SXM4 40 GB | NVIDIA TITAN RTX
High bandwidth memory (HBM) | Yes | -
Maximum RAM amount | 40 GB | -
Memory bandwidth | 1555 GB/s | -
Memory bus width | 5120 bit | 384 bit
Memory clock speed | 1215 MHz (2.4 Gbps effective) | 14000 MHz
Memory type | HBM2e | GDDR6
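Memory clock speed alone says little about throughput, because bandwidth also depends on bus width. A rough sketch of the usual estimate (bus width in bits times per-pin data rate, divided by 8 bits per byte) is shown below. It assumes the A100's HBM2e transfers twice per 1215 MHz clock (2.43 Gbps per pin) and that the TITAN RTX's listed 14000 MHz is an effective rate of 14 Gbps per pin. The TITAN RTX result is derived and is not a figure listed in the source data.

```python
def memory_bandwidth_gbs(bus_width_bits: int, data_rate_gbps: float) -> float:
    """Estimate memory bandwidth in GB/s.

    bus_width_bits: width of the memory interface in bits
    data_rate_gbps: effective per-pin data rate in Gbit/s
    """
    return bus_width_bits * data_rate_gbps / 8  # 8 bits per byte


# A100: 1215 MHz clock, assumed double data rate -> 2.43 Gbps per pin.
a100_bw = memory_bandwidth_gbs(5120, 2 * 1215 / 1000)

# TITAN RTX: listed 14000 MHz treated as 14 Gbps effective (assumption).
titan_bw = memory_bandwidth_gbs(384, 14.0)

print(f"A100 SXM4 40 GB: {a100_bw:.0f} GB/s")
print(f"TITAN RTX:       {titan_bw:.0f} GB/s (derived)")
```

Under these assumptions the A100 estimate agrees with the listed 1555 GB/s, and the much wider 5120-bit bus more than offsets the lower per-pin rate.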