NVIDIA A100 SXM4 40 GB vs NVIDIA GeForce RTX 3080 Ti

A comparison of the NVIDIA A100 SXM4 40 GB and NVIDIA GeForce RTX 3080 Ti video cards across all known characteristics in the following categories: Essentials, Technical info, Video outputs and ports, Compatibility, dimensions and requirements, API support, Memory. Benchmark results are compared for: Geekbench - OpenCL, GFXBench 4.0 - Car Chase Offscreen (Frames), GFXBench 4.0 - Car Chase Offscreen (Fps), GFXBench 4.0 - Manhattan (Frames), GFXBench 4.0 - Manhattan (Fps), GFXBench 4.0 - T-Rex (Frames), GFXBench 4.0 - T-Rex (Fps), PassMark - G3D Mark, PassMark - G2D Mark, CompuBench 1.5 Desktop - Face Detection (mPixels/s), CompuBench 1.5 Desktop - Ocean Surface Simulation (Frames/s), CompuBench 1.5 Desktop - T-Rex (Frames/s), CompuBench 1.5 Desktop - Video Composition (Frames/s), CompuBench 1.5 Desktop - Bitcoin Mining (mHash/s), 3DMark Fire Strike - Graphics Score.

 

Differences

Reasons to consider the NVIDIA A100 SXM4 40 GB

  • Around 14% higher texture fill rate: 609.1 GTexel/s vs 532.8 GTexel/s
  • A smaller manufacturing process node: 7 nm vs 8 nm
  • 3.3x more maximum memory size: 40 GB vs 12 GB
  • Around 2% higher memory clock speed: 1215 MHz (2.4 Gbps effective) vs 1188 MHz (19 Gbps effective)
  • 7.5x better performance in GFXBench 4.0 - Manhattan (Frames): 27823 vs 3718
  • 7.5x better performance in GFXBench 4.0 - Manhattan (Fps): 27823 vs 3718
  • 15.4x better performance in GFXBench 4.0 - T-Rex (Frames): 51880 vs 3359
  • 15.4x better performance in GFXBench 4.0 - T-Rex (Fps): 51880 vs 3359
Specifications (specs)
Texture fill rate 609.1 GTexel/s vs 532.8 GTexel/s
Manufacturing process technology 7 nm vs 8 nm
Maximum memory size 40 GB vs 12 GB
Memory clock speed 1215 MHz (2.4 Gbps effective) vs 1188 MHz (19 Gbps effective)
Benchmarks
GFXBench 4.0 - Manhattan (Frames) 27823 vs 3718
GFXBench 4.0 - Manhattan (Fps) 27823 vs 3718
GFXBench 4.0 - T-Rex (Frames) 51880 vs 3359
GFXBench 4.0 - T-Rex (Fps) 51880 vs 3359

Reasons to consider the NVIDIA GeForce RTX 3080 Ti

  • Video card is newer: launched about 1 year later
  • Around 25% higher core clock speed: 1365 MHz vs 1095 MHz
  • Around 18% higher boost clock speed: 1665 MHz vs 1410 MHz
  • Around 48% more pipelines: 10240 vs 6912
  • Around 12.5% lower thermal design power (TDP): 350 Watt vs 400 Watt
  • Around 4% better performance in Geekbench - OpenCL: 209081 vs 200738
  • Around 66% better performance in GFXBench 4.0 - Car Chase Offscreen (Frames): 34770 vs 21006
  • Around 66% better performance in GFXBench 4.0 - Car Chase Offscreen (Fps): 34770 vs 21006
Specifications (specs)
Launch date 31 May 2021 vs 14 May 2020
Core clock speed 1365 MHz vs 1095 MHz
Boost clock speed 1665 MHz vs 1410 MHz
Pipelines 10240 vs 6912
Thermal Design Power (TDP) 350 Watt vs 400 Watt
Benchmarks
Geekbench - OpenCL 209081 vs 200738
GFXBench 4.0 - Car Chase Offscreen (Frames) 34770 vs 21006
GFXBench 4.0 - Car Chase Offscreen (Fps) 34770 vs 21006
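
The "Around N%" and "N.Nx" figures in the two lists above appear to be simple ratios of the larger value to the smaller one. The following is a minimal sketch of that arithmetic, not the site's actual code; the function names and the 2x threshold for switching from a percentage to a multiplier are assumptions made for illustration.

```python
def describe_advantage(higher: float, lower: float) -> str:
    """Summarise how much larger `higher` is than `lower`.

    Assumption: ratios of 2 or more are shown as a multiplier,
    smaller ratios as a rounded percentage.
    """
    ratio = higher / lower
    if ratio >= 2:
        return f"{ratio:.1f}x"
    return f"Around {round((ratio - 1) * 100)}%"


def percent_lower(lower: float, higher: float) -> float:
    """Reduction of `lower` relative to `higher`, in percent."""
    return (1 - lower / higher) * 100


# Values taken from the lists above.
texture = describe_advantage(609.1, 532.8)    # texture fill rate, GTexel/s
manhattan = describe_advantage(27823, 3718)   # GFXBench 4.0 - Manhattan
pipelines = describe_advantage(10240, 6912)   # pipelines
tdp_drop = percent_lower(350, 400)            # TDP, Watt

print(texture, manhattan, pipelines, tdp_drop)
```

For a "lower is better" figure such as TDP the direction of the comparison matters: 350 Watt is 12.5% below 400 Watt, while 400 Watt is about 14% above 350 Watt.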

Compare benchmarks

Name NVIDIA A100 SXM4 40 GB NVIDIA GeForce RTX 3080 Ti
Geekbench - OpenCL 200738 209081
GFXBench 4.0 - Car Chase Offscreen (Frames) 21006 34770
GFXBench 4.0 - Car Chase Offscreen (Fps) 21006 34770
GFXBench 4.0 - Manhattan (Frames) 27823 3718
GFXBench 4.0 - Manhattan (Fps) 27823 3718
GFXBench 4.0 - T-Rex (Frames) 51880 3359
GFXBench 4.0 - T-Rex (Fps) 51880 3359
PassMark - G3D Mark 26935
PassMark - G2D Mark 1088
CompuBench 1.5 Desktop - Face Detection (mPixels/s) 485.179
CompuBench 1.5 Desktop - Ocean Surface Simulation (Frames/s) 661.977
CompuBench 1.5 Desktop - T-Rex (Frames/s) 60.472
CompuBench 1.5 Desktop - Video Composition (Frames/s) 129.036
CompuBench 1.5 Desktop - Bitcoin Mining (mHash/s) 2441.001
3DMark Fire Strike - Graphics Score 5077

Compare specifications (specs)

NVIDIA A100 SXM4 40 GB NVIDIA GeForce RTX 3080 Ti

Essentials

Architecture Ampere Ampere
Code name GA100 GA102
Launch date 14 May 2020 31 May 2021
Place in performance rating 7 60
Launch price (MSRP) $1199
Type Desktop

Technical info

Boost clock speed 1410 MHz 1665 MHz
Core clock speed 1095 MHz 1365 MHz
Manufacturing process technology 7 nm 8 nm
Peak Double Precision (FP64) Performance 9.746 TFLOPS (1:2) 532.8 GFLOPS (1:64)
Peak Half Precision (FP16) Performance 77.97 TFLOPS (4:1) 34.10 TFLOPS (1:1)
Peak Single Precision (FP32) Performance 19.49 TFLOPS 34.10 TFLOPS
Pipelines 6912 10240
Pixel fill rate 225.6 GPixel/s 186.5 GPixel/s
Texture fill rate 609.1 GTexel/s 532.8 GTexel/s
Thermal Design Power (TDP) 400 Watt 350 Watt
Transistor count 54200 million 28300 million
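
The peak throughput figures above are consistent with the usual estimate of two floating-point operations (one fused multiply-add) per pipeline per clock at boost speed. The sketch below reproduces them under that assumption; the function name is hypothetical, and the FP64 and FP16 ratios are the ones listed in parentheses in the table.

```python
def peak_fp32_tflops(pipelines: int, boost_mhz: float) -> float:
    """Theoretical FP32 peak in TFLOPS.

    Assumes each pipeline retires one fused multiply-add
    (2 floating-point operations) per clock at boost speed.
    """
    return 2 * pipelines * boost_mhz * 1e6 / 1e12


a100_fp32 = peak_fp32_tflops(6912, 1410)    # about 19.49 TFLOPS
rtx_fp32 = peak_fp32_tflops(10240, 1665)    # about 34.10 TFLOPS

# Other precisions follow from the ratios given in the table.
a100_fp64 = a100_fp32 / 2     # 1:2  -> about 9.746 TFLOPS
rtx_fp64 = rtx_fp32 / 64      # 1:64 -> about 0.5328 TFLOPS (532.8 GFLOPS)
a100_fp16 = a100_fp32 * 4     # 4:1  -> about 77.97 TFLOPS

print(a100_fp32, rtx_fp32, a100_fp64, rtx_fp64, a100_fp16)
```

These are theoretical ceilings derived from clock speed and unit count, so they will not necessarily match the measured benchmark results.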

Video outputs and ports

Display Connectors No outputs 1x HDMI 2.1, 3x DisplayPort 1.4a

Compatibility, dimensions and requirements

Form factor SXM4 module Dual-slot
Interface PCIe 4.0 x16 PCIe 4.0 x16
Recommended system power (PSU) 800 Watt 750 Watt
Supplementary power connectors None 1x 12-pin
Height 40 mm, 1.6 inches
Length 285 mm, 11.2 inches
Width 112 mm, 4.4 inches

API support

OpenCL 3.0 3.0
DirectX 12 Ultimate (12_2)
OpenGL 4.6
Shader Model 6.7
Vulkan

Memory

High bandwidth memory (HBM)
Maximum RAM amount 40 GB 12 GB
Memory bandwidth 1555 GB/s 912.4 GB/s
Memory bus width 5120 bit 384 bit
Memory clock speed 1215 MHz (2.4 Gbps effective) 1188 MHz (19 Gbps effective)
Memory type HBM2e GDDR6X
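
The memory bandwidth figures above can be reproduced from the bus width and the effective per-pin data rate. This is a minimal sketch with a hypothetical function name; the transfers-per-clock multipliers (2 for HBM2e, 16 for GDDR6X) are assumptions inferred from the listed clock and effective-rate pairs, not values stated in the table.

```python
def bandwidth_gb_s(bus_width_bits: int, memory_clock_mhz: float,
                   transfers_per_clock: int) -> float:
    """Theoretical memory bandwidth in GB/s.

    effective rate (Gbps per pin) = clock (MHz) * transfers per clock / 1000
    bandwidth (GB/s)              = bus width (bits) / 8 * effective rate
    """
    effective_gbps = memory_clock_mhz * transfers_per_clock / 1000
    return bus_width_bits / 8 * effective_gbps


# Assumed multipliers: 1215 MHz * 2 = 2.43 Gbps, 1188 MHz * 16 = 19.008 Gbps.
a100_bw = bandwidth_gb_s(5120, 1215, 2)     # HBM2e, about 1555 GB/s
rtx_bw = bandwidth_gb_s(384, 1188, 16)      # GDDR6X, about 912.4 GB/s

print(a100_bw, rtx_bw)
```

The comparison shows why the A100 has the higher bandwidth despite a much lower per-pin rate: its 5120-bit bus is more than 13 times wider than the 384-bit bus of the RTX 3080 Ti.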