NVIDIA A100 SXM4 40 GB vs NVIDIA GeForce RTX 3090

Comparative analysis of NVIDIA A100 SXM4 40 GB and NVIDIA GeForce RTX 3090 videocards for all known characteristics in the following categories: Essentials, Technical info, Video outputs and ports, Compatibility, dimensions and requirements, API support, Memory. Benchmark videocards performance analysis: Geekbench - OpenCL, GFXBench 4.0 - Car Chase Offscreen (Frames), GFXBench 4.0 - Car Chase Offscreen (Fps), GFXBench 4.0 - Manhattan (Frames), GFXBench 4.0 - Manhattan (Fps), GFXBench 4.0 - T-Rex (Frames), GFXBench 4.0 - T-Rex (Fps), PassMark - G3D Mark, PassMark - G2D Mark, CompuBench 1.5 Desktop - Face Detection (mPixels/s), CompuBench 1.5 Desktop - Ocean Surface Simulation (Frames/s), CompuBench 1.5 Desktop - T-Rex (Frames/s), CompuBench 1.5 Desktop - Video Composition (Frames/s), CompuBench 1.5 Desktop - Bitcoin Mining (mHash/s), 3DMark Fire Strike - Graphics Score.

 

Differences

Reasons to consider the NVIDIA A100 SXM4 40 GB

  • Around 10% higher texture fill rate: 609.1 GTexel/s vs 556.0 GTexel/s
  • A newer manufacturing process allows for a more powerful, yet cooler running videocard: 7 nm vs 8 nm
  • Around 67% higher maximum memory size: 40 GB vs 24 GB
  • 7.5x better performance in GFXBench 4.0 - Manhattan (Frames): 27823 vs 3713
  • 7.5x better performance in GFXBench 4.0 - Manhattan (Fps): 27823 vs 3713
  • 15.5x better performance in GFXBench 4.0 - T-Rex (Frames): 51880 vs 3354
  • 15.5x better performance in GFXBench 4.0 - T-Rex (Fps): 51880 vs 3354
Specifications (specs)
Texture fill rate 609.1 GTexel/s vs 556.0 GTexel/s
Manufacturing process technology 7 nm vs 8 nm
Maximum memory size 40 GB vs 24 GB
Benchmarks
GFXBench 4.0 - Manhattan (Frames) 27823 vs 3713
GFXBench 4.0 - Manhattan (Fps) 27823 vs 3713
GFXBench 4.0 - T-Rex (Frames) 51880 vs 3354
GFXBench 4.0 - T-Rex (Fps) 51880 vs 3354

Reasons to consider the NVIDIA GeForce RTX 3090

  • Videocard is newer: launch date 3 month(s) later
  • Around 27% higher core clock speed: 1395 MHz vs 1095 MHz
  • Around 20% higher boost clock speed: 1695 MHz vs 1410 MHz
  • Around 52% higher pipelines: 10496 vs 6912
  • Around 14% lower typical power consumption: 350 Watt vs 400 Watt
  • Around 7% better performance in Geekbench - OpenCL: 205239 vs 191749
  • Around 59% better performance in GFXBench 4.0 - Car Chase Offscreen (Frames): 33341 vs 21006
  • Around 59% better performance in GFXBench 4.0 - Car Chase Offscreen (Fps): 33341 vs 21006
Specifications (specs)
Launch date 1 Sep 2020 vs 14 May 2020
Core clock speed 1395 MHz vs 1095 MHz
Boost clock speed 1695 MHz vs 1410 MHz
Pipelines 10496 vs 6912
Thermal Design Power (TDP) 350 Watt vs 400 Watt
Memory clock speed 1219 MHz (19.5 Gbps effective) vs 1215 MHz (2.4 Gbps effective)
Benchmarks
Geekbench - OpenCL 205239 vs 191749
GFXBench 4.0 - Car Chase Offscreen (Frames) 33341 vs 21006
GFXBench 4.0 - Car Chase Offscreen (Fps) 33341 vs 21006

Compare benchmarks

GPU 1: NVIDIA A100 SXM4 40 GB
GPU 2: NVIDIA GeForce RTX 3090

Geekbench - OpenCL
GPU 1
GPU 2
191749
205239
GFXBench 4.0 - Car Chase Offscreen (Frames)
GPU 1
GPU 2
21006
33341
GFXBench 4.0 - Car Chase Offscreen (Fps)
GPU 1
GPU 2
21006
33341
GFXBench 4.0 - Manhattan (Frames)
GPU 1
GPU 2
27823
3713
GFXBench 4.0 - Manhattan (Fps)
GPU 1
GPU 2
27823
3713
GFXBench 4.0 - T-Rex (Frames)
GPU 1
GPU 2
51880
3354
GFXBench 4.0 - T-Rex (Fps)
GPU 1
GPU 2
51880
3354
Name NVIDIA A100 SXM4 40 GB NVIDIA GeForce RTX 3090
Geekbench - OpenCL 191749 205239
GFXBench 4.0 - Car Chase Offscreen (Frames) 21006 33341
GFXBench 4.0 - Car Chase Offscreen (Fps) 21006 33341
GFXBench 4.0 - Manhattan (Frames) 27823 3713
GFXBench 4.0 - Manhattan (Fps) 27823 3713
GFXBench 4.0 - T-Rex (Frames) 51880 3354
GFXBench 4.0 - T-Rex (Fps) 51880 3354
PassMark - G3D Mark 25975
PassMark - G2D Mark 990
CompuBench 1.5 Desktop - Face Detection (mPixels/s) 737.298
CompuBench 1.5 Desktop - Ocean Surface Simulation (Frames/s) 7585.258
CompuBench 1.5 Desktop - T-Rex (Frames/s) 66.951
CompuBench 1.5 Desktop - Video Composition (Frames/s) 309.051
CompuBench 1.5 Desktop - Bitcoin Mining (mHash/s) 2451.491
3DMark Fire Strike - Graphics Score 19970

Compare specifications (specs)

NVIDIA A100 SXM4 40 GB NVIDIA GeForce RTX 3090

Essentials

Architecture Ampere Ampere
Code name GA100 GA102
Launch date 14 May 2020 1 Sep 2020
Place in performance rating 1 5
Launch price (MSRP) $1499
Type Desktop

Technical info

Boost clock speed 1410 MHz 1695 MHz
Core clock speed 1095 MHz 1395 MHz
Manufacturing process technology 7 nm 8 nm
Peak Double Precision (FP64) Performance 9.746 TFLOPS (1:2) 556.0 GFLOPS (1:64)
Peak Half Precision (FP16) Performance 77.97 TFLOPS (4:1) 35.58 TFLOPS (1:1)
Peak Single Precision (FP32) Performance 19.49 TFLOPS 35.58 TFLOPS
Pipelines 6912 10496
Pixel fill rate 225.6 GPixel/s 189.8 GPixel/s
Texture fill rate 609.1 GTexel/s 556.0 GTexel/s
Thermal Design Power (TDP) 400 Watt 350 Watt
Transistor count 54200 million 28300 million

Video outputs and ports

Display Connectors No outputs 1x HDMI, 3x DisplayPort

Compatibility, dimensions and requirements

Form factor IGP
Interface PCIe 4.0 x16 PCIe 4.0 x16
Recommended system power (PSU) 800 Watt 750 Watt
Supplementary power connectors None 1x 12-pin
Height 138 mm (5.4 inches)
Length 313 mm (12.3 inches)
Width Triple-slot

API support

OpenCL 3.0 2.0
DirectX 12.2
OpenGL 4.6
Shader Model 6.5
Vulkan

Memory

High bandwidth memory (HBM)
Maximum RAM amount 40 GB 24 GB
Memory bandwidth 1555 GB/s 936.2 GB/s
Memory bus width 5120 bit 384 bit
Memory clock speed 1215 MHz (2.4 Gbps effective) 1219 MHz (19.5 Gbps effective)
Memory type HBM2e GDDR6X