NVIDIA GeForce RTX 4090 vs NVIDIA A100 SXM4 40 GB

Comparative analysis of the NVIDIA GeForce RTX 4090 and NVIDIA A100 SXM4 40 GB video cards across all known characteristics in the following categories: Essentials, Technical info, Video outputs and ports, Compatibility, dimensions and requirements, API support, Memory. Benchmark performance analysis: PassMark - G2D Mark, PassMark - G3D Mark, 3DMark Fire Strike - Graphics Score, CompuBench 1.5 Desktop - Face Detection (mPixels/s), CompuBench 1.5 Desktop - T-Rex (Frames/s), CompuBench 1.5 Desktop - Video Composition (Frames/s), CompuBench 1.5 Desktop - Bitcoin Mining (mHash/s), Geekbench - OpenCL, CompuBench 1.5 Desktop - Ocean Surface Simulation (Frames/s), GFXBench 4.0 - Car Chase Offscreen (Frames), GFXBench 4.0 - Car Chase Offscreen (Fps), GFXBench 4.0 - Manhattan (Frames), GFXBench 4.0 - Manhattan (Fps), GFXBench 4.0 - T-Rex (Frames), GFXBench 4.0 - T-Rex (Fps).

 

Differences

Reasons to consider the NVIDIA GeForce RTX 4090

  • Newer video card: launched 2 years 4 months later
  • Around 2x higher core clock speed: 2235 MHz vs 1095 MHz
  • Around 79% higher boost clock speed: 2520 MHz vs 1410 MHz
  • Around 2.4x more pipelines: 16384 vs 6912
  • Newer manufacturing process: 4 nm vs 7 nm
  • Around 8% higher memory clock speed: 1313 MHz (21 Gbps effective) vs 1215 MHz (2.4 Gbps effective)
  • Around 58% better performance in Geekbench - OpenCL: 317521 vs 200556
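
The ratios and percentages in this list can be reproduced from the raw figures. The following is a minimal sketch, not the site's actual method; the helper names `ratio`, `percent_higher` and `percent_lower` are hypothetical and introduced only for illustration.

```python
def ratio(a: float, b: float) -> float:
    """How many times larger a is than b."""
    return a / b


def percent_higher(a: float, b: float) -> float:
    """By what percentage a exceeds b, relative to b."""
    return (a / b - 1.0) * 100.0


def percent_lower(a: float, b: float) -> float:
    """By what percentage a falls below b, relative to b."""
    return (1.0 - a / b) * 100.0


# (RTX 4090, A100 SXM4 40 GB) pairs taken from the comparison.
specs = {
    "Core clock speed (MHz)": (2235, 1095),
    "Boost clock speed (MHz)": (2520, 1410),
    "Pipelines": (16384, 6912),
    "Memory clock speed (MHz)": (1313, 1215),
    "Geekbench - OpenCL": (317521, 200556),
}

for name, (rtx_4090, a100) in specs.items():
    print(
        f"{name}: {ratio(rtx_4090, a100):.2f}x, "
        f"{percent_higher(rtx_4090, a100):.0f}% higher"
    )
```

Note that "percent higher" and "percent lower" use different baselines, so 450 Watt is 12.5% higher than 400 Watt, while 400 Watt is about 11% lower than 450 Watt.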

Reasons to consider the NVIDIA A100 SXM4 40 GB

  • Around 11% lower typical power consumption: 400 Watt vs 450 Watt
  • Around 67% higher maximum memory size: 40 GB vs 24 GB

Compare benchmarks

Benchmark | NVIDIA GeForce RTX 4090 | NVIDIA A100 SXM4 40 GB
PassMark - G2D Mark | 1294 |
PassMark - G3D Mark | 38529 |
3DMark Fire Strike - Graphics Score | 36466 |
CompuBench 1.5 Desktop - Face Detection (mPixels/s) | 461.456 |
CompuBench 1.5 Desktop - T-Rex (Frames/s) | 93.23 |
CompuBench 1.5 Desktop - Video Composition (Frames/s) | 200.733 |
CompuBench 1.5 Desktop - Bitcoin Mining (mHash/s) | 4413.025 |
Geekbench - OpenCL | 317521 | 200556
CompuBench 1.5 Desktop - Ocean Surface Simulation (Frames/s) | 0 |
GFXBench 4.0 - Car Chase Offscreen (Frames) | 21006 |
GFXBench 4.0 - Car Chase Offscreen (Fps) | 21006 |
GFXBench 4.0 - Manhattan (Frames) | 27823 |
GFXBench 4.0 - Manhattan (Fps) | 27823 |
GFXBench 4.0 - T-Rex (Frames) | 51880 |
GFXBench 4.0 - T-Rex (Fps) | 51880 |

Compare specifications (specs)

Specification | NVIDIA GeForce RTX 4090 | NVIDIA A100 SXM4 40 GB

Essentials

Architecture | Ada Lovelace | Ampere
Code name | AD102 | GA100
Launch date | 20 Sep 2022 | 14 May 2020
Place in performance rating | 10 | 9

Technical info

Boost clock speed | 2520 MHz | 1410 MHz
Core clock speed | 2235 MHz | 1095 MHz
Manufacturing process technology | 4 nm | 7 nm
Peak Double Precision (FP64) Performance | 1,290 GFLOPS (1:64) | 9.746 TFLOPS (1:2)
Peak Half Precision (FP16) Performance | 82.58 TFLOPS (1:1) | 77.97 TFLOPS (4:1)
Peak Single Precision (FP32) Performance | 82.58 TFLOPS | 19.49 TFLOPS
Pipelines | 16384 | 6912
Pixel fill rate | 443.5 GPixel/s | 225.6 GPixel/s
Texture fill rate | 1,290 GTexel/s | 609.1 GTexel/s
Thermal Design Power (TDP) | 450 Watt | 400 Watt
Transistor count | 76300 million | 54200 million
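
The peak throughput figures in this section are consistent with the pipeline counts and boost clocks listed alongside them. The sketch below assumes that each pipeline retires one fused multiply-add (2 floating-point operations) per clock at boost frequency, and that the ratios in parentheses, such as (1:64), give the rate of that precision relative to FP32. The function names are hypothetical and only illustrate the arithmetic.

```python
def peak_fp32_tflops(pipelines: int, boost_mhz: float) -> float:
    """Peak FP32 throughput in TFLOPS.

    Assumes 2 FLOPs (one fused multiply-add) per pipeline per clock.
    """
    flops_per_second = 2 * pipelines * boost_mhz * 1e6
    return flops_per_second / 1e12


def scaled_tflops(fp32_tflops: float, rate_vs_fp32: float) -> float:
    """Scale FP32 throughput by a precision ratio, e.g. 1/64 or 4."""
    return fp32_tflops * rate_vs_fp32


rtx_4090_fp32 = peak_fp32_tflops(pipelines=16384, boost_mhz=2520)
a100_fp32 = peak_fp32_tflops(pipelines=6912, boost_mhz=1410)

print(f"RTX 4090 FP32: {rtx_4090_fp32:.2f} TFLOPS")
print(f"A100 FP32:     {a100_fp32:.2f} TFLOPS")
print(f"RTX 4090 FP64 (1:64): {scaled_tflops(rtx_4090_fp32, 1 / 64):.3f} TFLOPS")
print(f"A100 FP64 (1:2):      {scaled_tflops(a100_fp32, 1 / 2):.3f} TFLOPS")
print(f"A100 FP16 (4:1):      {scaled_tflops(a100_fp32, 4):.2f} TFLOPS")
```

Under these assumptions the RTX 4090 leads by a wide margin in FP32, while the A100's 1:2 FP64 ratio gives it roughly 7.6 times the double-precision throughput.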

Video outputs and ports

Display Connectors | 1x HDMI 2.1, 3x DisplayPort 1.4a | No outputs

Compatibility, dimensions and requirements

Form factor | Triple-slot | SXM4 module
Height | 61 mm, 2.4 inches |
Interface | PCIe 4.0 x16 | PCIe 4.0 x16
Length | 304 mm, 12 inches |
Recommended system power (PSU) | 850 Watt | 800 Watt
Supplementary power connectors | 1x 16-pin | None
Width | 137 mm, 5.4 inches |

API support

DirectX | 12 Ultimate (12_2) |
OpenCL | 3.0 | 3.0
OpenGL | 4.6 |
Shader Model | 6.7 |
Vulkan | |

Memory

Maximum RAM amount | 24 GB | 40 GB
Memory bandwidth | 1,008 GB/s | 1,555 GB/s
Memory bus width | 384 bit | 5120 bit
Memory clock speed | 1313 MHz (21 Gbps effective) | 1215 MHz (2.4 Gbps effective)
Memory type | GDDR6X | HBM2e
High bandwidth memory (HBM) | |
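
The memory bandwidth figures follow from the bus width and the effective per-pin data rate. The sketch below is an illustration under two stated assumptions: that bandwidth equals bus width times effective data rate divided by 8 bits per byte, and that the A100's HBM memory transfers data twice per memory clock, which gives 2.43 Gbps (listed as 2.4 Gbps after rounding). The function name `memory_bandwidth_gbs` is hypothetical.

```python
def memory_bandwidth_gbs(bus_width_bits: int, data_rate_gbps: float) -> float:
    """Peak memory bandwidth in GB/s.

    bus_width_bits: width of the memory interface in bits.
    data_rate_gbps: effective data rate per pin in Gbit/s.
    """
    return bus_width_bits * data_rate_gbps / 8


# RTX 4090: 384-bit bus, 21 Gbps effective (as listed).
rtx_4090_bw = memory_bandwidth_gbs(384, 21.0)

# A100 SXM4 40 GB: 5120-bit bus; assumed double data rate on a 1215 MHz clock.
a100_rate_gbps = 2 * 1215 / 1000  # 2.43 Gbps per pin
a100_bw = memory_bandwidth_gbs(5120, a100_rate_gbps)

print(f"RTX 4090: {rtx_4090_bw:.0f} GB/s")
print(f"A100 SXM4 40 GB: {a100_bw:.0f} GB/s")
```

The comparison shows why the A100 has higher bandwidth despite a far lower per-pin rate: its 5120-bit interface is more than 13 times wider than the RTX 4090's 384-bit bus.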