NVIDIA A100 SXM4 40 GB vs NVIDIA Tesla T4

Comparative analysis of NVIDIA A100 SXM4 40 GB and NVIDIA Tesla T4 videocards for all known characteristics in the following categories: Essentials, Technical info, Video outputs and ports, Compatibility, dimensions and requirements, API support, Memory. Benchmark videocards performance analysis: Geekbench - OpenCL, GFXBench 4.0 - Car Chase Offscreen (Frames), GFXBench 4.0 - Car Chase Offscreen (Fps), GFXBench 4.0 - Manhattan (Frames), GFXBench 4.0 - Manhattan (Fps), GFXBench 4.0 - T-Rex (Frames), GFXBench 4.0 - T-Rex (Fps), PassMark - G3D Mark, PassMark - G2D Mark, CompuBench 1.5 Desktop - Face Detection (mPixels/s), CompuBench 1.5 Desktop - Ocean Surface Simulation (Frames/s), CompuBench 1.5 Desktop - T-Rex (Frames/s), CompuBench 1.5 Desktop - Video Composition (Frames/s), CompuBench 1.5 Desktop - Bitcoin Mining (mHash/s).

 

Differences

Reasons to consider the NVIDIA A100 SXM4 40 GB

  • Videocard is newer: launch date 1 year(s) 8 month(s) later
  • Around 9% higher core clock speed: 1095 MHz vs 1005 MHz
  • A newer manufacturing process allows for a more powerful, yet cooler running videocard: 7 nm vs 12 nm
  • 3.3x better performance in Geekbench - OpenCL: 200534 vs 61276
  • Around 49% better performance in GFXBench 4.0 - Car Chase Offscreen (Frames): 21006 vs 14076
  • Around 49% better performance in GFXBench 4.0 - Car Chase Offscreen (Fps): 21006 vs 14076
  • 14.1x better performance in GFXBench 4.0 - Manhattan (Frames): 27823 vs 1976
  • 14.1x better performance in GFXBench 4.0 - Manhattan (Fps): 27823 vs 1976
  • 29.1x better performance in GFXBench 4.0 - T-Rex (Frames): 51880 vs 1781
  • 29.1x better performance in GFXBench 4.0 - T-Rex (Fps): 51880 vs 1781
Specifications (specs)
Launch date 14 May 2020 vs 13 September 2018
Core clock speed 1095 MHz vs 1005 MHz
Manufacturing process technology 7 nm vs 12 nm
Benchmarks
Geekbench - OpenCL 200534 vs 61276
GFXBench 4.0 - Car Chase Offscreen (Frames) 21006 vs 14076
GFXBench 4.0 - Car Chase Offscreen (Fps) 21006 vs 14076
GFXBench 4.0 - Manhattan (Frames) 27823 vs 1976
GFXBench 4.0 - Manhattan (Fps) 27823 vs 1976
GFXBench 4.0 - T-Rex (Frames) 51880 vs 1781
GFXBench 4.0 - T-Rex (Fps) 51880 vs 1781

Reasons to consider the NVIDIA Tesla T4

  • Around 7% higher boost clock speed: 1515 MHz vs 1410 MHz
  • 5.3x lower typical power consumption: 75 Watt vs 400 Watt
  • 8.2x more memory clock speed: 10000 MHz vs 1215 MHz (2.4 Gbps effective)
Boost clock speed 1515 MHz vs 1410 MHz
Thermal Design Power (TDP) 75 Watt vs 400 Watt
Memory clock speed 10000 MHz vs 1215 MHz (2.4 Gbps effective)

Compare benchmarks

GPU 1: NVIDIA A100 SXM4 40 GB
GPU 2: NVIDIA Tesla T4

Geekbench - OpenCL
GPU 1
GPU 2
200534
61276
GFXBench 4.0 - Car Chase Offscreen (Frames)
GPU 1
GPU 2
21006
14076
GFXBench 4.0 - Car Chase Offscreen (Fps)
GPU 1
GPU 2
21006
14076
GFXBench 4.0 - Manhattan (Frames)
GPU 1
GPU 2
27823
1976
GFXBench 4.0 - Manhattan (Fps)
GPU 1
GPU 2
27823
1976
GFXBench 4.0 - T-Rex (Frames)
GPU 1
GPU 2
51880
1781
GFXBench 4.0 - T-Rex (Fps)
GPU 1
GPU 2
51880
1781
Name NVIDIA A100 SXM4 40 GB NVIDIA Tesla T4
Geekbench - OpenCL 200534 61276
GFXBench 4.0 - Car Chase Offscreen (Frames) 21006 14076
GFXBench 4.0 - Car Chase Offscreen (Fps) 21006 14076
GFXBench 4.0 - Manhattan (Frames) 27823 1976
GFXBench 4.0 - Manhattan (Fps) 27823 1976
GFXBench 4.0 - T-Rex (Frames) 51880 1781
GFXBench 4.0 - T-Rex (Fps) 51880 1781
PassMark - G3D Mark 10744
PassMark - G2D Mark 590
CompuBench 1.5 Desktop - Face Detection (mPixels/s) 127.622
CompuBench 1.5 Desktop - Ocean Surface Simulation (Frames/s) 3026.812
CompuBench 1.5 Desktop - T-Rex (Frames/s) 18.798
CompuBench 1.5 Desktop - Video Composition (Frames/s) 119.936
CompuBench 1.5 Desktop - Bitcoin Mining (mHash/s) 910.721

Compare specifications (specs)

NVIDIA A100 SXM4 40 GB NVIDIA Tesla T4

Essentials

Architecture Ampere Turing
Code name GA100 TU104
Launch date 14 May 2020 13 September 2018
Place in performance rating 10 278
Type Workstation

Technical info

Boost clock speed 1410 MHz 1515 MHz
Core clock speed 1095 MHz 1005 MHz
Manufacturing process technology 7 nm 12 nm
Peak Double Precision (FP64) Performance 9.746 TFLOPS (1:2)
Peak Half Precision (FP16) Performance 77.97 TFLOPS (4:1)
Peak Single Precision (FP32) Performance 19.49 TFLOPS
Pipelines 6912
Pixel fill rate 225.6 GPixel/s
Texture fill rate 609.1 GTexel/s
Thermal Design Power (TDP) 400 Watt 75 Watt
Transistor count 54200 million 13,600 million

Video outputs and ports

Display Connectors No outputs No outputs

Compatibility, dimensions and requirements

Form factor IGP
Interface PCIe 4.0 x16 PCIe 3.0 x16
Recommended system power (PSU) 800 Watt
Supplementary power connectors None None

API support

OpenCL 3.0
DirectX 12.0 (12_1)
OpenGL 4.6

Memory

High bandwidth memory (HBM)
Maximum RAM amount 40 GB
Memory bandwidth 1555 GB/s
Memory bus width 5120 bit
Memory clock speed 1215 MHz (2.4 Gbps effective) 10000 MHz
Memory type HBM2e