NVIDIA A40 vs NVIDIA Tesla V100 DGXS

Comparative analysis of the NVIDIA A40 and NVIDIA Tesla V100 DGXS videocards across all known characteristics in the following categories: Essentials, Technical info, Video outputs and ports, Compatibility, dimensions and requirements, API support, Memory. Benchmark performance analysis covers: Geekbench - OpenCL, PassMark - G2D Mark, PassMark - G3D Mark.

 

Differences

Reasons to consider the NVIDIA A40

  • Videocard is newer: launched 2 years 6 months later
  • Around 59% higher core clock speed: 1305 MHz vs 823 MHz
  • Around 90% higher boost clock speed: 1740 MHz vs 918 MHz
  • Around 26% higher texture fill rate: 584.6 GTexel/s vs 463.0 GTexel/s
  • 2.1x more pipelines: 10752 vs 5120
  • A newer, smaller manufacturing process: 8 nm vs 12 nm
  • Around 50% higher maximum memory size: 48 GB vs 32 GB
  • Around 3% higher memory clock speed: 1812 MHz (14.5 Gbps effective) vs 1752 MHz
  • Around 20% better performance in Geekbench - OpenCL: 193429 vs 161295
Specifications (specs)
Launch date 5 Oct 2020 vs 27 March 2018
Core clock speed 1305 MHz vs 823 MHz
Boost clock speed 1740 MHz vs 918 MHz
Texture fill rate 584.6 GTexel/s vs 463.0 GTexel/s
Pipelines 10752 vs 5120
Manufacturing process technology 8 nm vs 12 nm
Maximum memory size 48 GB vs 32 GB
Memory clock speed 1812 MHz (14.5 Gbps effective) vs 1752 MHz
Benchmarks
Geekbench - OpenCL 193429 vs 161295
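
The percentages in the list above follow from the paired values. As a minimal sketch (the `specs` dictionary and the `percent_higher` helper are illustrative names, not part of the original comparison), they can be recomputed like this:

```python
# Spec values copied from the comparison above: (A40, Tesla V100 DGXS).
specs = {
    "Core clock speed (MHz)": (1305, 823),
    "Boost clock speed (MHz)": (1740, 918),
    "Texture fill rate (GTexel/s)": (584.6, 463.0),
    "Pipelines": (10752, 5120),
    "Maximum memory size (GB)": (48, 32),
    "Memory clock speed (MHz)": (1812, 1752),
    "Geekbench - OpenCL": (193429, 161295),
}


def percent_higher(a, b):
    """Return how much larger a is than b, as a percentage of b."""
    return (a / b - 1) * 100


for name, (a40, v100) in specs.items():
    # Print both the percentage difference and the plain ratio.
    print(f"{name}: {percent_higher(a40, v100):.0f}% higher ({a40 / v100:.2f}x)")
```

Printing the plain ratio next to the percentage makes it easier to spot a figure that has been reported in the wrong unit.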

Reasons to consider the NVIDIA Tesla V100 DGXS

  • Around 17% lower typical power consumption: 250 Watt vs 300 Watt
Thermal Design Power (TDP) 250 Watt vs 300 Watt

Compare benchmarks

Name NVIDIA A40 NVIDIA Tesla V100 DGXS
Geekbench - OpenCL 193429 161295
PassMark - G2D Mark 627
PassMark - G3D Mark 14665

Compare specifications (specs)

NVIDIA A40 NVIDIA Tesla V100 DGXS

Essentials

Architecture Ampere Volta
Code name GA102 GV100
Launch date 5 Oct 2020 27 March 2018
Place in performance rating 53 58
Type Workstation

Technical info

Boost clock speed 1740 MHz 918 MHz
Core clock speed 1305 MHz 823 MHz
Manufacturing process technology 8 nm 12 nm
Peak Double Precision (FP64) Performance 1169 GFLOPS (1:32)
Peak Half Precision (FP16) Performance 37.42 TFLOPS (1:1)
Peak Single Precision (FP32) Performance 37.42 TFLOPS
Pipelines 10752 5120
Pixel fill rate 194.9 GPixel/s
Texture fill rate 584.6 GTexel/s 463.0 GTexel/s
Thermal Design Power (TDP) 300 Watt 250 Watt
Transistor count 28,300 million 21,100 million
Floating-point performance 14,817 GFLOPS
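
The A40's peak throughput figures in this table appear to follow from its pipeline count and boost clock. The sketch below assumes 2 floating-point operations per pipeline per clock (one fused multiply-add), which the comparison itself does not state. The function name `peak_fp32_tflops` is hypothetical. Only the A40 values are checked here.

```python
def peak_fp32_tflops(pipelines, boost_clock_mhz, flops_per_clock=2):
    """Peak FP32 TFLOPS = pipelines x FLOPs per clock x boost clock.

    flops_per_clock=2 is an assumption (one fused multiply-add per clock).
    """
    return pipelines * flops_per_clock * boost_clock_mhz * 1e6 / 1e12


# A40 values from the table: 10752 pipelines, 1740 MHz boost clock.
a40_fp32 = peak_fp32_tflops(10752, 1740)

# The table lists FP64 at a 1:32 ratio to FP32.
a40_fp64_gflops = a40_fp32 * 1000 / 32

print(f"FP32: {a40_fp32:.2f} TFLOPS, FP64: {a40_fp64_gflops:.0f} GFLOPS")
```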

Video outputs and ports

Display Connectors 3x DisplayPort No outputs

Compatibility, dimensions and requirements

Form factor Dual-slot
Interface PCIe 4.0 x16 PCIe 3.0 x16
Length 267 mm (10.5 inches)
Recommended system power (PSU) 700 Watt
Supplementary power connectors 8-pin EPS None
Width 112 mm (4.4 inches)

API support

DirectX 12.2 12.0 (12_1)
OpenCL 3.0
OpenGL 4.6 4.6
Shader Model 6.6
Vulkan

Memory

Maximum RAM amount 48 GB 32 GB
Memory bandwidth 695.8 GB/s 897.0 GB/s
Memory bus width 384 bit 4096 bit
Memory clock speed 1812 MHz (14.5 Gbps effective) 1752 MHz
Memory type GDDR6 HBM2
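
The memory bandwidth figures can be related to bus width and memory clock. The sketch below rests on two assumptions not stated in the comparison: the GDDR6 effective per-pin rate is 8 times the listed clock (consistent with the "14.5 Gbps effective" note), and the 1752 MHz listed for HBM2 is already the effective per-pin rate. The function name `bandwidth_gb_s` is hypothetical.

```python
def bandwidth_gb_s(bus_width_bits, data_rate_mbps_per_pin):
    """Bandwidth in GB/s = bus width (bits) x per-pin rate (Mbps) / 8 / 1000."""
    return bus_width_bits * data_rate_mbps_per_pin / 8 / 1000


# A40 (GDDR6): assume 8 transfers per listed clock, 1812 MHz -> 14496 Mbps per pin.
a40_bandwidth = bandwidth_gb_s(384, 1812 * 8)

# Tesla V100 DGXS (HBM2): assume the listed 1752 MHz is the effective rate.
v100_bandwidth = bandwidth_gb_s(4096, 1752)

print(f"A40: {a40_bandwidth:.1f} GB/s, V100 DGXS: {v100_bandwidth:.1f} GB/s")
```

Under these assumptions, the much wider HBM2 bus gives the Tesla V100 DGXS the higher bandwidth despite its lower memory clock.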