NVIDIA A40 vs NVIDIA Tesla V100 DGXS

Comparative analysis of NVIDIA A40 and NVIDIA Tesla V100 DGXS videocards for all known characteristics in the following categories: Essentials, Technical info, Video outputs and ports, Compatibility, dimensions and requirements, API support, Memory. Benchmark videocards performance analysis: Geekbench - OpenCL, PassMark - G2D Mark, PassMark - G3D Mark.

NVIDIA A40

NVIDIA Tesla V100 DGXS

Differences

Reasons to consider the NVIDIA A40

Videocard is newer: launch date 2 year(s) 6 month(s) later
Around 59% higher core clock speed: 1305 MHz vs 823 MHz
Around 90% higher boost clock speed: 1740 MHz vs 918 MHz
1262.6x more texture fill rate: 584.6 GTexel/s vs 463.0 GTexel / s
2.1x more pipelines: 10752 vs 5120
A newer manufacturing process allows for a more powerful, yet cooler running videocard: 8 nm vs 12 nm
Around 50% higher maximum memory size: 48 GB vs 32 GB
Around 3% higher memory clock speed: 1812 MHz (14.5 Gbps effective) vs 1752 MHz
Around 20% better performance in Geekbench - OpenCL: 193656 vs 161295

Specifications (specs)
Launch date	5 Oct 2020 vs 27 March 2018
Core clock speed	1305 MHz vs 823 MHz
Boost clock speed	1740 MHz vs 918 MHz
Texture fill rate	584.6 GTexel/s vs 463.0 GTexel / s
Pipelines	10752 vs 5120
Manufacturing process technology	8 nm vs 12 nm
Maximum memory size	48 GB vs 32 GB
Memory clock speed	1812 MHz (14.5 Gbps effective) vs 1752 MHz
Benchmarks
Geekbench - OpenCL	193656 vs 161295

Reasons to consider the NVIDIA Tesla V100 DGXS

Around 20% lower typical power consumption: 250 Watt vs 300 Watt

Thermal Design Power (TDP)	250 Watt vs 300 Watt

Compare benchmarks

GPU 1: NVIDIA A40
GPU 2: NVIDIA Tesla V100 DGXS

Geekbench - OpenCL

GPU 1

GPU 2

193656

161295

Name	NVIDIA A40	NVIDIA Tesla V100 DGXS
Geekbench - OpenCL	193656	161295
PassMark - G2D Mark	627
PassMark - G3D Mark	14665

Compare specifications (specs)

	NVIDIA A40	NVIDIA Tesla V100 DGXS
Essentials
Architecture	Ampere	Volta
Code name	GA102	GV100
Launch date	5 Oct 2020	27 March 2018
Place in performance rating	58	57
Type		Workstation
Technical info
Boost clock speed	1740 MHz	918 MHz
Core clock speed	1305 MHz	823 MHz
Manufacturing process technology	8 nm	12 nm
Peak Double Precision (FP64) Performance	1169 GFLOPS (1:32)
Peak Half Precision (FP16) Performance	37.42 TFLOPS (1:1)
Peak Single Precision (FP32) Performance	37.42 TFLOPS
Pipelines	10752	5120
Pixel fill rate	194.9 GPixel/s
Texture fill rate	584.6 GTexel/s	463.0 GTexel / s
Thermal Design Power (TDP)	300 Watt	250 Watt
Transistor count	28300 million	21,100 million
Floating-point performance		14,817 gflops
Video outputs and ports
Display Connectors	3x DisplayPort	No outputs
Compatibility, dimensions and requirements
Form factor	Dual-slot
Interface	PCIe 4.0 x16	PCIe 3.0 x16
Length	267 mm (10.5 inches)
Recommended system power (PSU)	700 Watt
Supplementary power connectors	8-pin EPS	None
Width	112 mm (4.4 inches)
API support
DirectX	12.2	12.0 (12_1)
OpenCL	3.0
OpenGL	4.6	4.6
Shader Model	6.6
Vulkan
Memory
Maximum RAM amount	48 GB	32 GB
Memory bandwidth	695.8 GB/s	897.0 GB / s
Memory bus width	384 bit	4096 Bit
Memory clock speed	1812 MHz (14.5 Gbps effective)	1752 MHz
Memory type	GDDR6	HBM2