NVIDIA A40 vs NVIDIA Tesla V100S PCIe 32 GB

Comparative analysis of the NVIDIA A40 and NVIDIA Tesla V100S PCIe 32 GB videocards across all known characteristics in the following categories: Essentials, Technical info, Video outputs and ports, Compatibility, dimensions and requirements, API support, and Memory. Benchmark performance analysis: Geekbench - OpenCL.

 

Differences

Reasons to consider the NVIDIA A40

  • Videocard is newer: launched 10 months later (5 Oct 2020 vs 26 Nov 2019)
  • Around 5% higher core clock speed: 1305 MHz vs 1245 MHz
  • Around 9% higher boost clock speed: 1740 MHz vs 1597 MHz
  • Around 14% higher texture fill rate: 584.6 GTexel/s vs 511.0 GTexel/s
  • 2.1x more pipelines: 10752 vs 5120
  • Newer manufacturing process, which can allow for a more powerful and more efficient videocard: 8 nm vs 12 nm
  • Around 50% higher maximum memory size: 48 GB vs 32 GB
  • Around 64% higher memory clock speed: 1812 MHz (14.5 Gbps effective) vs 1106 MHz (2.2 Gbps effective)
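
The percentages in the list above follow from the raw values by simple division. The following is a minimal Python sketch, assuming each "around N% higher" figure is the A40 value divided by the V100S value, minus one. The helper name `percent_higher` is illustrative and does not come from the source.

```python
def percent_higher(a: float, b: float) -> float:
    """Return how much higher a is than b, as a percentage of b."""
    return (a / b - 1.0) * 100.0


# (A40 value, V100S value) pairs copied from the list above.
specs = {
    "core clock (MHz)": (1305, 1245),
    "boost clock (MHz)": (1740, 1597),
    "texture fill rate (GTexel/s)": (584.6, 511.0),
    "maximum memory (GB)": (48, 32),
    "memory clock (MHz)": (1812, 1106),
}

for name, (a40, v100s) in specs.items():
    print(f"{name}: {percent_higher(a40, v100s):.1f}% higher")

# Pipelines are quoted as a ratio rather than a percentage.
print(f"pipelines: {10752 / 5120:.1f}x")
```

Rounding each result to a whole number should reproduce the figures quoted in the list.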

Reasons to consider the NVIDIA Tesla V100S PCIe 32 GB

  • Around 17% lower typical power consumption: 250 Watt vs 300 Watt
  • Marginally (around 0.4%) better Geekbench - OpenCL score: 186147 vs 185390
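
The power figures can be read two ways depending on the baseline. Relative to the A40's 300 Watt, the V100S draws about 17% less. Relative to the V100S's 250 Watt, the A40 draws 20% more. The sketch below shows both readings and the size of the benchmark gap. The helper names are illustrative assumptions, not part of the source.

```python
def percent_lower(a: float, b: float) -> float:
    """Return how much lower a is than b, as a percentage of b."""
    return (1.0 - a / b) * 100.0


def percent_higher(a: float, b: float) -> float:
    """Return how much higher a is than b, as a percentage of b."""
    return (a / b - 1.0) * 100.0


# TDP in watts, taken from the comparison.
v100s_tdp, a40_tdp = 250, 300
print(f"V100S TDP is {percent_lower(v100s_tdp, a40_tdp):.1f}% lower")
print(f"A40 TDP is {percent_higher(a40_tdp, v100s_tdp):.1f}% higher")

# Geekbench - OpenCL scores, taken from the comparison.
v100s_score, a40_score = 186147, 185390
print(f"V100S score is {percent_higher(v100s_score, a40_score):.2f}% higher")
```

A gap of this size in a single benchmark is small enough that it may fall within run-to-run variation.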

Compare benchmarks

Name NVIDIA A40 NVIDIA Tesla V100S PCIe 32 GB
Geekbench - OpenCL 185390 186147

Compare specifications (specs)

NVIDIA A40 NVIDIA Tesla V100S PCIe 32 GB

Essentials

Architecture Ampere Volta
Code name GA102 GV100
Launch date 5 Oct 2020 26 Nov 2019
Place in performance rating 5 4
Type Workstation

Technical info

Boost clock speed 1740 MHz 1597 MHz
Core clock speed 1305 MHz 1245 MHz
Manufacturing process technology 8 nm 12 nm
Peak Double Precision (FP64) Performance 1.169 TFLOPS (1:32) 8.177 TFLOPS (1:2)
Peak Half Precision (FP16) Performance 37.42 TFLOPS (1:1) 32.71 TFLOPS (2:1)
Peak Single Precision (FP32) Performance 37.42 TFLOPS 16.35 TFLOPS
Pipelines 10752 5120
Pixel fill rate 194.9 GPixel/s 204.4 GPixel/s
Texture fill rate 584.6 GTexel/s 511.0 GTexel/s
Thermal Design Power (TDP) 300 Watt 250 Watt
Transistor count 28300 million 21100 million
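
The peak throughput figures above appear consistent with a common estimate: two floating-point operations (one fused multiply-add) per pipeline per boost clock cycle. That formula is an assumption here and is not stated in the source. The sketch below applies it through a hypothetical helper, `peak_fp32_tflops`, and then derives the FP64 and FP16 figures from the ratios listed in parentheses.

```python
def peak_fp32_tflops(pipelines: int, boost_mhz: float) -> float:
    """Estimate peak FP32 throughput in TFLOPS.

    Assumption: each pipeline retires one fused multiply-add
    (2 floating-point operations) per boost clock cycle.
    """
    return 2 * pipelines * boost_mhz * 1e6 / 1e12


# Pipelines and boost clocks from the table above.
a40_fp32 = peak_fp32_tflops(10752, 1740)
v100s_fp32 = peak_fp32_tflops(5120, 1597)
print(f"A40 FP32:   {a40_fp32:.2f} TFLOPS")
print(f"V100S FP32: {v100s_fp32:.2f} TFLOPS")

# FP64 and FP16 derived from the ratios listed in the table.
print(f"A40 FP64 (1:32):  {a40_fp32 / 32:.3f} TFLOPS")
print(f"V100S FP64 (1:2): {v100s_fp32 / 2:.3f} TFLOPS")
print(f"A40 FP16 (1:1):   {a40_fp32:.2f} TFLOPS")
print(f"V100S FP16 (2:1): {v100s_fp32 * 2:.2f} TFLOPS")
```

Under this estimate the A40 leads clearly in FP32, while the V100S leads in FP64 by roughly a factor of seven.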

Video outputs and ports

Display Connectors 3x DisplayPort No outputs

Compatibility, dimensions and requirements

Form factor Dual-slot Dual-slot
Interface PCIe 4.0 x16 PCIe 3.0 x16
Length 267 mm (10.5 inches)
Recommended system power (PSU) 700 Watt 600 Watt
Supplementary power connectors 8-pin EPS 2x 8-pin
Width 112 mm (4.4 inches)

API support

DirectX 12.2 12.0 (12_1)
OpenCL 3.0 3.0
OpenGL 4.6 4.6
Shader Model 6.6 6.6
Vulkan

Memory

Maximum RAM amount 48 GB 32 GB
Memory bandwidth 695.8 GB/s 1133 GB/s
Memory bus width 384 bit 4096 bit
Memory clock speed 1812 MHz (14.5 Gbps effective) 1106 MHz (2.2 Gbps effective)
Memory type GDDR6 HBM2
High bandwidth memory (HBM)
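
The memory rows explain why the V100S has the higher bandwidth despite its lower memory clock: its bus is far wider. The sketch below is a rough reconstruction, assuming bandwidth equals bus width times effective data rate divided by 8 bits per byte. It also assumes the effective rate is the listed clock times 8 for GDDR6 and times 2 for HBM2. Those multipliers are assumptions chosen because they reproduce the effective rates shown in parentheses, and `bandwidth_gb_s` is a hypothetical helper name.

```python
def bandwidth_gb_s(bus_width_bits: int, memory_clock_mhz: float,
                   transfers_per_clock: int) -> float:
    """Estimate memory bandwidth in GB/s.

    effective rate (Gbps per pin) = clock (MHz) * transfers per clock / 1000
    bandwidth (GB/s) = bus width (bits) * effective rate / 8
    """
    effective_gbps = memory_clock_mhz * transfers_per_clock / 1000
    return bus_width_bits * effective_gbps / 8


# Assumed multipliers: 8 for GDDR6, 2 for HBM2 (see the lead-in).
a40 = bandwidth_gb_s(384, 1812, transfers_per_clock=8)
v100s = bandwidth_gb_s(4096, 1106, transfers_per_clock=2)
print(f"A40 (GDDR6, 384 bit):  {a40:.1f} GB/s")
print(f"V100S (HBM2, 4096 bit): {v100s:.0f} GB/s")
```

With these assumptions the results should match the 695.8 GB/s and 1133 GB/s figures in the Memory table.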