NVIDIA L40 vs NVIDIA A40

A comparison of the NVIDIA L40 and NVIDIA A40 video cards across all known characteristics in the following categories: Essentials, Technical info, Video outputs and ports, Compatibility, dimensions and requirements, API support, and Memory. Benchmark results are compared for Geekbench - OpenCL, PassMark - G2D Mark, and PassMark - G3D Mark.

 

Differences

Reasons to consider the NVIDIA L40

  • Newer card: launched 2 years later (13 Oct 2022 vs 5 Oct 2020)
  • Around 43% higher boost clock speed: 2490 MHz vs 1740 MHz
  • Around 69% more pipelines: 18176 vs 10752
  • Newer manufacturing process: 5 nm vs 8 nm
  • Around 24% higher memory clock speed: 2250 MHz (18 Gbps effective) vs 1812 MHz (14.5 Gbps effective)
  • Around 142% higher texture fill rate: 1,414 GTexel/s vs 584.6 GTexel/s
  • Around 71% better performance in Geekbench - OpenCL: 331026 vs 193656

Reasons to consider the NVIDIA A40

  • Around 78% higher core clock speed: 1305 MHz vs 735 MHz
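
The "around N% higher" figures in this section are plain ratios of the two listed values. The sketch below recomputes them from the numbers in the specification listing; the helper name `percent_higher` is hypothetical and chosen only for illustration.

```python
def percent_higher(a: float, b: float) -> float:
    """Return how much larger a is than b, as a percentage of b."""
    return (a / b - 1.0) * 100.0


# (label, higher value, lower value), copied from the specification listing.
comparisons = [
    ("Boost clock speed, L40 over A40 (MHz)", 2490, 1740),
    ("Pipelines, L40 over A40", 18176, 10752),
    ("Memory clock speed, L40 over A40 (MHz)", 2250, 1812),
    ("Texture fill rate, L40 over A40 (GTexel/s)", 1414, 584.6),
    ("Geekbench - OpenCL, L40 over A40", 331026, 193656),
    ("Core clock speed, A40 over L40 (MHz)", 1305, 735),
]

for label, higher, lower in comparisons:
    print(f"{label}: around {percent_higher(higher, lower):.0f}% higher")
```

Expressing every difference as a percentage of the lower value keeps the figures comparable across rows with different units.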

Compare benchmarks

Name NVIDIA L40 NVIDIA A40
Geekbench - OpenCL 331026 193656
PassMark - G2D Mark 627
PassMark - G3D Mark 14665

Compare specifications (specs)

NVIDIA L40 NVIDIA A40

Essentials

| Specification | NVIDIA L40 | NVIDIA A40 |
| --- | --- | --- |
| Architecture | Ada Lovelace | Ampere |
| Code name | AD102 | GA102 |
| Launch date | 13 Oct 2022 | 5 Oct 2020 |
| Place in performance rating | 2 | 58 |

Technical info

| Specification | NVIDIA L40 | NVIDIA A40 |
| --- | --- | --- |
| Boost clock speed | 2490 MHz | 1740 MHz |
| Core clock speed | 735 MHz | 1305 MHz |
| Manufacturing process technology | 5 nm | 8 nm |
| Pipelines | 18176 | 10752 |
| Pixel fill rate | 478.1 GPixel/s | 194.9 GPixel/s |
| Texture fill rate | 1,414 GTexel/s | 584.6 GTexel/s |
| Thermal Design Power (TDP) | 300 Watt | 300 Watt |
| Transistor count | 76300 million | 28300 million |
| Peak Double Precision (FP64) Performance | not listed | 1169 GFLOPS (1:32) |
| Peak Half Precision (FP16) Performance | not listed | 37.42 TFLOPS (1:1) |
| Peak Single Precision (FP32) Performance | not listed | 37.42 TFLOPS |
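
The 37.42 TFLOPS and 1169 GFLOPS figures match the A40's pipeline count and boost clock under the common estimate of two FP32 operations (one fused multiply-add) per pipeline per clock. The sketch below works under that assumption. The function name is hypothetical, and the L40 result is an estimate derived from the listed pipelines and boost clock, not a value taken from the listing.

```python
def peak_fp32_tflops(pipelines: int, boost_mhz: float) -> float:
    """Estimate peak FP32 throughput in TFLOPS.

    Assumes 2 floating-point operations (one fused multiply-add)
    per pipeline per clock cycle at the boost clock.
    """
    flops = 2 * pipelines * boost_mhz * 1e6
    return flops / 1e12


# Values from the Technical info listing.
a40_fp32 = peak_fp32_tflops(pipelines=10752, boost_mhz=1740)
l40_fp32 = peak_fp32_tflops(pipelines=18176, boost_mhz=2490)

# The listing gives the A40's FP64 rate as 1:32 of its FP32 rate.
a40_fp64_gflops = a40_fp32 * 1000 / 32

print(f"A40 FP32: {a40_fp32:.2f} TFLOPS")
print(f"A40 FP64: {a40_fp64_gflops:.0f} GFLOPS")
print(f"L40 FP32 (estimate, not listed): {l40_fp32:.2f} TFLOPS")
```

Real workloads rarely sustain the boost clock on every pipeline, so these are theoretical upper bounds rather than measured throughput.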

Video outputs and ports

| Specification | NVIDIA L40 | NVIDIA A40 |
| --- | --- | --- |
| Display connectors | 4x DisplayPort 1.4a | 3x DisplayPort |

Compatibility, dimensions and requirements

| Specification | NVIDIA L40 | NVIDIA A40 |
| --- | --- | --- |
| Form factor | Dual-slot | Dual-slot |
| Interface | PCIe 4.0 x16 | PCIe 4.0 x16 |
| Length | 267 mm (10.5 inches) | 267 mm (10.5 inches) |
| Recommended system power (PSU) | 700 Watt | 700 Watt |
| Supplementary power connectors | 1x 16-pin | 8-pin EPS |
| Width | 111 mm (4.4 inches) | 112 mm (4.4 inches) |

API support

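| API | NVIDIA L40 | NVIDIA A40 |
| --- | --- | --- |
| DirectX | 12 Ultimate (12_2) | 12.2 |
| OpenCL | 3.0 | 3.0 |
| OpenGL | 4.6 | 4.6 |
| Shader Model | 6.7 | 6.6 |
| Vulkan | not listed | not listed |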

Memory

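| Specification | NVIDIA L40 | NVIDIA A40 |
| --- | --- | --- |
| Maximum RAM amount | 48 GB | 48 GB |
| Memory bandwidth | 864.0 GB/s | 695.8 GB/s |
| Memory bus width | 384 bit | 384 bit |
| Memory clock speed | 2250 MHz (18 Gbps effective) | 1812 MHz (14.5 Gbps effective) |
| Memory type | GDDR6 | GDDR6 |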
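
The memory figures above are mutually consistent: bandwidth equals the bus width in bytes multiplied by the effective per-pin data rate. The sketch below reproduces both bandwidth values. The factor of 8 between the listed memory clock and the effective rate is an assumption inferred from the listed pairs (2250 MHz and 18 Gbps, 1812 MHz and 14.5 Gbps), and the function names are hypothetical.

```python
def effective_rate_gbps(memory_clock_mhz: float, multiplier: int = 8) -> float:
    """Effective per-pin data rate in Gbps.

    The multiplier of 8 is an assumption inferred from the listed
    GDDR6 clock/effective-rate pairs, not a value stated in the listing.
    """
    return memory_clock_mhz * multiplier / 1000


def bandwidth_gb_per_s(bus_width_bits: int, rate_gbps: float) -> float:
    """Memory bandwidth in GB/s: bus width in bytes times the per-pin rate."""
    return bus_width_bits / 8 * rate_gbps


# (card, memory clock in MHz, bus width in bits), from the Memory listing.
cards = [
    ("NVIDIA L40", 2250, 384),
    ("NVIDIA A40", 1812, 384),
]

for name, clock_mhz, bus_bits in cards:
    rate = effective_rate_gbps(clock_mhz)
    bandwidth = bandwidth_gb_per_s(bus_bits, rate)
    print(f"{name}: {rate:.1f} Gbps effective, {bandwidth:.1f} GB/s")
```

Because both cards use the same 384-bit bus, the bandwidth gap comes entirely from the higher memory clock of the L40.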