NVIDIA Tesla P4 vs NVIDIA Tesla M40

Comparative analysis of NVIDIA Tesla P4 and NVIDIA Tesla M40 videocards for all known characteristics in the following categories: Essentials, Technical info, Video outputs and ports, Compatibility, dimensions and requirements, API support, Memory. Benchmark videocards performance analysis: PassMark - G3D Mark, PassMark - G2D Mark, Geekbench - OpenCL, CompuBench 1.5 Desktop - Face Detection (mPixels/s), CompuBench 1.5 Desktop - Ocean Surface Simulation (Frames/s), CompuBench 1.5 Desktop - T-Rex (Frames/s), CompuBench 1.5 Desktop - Video Composition (Frames/s), CompuBench 1.5 Desktop - Bitcoin Mining (mHash/s), GFXBench 4.0 - Car Chase Offscreen (Frames), GFXBench 4.0 - Manhattan (Frames), GFXBench 4.0 - T-Rex (Frames), GFXBench 4.0 - Car Chase Offscreen (Fps), GFXBench 4.0 - Manhattan (Fps), GFXBench 4.0 - T-Rex (Fps).

 

Differences

Reasons to consider the NVIDIA Tesla P4

  • Videocard is newer: launch date 10 month(s) later
  • A newer manufacturing process allows for a more powerful, yet cooler running videocard: 16 nm vs 28 nm
  • 3.3x lower typical power consumption: 75 Watt vs 250 Watt
Launch date 13 September 2016 vs 10 November 2015
Manufacturing process technology 16 nm vs 28 nm
Thermal Design Power (TDP) 75 Watt vs 250 Watt

Reasons to consider the NVIDIA Tesla M40

  • Around 17% higher core clock speed: 948 MHz vs 810 MHz
  • Around 5% higher boost clock speed: 1114 MHz vs 1063 MHz
  • Around 26% higher texture fill rate: 213.9 GTexel / s vs 170.1 GTexel / s
  • Around 20% higher pipelines: 3072 vs 2560
  • Around 26% better floating-point performance: 6,844 gflops vs 5,443 gflops
  • Around 50% higher maximum memory size: 12 GB vs 8 GB
  • Around 12% better performance in PassMark - G3D Mark: 10220 vs 9097
  • Around 12% better performance in PassMark - G2D Mark: 437 vs 391
  • Around 3% better performance in Geekbench - OpenCL: 39184 vs 37924
  • Around 25% better performance in CompuBench 1.5 Desktop - Face Detection (mPixels/s): 183.81 vs 147.62
  • Around 47% better performance in CompuBench 1.5 Desktop - Ocean Surface Simulation (Frames/s): 2637.997 vs 1791.761
  • Around 38% better performance in CompuBench 1.5 Desktop - T-Rex (Frames/s): 13.059 vs 9.457
  • Around 61% better performance in CompuBench 1.5 Desktop - Video Composition (Frames/s): 160.359 vs 99.574
  • Around 32% better performance in CompuBench 1.5 Desktop - Bitcoin Mining (mHash/s): 688.388 vs 523.29
Specifications (specs)
Core clock speed 948 MHz vs 810 MHz
Boost clock speed 1114 MHz vs 1063 MHz
Texture fill rate 213.9 GTexel / s vs 170.1 GTexel / s
Pipelines 3072 vs 2560
Floating-point performance 6,844 gflops vs 5,443 gflops
Maximum memory size 12 GB vs 8 GB
Benchmarks
PassMark - G3D Mark 10220 vs 9097
PassMark - G2D Mark 437 vs 391
Geekbench - OpenCL 39184 vs 37924
CompuBench 1.5 Desktop - Face Detection (mPixels/s) 183.81 vs 147.62
CompuBench 1.5 Desktop - Ocean Surface Simulation (Frames/s) 2637.997 vs 1791.761
CompuBench 1.5 Desktop - T-Rex (Frames/s) 13.059 vs 9.457
CompuBench 1.5 Desktop - Video Composition (Frames/s) 160.359 vs 99.574
CompuBench 1.5 Desktop - Bitcoin Mining (mHash/s) 688.388 vs 523.29

Compare benchmarks

GPU 1: NVIDIA Tesla P4
GPU 2: NVIDIA Tesla M40

PassMark - G3D Mark
GPU 1
GPU 2
9097
10220
PassMark - G2D Mark
GPU 1
GPU 2
391
437
Geekbench - OpenCL
GPU 1
GPU 2
37924
39184
CompuBench 1.5 Desktop - Face Detection (mPixels/s)
GPU 1
GPU 2
147.62
183.81
CompuBench 1.5 Desktop - Ocean Surface Simulation (Frames/s)
GPU 1
GPU 2
1791.761
2637.997
CompuBench 1.5 Desktop - T-Rex (Frames/s)
GPU 1
GPU 2
9.457
13.059
CompuBench 1.5 Desktop - Video Composition (Frames/s)
GPU 1
GPU 2
99.574
160.359
CompuBench 1.5 Desktop - Bitcoin Mining (mHash/s)
GPU 1
GPU 2
523.29
688.388
Name NVIDIA Tesla P4 NVIDIA Tesla M40
PassMark - G3D Mark 9097 10220
PassMark - G2D Mark 391 437
Geekbench - OpenCL 37924 39184
CompuBench 1.5 Desktop - Face Detection (mPixels/s) 147.62 183.81
CompuBench 1.5 Desktop - Ocean Surface Simulation (Frames/s) 1791.761 2637.997
CompuBench 1.5 Desktop - T-Rex (Frames/s) 9.457 13.059
CompuBench 1.5 Desktop - Video Composition (Frames/s) 99.574 160.359
CompuBench 1.5 Desktop - Bitcoin Mining (mHash/s) 523.29 688.388
GFXBench 4.0 - Car Chase Offscreen (Frames) 11409
GFXBench 4.0 - Manhattan (Frames) 3698
GFXBench 4.0 - T-Rex (Frames) 3341
GFXBench 4.0 - Car Chase Offscreen (Fps) 11409
GFXBench 4.0 - Manhattan (Fps) 3698
GFXBench 4.0 - T-Rex (Fps) 3341

Compare specifications (specs)

NVIDIA Tesla P4 NVIDIA Tesla M40

Essentials

Architecture Pascal Maxwell 2.0
Code name GP104 GM200
Launch date 13 September 2016 10 November 2015
Place in performance rating 349 260
Type Workstation Workstation

Technical info

Boost clock speed 1063 MHz 1114 MHz
Core clock speed 810 MHz 948 MHz
Floating-point performance 5,443 gflops 6,844 gflops
Manufacturing process technology 16 nm 28 nm
Pipelines 2560 3072
Texture fill rate 170.1 GTexel / s 213.9 GTexel / s
Thermal Design Power (TDP) 75 Watt 250 Watt
Transistor count 7,200 million 8,000 million

Video outputs and ports

Display Connectors No outputs No outputs

Compatibility, dimensions and requirements

Interface PCIe 3.0 x16 PCIe 3.0 x16
Length 267 mm 267 mm
Supplementary power connectors None 1x 6-pin + 1x 8-pin

API support

DirectX 12.0 (12_1) 12.0 (12_1)
OpenGL 4.6 4.6

Memory

Maximum RAM amount 8 GB 12 GB
Memory bandwidth 192.3 GB / s 288.0 GB / s
Memory bus width 256 Bit 384 Bit
Memory clock speed 6008 MHz 6008 MHz
Memory type GDDR5 GDDR5