NVIDIA Tesla P4 vs NVIDIA Tesla P40

Comparative analysis of NVIDIA Tesla P4 and NVIDIA Tesla P40 videocards for all known characteristics in the following categories: Essentials, Technical info, Video outputs and ports, Compatibility, dimensions and requirements, API support, Memory. Benchmark videocards performance analysis: PassMark - G3D Mark, PassMark - G2D Mark, Geekbench - OpenCL, CompuBench 1.5 Desktop - Face Detection (mPixels/s), CompuBench 1.5 Desktop - Ocean Surface Simulation (Frames/s), CompuBench 1.5 Desktop - T-Rex (Frames/s), CompuBench 1.5 Desktop - Video Composition (Frames/s), CompuBench 1.5 Desktop - Bitcoin Mining (mHash/s), GFXBench 4.0 - Car Chase Offscreen (Frames), GFXBench 4.0 - Manhattan (Frames), GFXBench 4.0 - T-Rex (Frames), GFXBench 4.0 - Car Chase Offscreen (Fps), GFXBench 4.0 - Manhattan (Fps), GFXBench 4.0 - T-Rex (Fps).

 

Differences

Reasons to consider the NVIDIA Tesla P4

  • 3.3x lower typical power consumption: 75 Watt vs 250 Watt
  • Around 20% better performance in GFXBench 4.0 - Car Chase Offscreen (Frames): 11409 vs 9538
  • Around 20% better performance in GFXBench 4.0 - Car Chase Offscreen (Fps): 11409 vs 9538
Specifications (specs)
Thermal Design Power (TDP) 75 Watt vs 250 Watt
Benchmarks
GFXBench 4.0 - Car Chase Offscreen (Frames) 11409 vs 9538
GFXBench 4.0 - T-Rex (Frames) 3341 vs 3340
GFXBench 4.0 - Car Chase Offscreen (Fps) 11409 vs 9538
GFXBench 4.0 - T-Rex (Fps) 3341 vs 3340

Reasons to consider the NVIDIA Tesla P40

  • Around 61% higher core clock speed: 1303 MHz vs 810 MHz
  • Around 44% higher boost clock speed: 1531 MHz vs 1063 MHz
  • 2.2x more texture fill rate: 367.4 GTexel / s vs 170.1 GTexel / s
  • Around 50% higher pipelines: 3840 vs 2560
  • 2.2x better floating-point performance: 11,758 gflops vs 5,443 gflops
  • 3x more maximum memory size: 24 GB vs 8 GB
  • Around 67% higher memory clock speed: 10008 MHz vs 6008 MHz
  • Around 35% better performance in PassMark - G3D Mark: 12267 vs 9097
  • Around 12% better performance in PassMark - G2D Mark: 439 vs 391
  • Around 64% better performance in Geekbench - OpenCL: 62287 vs 37924
  • 2x better performance in CompuBench 1.5 Desktop - Face Detection (mPixels/s): 300.355 vs 147.62
  • Around 99% better performance in CompuBench 1.5 Desktop - Ocean Surface Simulation (Frames/s): 3559.383 vs 1791.761
  • 2.1x better performance in CompuBench 1.5 Desktop - T-Rex (Frames/s): 19.757 vs 9.457
  • 2.1x better performance in CompuBench 1.5 Desktop - Video Composition (Frames/s): 204.32 vs 99.574
  • 2.3x better performance in CompuBench 1.5 Desktop - Bitcoin Mining (mHash/s): 1214.142 vs 523.29
Specifications (specs)
Core clock speed 1303 MHz vs 810 MHz
Boost clock speed 1531 MHz vs 1063 MHz
Texture fill rate 367.4 GTexel / s vs 170.1 GTexel / s
Pipelines 3840 vs 2560
Floating-point performance 11,758 gflops vs 5,443 gflops
Maximum memory size 24 GB vs 8 GB
Memory clock speed 10008 MHz vs 6008 MHz
Benchmarks
PassMark - G3D Mark 12267 vs 9097
PassMark - G2D Mark 439 vs 391
Geekbench - OpenCL 62287 vs 37924
CompuBench 1.5 Desktop - Face Detection (mPixels/s) 300.355 vs 147.62
CompuBench 1.5 Desktop - Ocean Surface Simulation (Frames/s) 3559.383 vs 1791.761
CompuBench 1.5 Desktop - T-Rex (Frames/s) 19.757 vs 9.457
CompuBench 1.5 Desktop - Video Composition (Frames/s) 204.32 vs 99.574
CompuBench 1.5 Desktop - Bitcoin Mining (mHash/s) 1214.142 vs 523.29
GFXBench 4.0 - Manhattan (Frames) 3699 vs 3698
GFXBench 4.0 - Manhattan (Fps) 3699 vs 3698

Compare benchmarks

GPU 1: NVIDIA Tesla P4
GPU 2: NVIDIA Tesla P40

PassMark - G3D Mark
GPU 1
GPU 2
9097
12267
PassMark - G2D Mark
GPU 1
GPU 2
391
439
Geekbench - OpenCL
GPU 1
GPU 2
37924
62287
CompuBench 1.5 Desktop - Face Detection (mPixels/s)
GPU 1
GPU 2
147.62
300.355
CompuBench 1.5 Desktop - Ocean Surface Simulation (Frames/s)
GPU 1
GPU 2
1791.761
3559.383
CompuBench 1.5 Desktop - T-Rex (Frames/s)
GPU 1
GPU 2
9.457
19.757
CompuBench 1.5 Desktop - Video Composition (Frames/s)
GPU 1
GPU 2
99.574
204.32
CompuBench 1.5 Desktop - Bitcoin Mining (mHash/s)
GPU 1
GPU 2
523.29
1214.142
GFXBench 4.0 - Car Chase Offscreen (Frames)
GPU 1
GPU 2
11409
9538
GFXBench 4.0 - Manhattan (Frames)
GPU 1
GPU 2
3698
3699
GFXBench 4.0 - T-Rex (Frames)
GPU 1
GPU 2
3341
3340
GFXBench 4.0 - Car Chase Offscreen (Fps)
GPU 1
GPU 2
11409
9538
GFXBench 4.0 - Manhattan (Fps)
GPU 1
GPU 2
3698
3699
GFXBench 4.0 - T-Rex (Fps)
GPU 1
GPU 2
3341
3340
Name NVIDIA Tesla P4 NVIDIA Tesla P40
PassMark - G3D Mark 9097 12267
PassMark - G2D Mark 391 439
Geekbench - OpenCL 37924 62287
CompuBench 1.5 Desktop - Face Detection (mPixels/s) 147.62 300.355
CompuBench 1.5 Desktop - Ocean Surface Simulation (Frames/s) 1791.761 3559.383
CompuBench 1.5 Desktop - T-Rex (Frames/s) 9.457 19.757
CompuBench 1.5 Desktop - Video Composition (Frames/s) 99.574 204.32
CompuBench 1.5 Desktop - Bitcoin Mining (mHash/s) 523.29 1214.142
GFXBench 4.0 - Car Chase Offscreen (Frames) 11409 9538
GFXBench 4.0 - Manhattan (Frames) 3698 3699
GFXBench 4.0 - T-Rex (Frames) 3341 3340
GFXBench 4.0 - Car Chase Offscreen (Fps) 11409 9538
GFXBench 4.0 - Manhattan (Fps) 3698 3699
GFXBench 4.0 - T-Rex (Fps) 3341 3340

Compare specifications (specs)

NVIDIA Tesla P4 NVIDIA Tesla P40

Essentials

Architecture Pascal Pascal
Code name GP104 GP102
Launch date 13 September 2016 13 September 2016
Place in performance rating 349 234
Type Workstation Workstation
Launch price (MSRP) $5,699

Technical info

Boost clock speed 1063 MHz 1531 MHz
Core clock speed 810 MHz 1303 MHz
Floating-point performance 5,443 gflops 11,758 gflops
Manufacturing process technology 16 nm 16 nm
Pipelines 2560 3840
Texture fill rate 170.1 GTexel / s 367.4 GTexel / s
Thermal Design Power (TDP) 75 Watt 250 Watt
Transistor count 7,200 million 11,800 million

Video outputs and ports

Display Connectors No outputs No outputs

Compatibility, dimensions and requirements

Interface PCIe 3.0 x16 PCIe 3.0 x16
Length 267 mm 267 mm
Supplementary power connectors None 1x 6-pin + 1x 8-pin

API support

DirectX 12.0 (12_1) 12.0 (12_1)
OpenGL 4.6 4.6

Memory

Maximum RAM amount 8 GB 24 GB
Memory bandwidth 192.3 GB / s 480.4 GB / s
Memory bus width 256 Bit 384 Bit
Memory clock speed 6008 MHz 10008 MHz
Memory type GDDR5 GDDR5X