NVIDIA Tesla P100 PCIe 16 GB vs NVIDIA Tesla M40

A comparison of the NVIDIA Tesla P100 PCIe 16 GB and NVIDIA Tesla M40 videocards across all known characteristics in the following categories: Essentials, Technical info, Video outputs and ports, Compatibility, dimensions and requirements, API support, Memory. Benchmark results are compared for: Geekbench - OpenCL, GFXBench 4.0 - Car Chase Offscreen (Frames), GFXBench 4.0 - Car Chase Offscreen (Fps), GFXBench 4.0 - Manhattan (Frames), GFXBench 4.0 - Manhattan (Fps), GFXBench 4.0 - T-Rex (Frames), GFXBench 4.0 - T-Rex (Fps), PassMark - G2D Mark, PassMark - G3D Mark, CompuBench 1.5 Desktop - Face Detection (mPixels/s), CompuBench 1.5 Desktop - Ocean Surface Simulation (Frames/s), CompuBench 1.5 Desktop - T-Rex (Frames/s), CompuBench 1.5 Desktop - Video Composition (Frames/s), CompuBench 1.5 Desktop - Bitcoin Mining (mHash/s).

 

Differences

Reasons to consider the NVIDIA Tesla P100 PCIe 16 GB

  • Videocard is newer: launched about 7 months later (20 June 2016 vs 10 November 2015)
  • Around 26% higher core clock speed: 1190 MHz vs 948 MHz
  • Around 19% higher boost clock speed: 1329 MHz vs 1114 MHz
  • Around 55% higher texture fill rate: 331.5 GTexel / s vs 213.9 GTexel / s
  • Around 17% more pipelines: 3584 vs 3072
  • Around 55% better floating-point performance: 10,609 gflops vs 6,844 gflops
  • A newer manufacturing process: 16 nm vs 28 nm (both cards are rated at the same 250 Watt TDP, so the gain shows up as higher performance rather than lower power draw)
  • Around 33% higher maximum memory size: 16 GB vs 12 GB
  • Around 98% better performance in Geekbench - OpenCL: 77871 vs 39310
  • Around 27% better performance in PassMark - G2D Mark: 572 vs 452
Specifications (specs)
Launch date 20 June 2016 vs 10 November 2015
Core clock speed 1190 MHz vs 948 MHz
Boost clock speed 1329 MHz vs 1114 MHz
Texture fill rate 331.5 GTexel / s vs 213.9 GTexel / s
Pipelines 3584 vs 3072
Floating-point performance 10,609 gflops vs 6,844 gflops
Manufacturing process technology 16 nm vs 28 nm
Maximum memory size 16 GB vs 12 GB
Benchmarks
Geekbench - OpenCL 77871 vs 39310
PassMark - G2D Mark 572 vs 452

Reasons to consider the NVIDIA Tesla M40

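  • 4.2x higher memory clock speed: 6008 MHz vs 1430 MHz (the P100's much wider memory bus still gives it the higher memory bandwidth: 720.9 GB / s vs 288.0 GB / s)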
  • Around 45% better performance in PassMark - G3D Mark: 10465 vs 7225
Specifications (specs)
Memory clock speed 6008 MHz vs 1430 MHz
Benchmarks
PassMark - G3D Mark 10465 vs 7225
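
The "around N%" figures in both lists are plain relative differences between the two values shown. A minimal sketch of that arithmetic follows, assuming the page rounds to the nearest whole percent; the helper name `percent_higher` is hypothetical and only a subset of the rows is included.

```python
def percent_higher(a: float, b: float) -> int:
    """Return how much larger a is than b, as a rounded percentage."""
    return round((a / b - 1) * 100)

# (P100 value, M40 value) pairs copied from the lists above
specs = {
    "core clock (MHz)": (1190, 948),
    "boost clock (MHz)": (1329, 1114),
    "pipelines": (3584, 3072),
    "Geekbench - OpenCL": (77871, 39310),
}

for name, (p100, m40) in specs.items():
    print(f"{name}: P100 is about {percent_higher(p100, m40)}% higher")

# The M40's advantages use the same formula with the operands swapped
print(f"PassMark - G3D Mark: M40 is about {percent_higher(10465, 7225)}% higher")
print(f"memory clock: M40 is {6008 / 1430:.1f}x the P100")
```

With these inputs the sketch gives 26%, 19%, 17% and 98% for the P100 rows, and 45% and 4.2x for the M40 rows, matching the figures quoted in the lists.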

Compare benchmarks

GPU 1: NVIDIA Tesla P100 PCIe 16 GB
GPU 2: NVIDIA Tesla M40

Geekbench - OpenCL: GPU 1 77871, GPU 2 39310
PassMark - G2D Mark: GPU 1 572, GPU 2 452
PassMark - G3D Mark: GPU 1 7225, GPU 2 10465
Name NVIDIA Tesla P100 PCIe 16 GB NVIDIA Tesla M40
Geekbench - OpenCL 77871 39310
GFXBench 4.0 - Car Chase Offscreen (Frames) 13720
GFXBench 4.0 - Car Chase Offscreen (Fps) 13720
GFXBench 4.0 - Manhattan (Frames) 6381
GFXBench 4.0 - Manhattan (Fps) 6381
GFXBench 4.0 - T-Rex (Frames) 8915
GFXBench 4.0 - T-Rex (Fps) 8915
PassMark - G2D Mark 572 452
PassMark - G3D Mark 7225 10465
CompuBench 1.5 Desktop - Face Detection (mPixels/s) 183.81
CompuBench 1.5 Desktop - Ocean Surface Simulation (Frames/s) 2637.997
CompuBench 1.5 Desktop - T-Rex (Frames/s) 13.059
CompuBench 1.5 Desktop - Video Composition (Frames/s) 160.359
CompuBench 1.5 Desktop - Bitcoin Mining (mHash/s) 688.388

Compare specifications (specs)

NVIDIA Tesla P100 PCIe 16 GB NVIDIA Tesla M40

Essentials

Architecture Pascal Maxwell 2.0
Code name GP100 GM200
Launch date 20 June 2016 10 November 2015
Launch price (MSRP) $5,699
Place in performance rating 192 264
Type Workstation Workstation

Technical info

Boost clock speed 1329 MHz 1114 MHz
Core clock speed 1190 MHz 948 MHz
Floating-point performance 10,609 gflops 6,844 gflops
Manufacturing process technology 16 nm 28 nm
Pipelines 3584 3072
Texture fill rate 331.5 GTexel / s 213.9 GTexel / s
Thermal Design Power (TDP) 250 Watt 250 Watt
Transistor count 15,300 million 8,000 million
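
The floating-point figures above are theoretical peaks rather than measurements. A rough sketch of how such a peak is commonly estimated, assuming each pipeline completes one fused multiply-add (2 floating-point operations) per clock at the boost clock; the function name `peak_fp32_gflops` is hypothetical.

```python
def peak_fp32_gflops(pipelines: int, clock_mhz: float) -> float:
    """Theoretical FP32 peak, assuming 2 FLOPs (one FMA) per pipeline per clock."""
    return 2 * pipelines * clock_mhz / 1000  # MFLOPS -> GFLOPS

# Pipelines and boost clocks copied from the Technical info table
m40 = peak_fp32_gflops(3072, 1114)
p100 = peak_fp32_gflops(3584, 1329)

print(f"M40:  {m40:,.0f} gflops")
print(f"P100: {p100:,.0f} gflops")
```

Under this assumption the M40 comes out at about 6,844 gflops, matching the table. The P100 comes out at about 9,526 gflops from the listed 1329 MHz boost clock, which is below the listed 10,609 gflops; the listed value may have been derived from a higher clock than the one shown here.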

Video outputs and ports

Display Connectors No outputs No outputs

Compatibility, dimensions and requirements

Interface PCIe 3.0 x16 PCIe 3.0 x16
Length 267 mm 267 mm
Supplementary power connectors 1x 8-pin 1x 6-pin + 1x 8-pin

API support

DirectX 12.0 (12_1) 12.0 (12_1)
OpenGL 4.6 4.6

Memory

Maximum RAM amount 16 GB 12 GB
Memory bandwidth 720.9 GB / s 288.0 GB / s
Memory bus width 4096 Bit 384 Bit
Memory clock speed 1430 MHz 6008 MHz
Memory type HBM2 GDDR5
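
Memory clock speed alone says little about throughput, because bandwidth also depends on bus width. A minimal sketch of the usual estimate, assuming the listed memory clock is the effective per-pin data rate; the function name `bandwidth_gb_s` is hypothetical.

```python
def bandwidth_gb_s(bus_width_bits: int, data_rate_mhz: float) -> float:
    """Peak bandwidth = bus width in bytes x effective transfers per second."""
    return bus_width_bits / 8 * data_rate_mhz / 1000  # MB/s -> GB/s

# Bus width and memory clock copied from the Memory table
p100 = bandwidth_gb_s(4096, 1430)  # HBM2: very wide bus, low clock
m40 = bandwidth_gb_s(384, 6008)    # GDDR5: narrow bus, high clock

print(f"P100: {p100:.1f} GB/s")
print(f"M40:  {m40:.1f} GB/s")
print(f"P100 / M40: {p100 / m40:.1f}x")
```

The estimate gives roughly 732 GB/s for the P100 and 288 GB/s for the M40, close to (though not identical with) the listed 720.9 GB / s and 288.0 GB / s. Either way the P100 has about 2.5x the bandwidth despite its lower memory clock.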