Tesla T4 vs Tesla P40
Aggregate performance score
We've compared Tesla P40 and Tesla T4, covering specs and all relevant benchmarks.
Tesla P40 outperforms Tesla T4 by a moderate 13% based on our aggregate benchmark results.
Primary details
GPU architecture, market segment, value for money and other general parameters compared.
Place in the ranking | 172 | 196 |
Place by popularity | not in top-100 | not in top-100 |
Cost-effectiveness evaluation | 2.57 | no data |
Power efficiency | 9.03 | 28.41 |
Architecture | Pascal (2016−2021) | Turing (2018−2022) |
GPU code name | GP102 | TU104 |
Market segment | Workstation | Workstation |
Release date | 13 September 2016 (8 years ago) | 13 September 2018 (6 years ago) |
Launch price (MSRP) | $5,699 | no data |
Cost-effectiveness evaluation
Performance to price ratio. The higher, the better.
Detailed specifications
General parameters such as number of shaders, GPU core base clock and boost clock speeds, manufacturing process, texturing and calculation speed. Note that power consumption of some graphics cards can well exceed their nominal TDP, especially when overclocked.
Pipelines / CUDA cores | 3840 | 2560 |
Core clock speed | 1303 MHz | 585 MHz |
Boost clock speed | 1531 MHz | 1590 MHz |
Number of transistors | 11,800 million | 13,600 million |
Manufacturing process technology | 16 nm | 12 nm |
Power consumption (TDP) | 250 Watt | 70 Watt |
Texture fill rate | 367.4 | 254.4 |
Floating-point processing power | 11.76 TFLOPS | 8.141 TFLOPS |
ROPs | 96 | 64 |
TMUs | 240 | 160 |
Tensor Cores | no data | 320 |
Ray Tracing Cores | no data | 40 |
Form factor & compatibility
Information on compatibility with other computer components. Useful when choosing a future computer configuration or upgrading an existing one. For desktop graphics cards it's interface and bus (motherboard compatibility), additional power connectors (power supply compatibility).
Interface | PCIe 3.0 x16 | PCIe 3.0 x16 |
Length | 267 mm | 168 mm |
Width | 2-slot | 1-slot |
Supplementary power connectors | 8-pin EPS | None |
VRAM capacity and type
Parameters of VRAM installed: its type, size, bus, clock and resulting bandwidth. Integrated GPUs have no dedicated video RAM and use a shared part of system RAM.
Memory type | GDDR5 | GDDR6 |
Maximum RAM amount | 24 GB | 16 GB |
Memory bus width | 384 Bit | 256 Bit |
Memory clock speed | 1808 MHz | 1250 MHz |
Memory bandwidth | 347.1 GB/s | 320.0 GB/s |
Connectivity and outputs
Types and number of video connectors present on the reviewed GPUs. As a rule, data in this section is precise only for desktop reference ones (so-called Founders Edition for NVIDIA chips). OEM manufacturers may change the number and type of output ports, while for notebook cards availability of certain video outputs ports depends on the laptop model rather than on the card itself.
Display Connectors | No outputs | No outputs |
API compatibility
List of supported 3D and general-purpose computing APIs, including their specific versions.
DirectX | 12 (12_1) | 12 Ultimate (12_1) |
Shader Model | 6.7 | 6.5 |
OpenGL | 4.6 | 4.6 |
OpenCL | 3.0 | 1.2 |
Vulkan | 1.3 | 1.2.131 |
CUDA | 6.1 | 7.5 |
Synthetic benchmark performance
Non-gaming benchmark results comparison. The combined score is measured on a 0-100 point scale.
Combined synthetic benchmark score
This is our combined benchmark score. We are regularly improving our combining algorithms, but if you find some perceived inconsistencies, feel free to speak up in comments section, we usually fix problems quickly.
Passmark
This is the most ubiquitous GPU benchmark. It gives the graphics card a thorough evaluation under various types of load, providing four separate benchmarks for Direct3D versions 9, 10, 11 and 12 (the last being done in 4K resolution if possible), and few more tests engaging DirectCompute capabilities.
Gaming performance
Let's see how good the compared graphics cards are for gaming. Particular gaming benchmark results are measured in FPS.
Pros & cons summary
Performance score | 31.59 | 27.84 |
Recency | 13 September 2016 | 13 September 2018 |
Maximum RAM amount | 24 GB | 16 GB |
Chip lithography | 16 nm | 12 nm |
Power consumption (TDP) | 250 Watt | 70 Watt |
Tesla P40 has a 13.5% higher aggregate performance score, and a 50% higher maximum VRAM amount.
Tesla T4, on the other hand, has an age advantage of 2 years, a 33.3% more advanced lithography process, and 257.1% lower power consumption.
The Tesla P40 is our recommended choice as it beats the Tesla T4 in performance tests.
Should you still have questions concerning choice between the reviewed GPUs, ask them in Comments section, and we shall answer.
Comparisons with similar GPUs
We selected several comparisons of graphics cards with performance close to those reviewed, providing you with more options to consider.