In the new era of AI and intelligent machines, Deep Learning is shaping our world like no other computing model in history. Interactive speech, visual search, and video recommendations are a few of the many AI-based services that we use every day. Accuracy and responsiveness are key to user adoption for these services. As Deep Learning models increase in accuracy and complexity, CPUs are no longer capable of delivering a responsive user experience.

The NVIDIA® Tesla® P4 is powered by the revolutionary NVIDIA® Pascal™ architecture and purpose-built to boost efficiency for scale-out servers running deep learning workloads, enabling smart, responsive AI-based services. It slashes inference latency by 15X in any hyperscale infrastructure and provides an incredible 60X better energy efficiency than CPUs. This unlocks a new wave of AI services previously impossible due to latency limitations.



Model Number:
GPU Architecture: NVIDIA® Pascal™
Single-Precision Performance: 5.5 TeraFLOPS*
Integer Operations (INT8): 22 TOPS* (Tera-Operations per Second)
GPU Memory: 8 GB
Memory Bandwidth: 192 GB/s
System Interface: Low-Profile PCI Express Form Factor
Max Power:
Enhanced Programmability with Page Migration Engine
ECC Protection
Server-Optimized for Data Center Deployment
Hardware-Accelerated Video Engine: 1x Decode Engine, 2x Encode Engine
