In the new era of AI and intelligent machines, Deep Learning is shaping our world like no other computing model in history. Interactive speech, visual search, and video recommendations are a few of many AI-based services that we use every day. Accuracy and responsiveness are key to user adoption for these services. As Deep Learning models increase in accuracy and complexity, CPUs are no longer capable of delivering a responsive user experience.
The NVIDIA® Tesla® P4 is powered by the revolutionary NVIDIA® Pascal™ architecture and purpose-built to boost efficiency for scale-out servers running deep learning workloads,enabling smart responsive AI-based services. It slashes inference latency by 15X in any hyperscale infrastructure and provides an incredible 60X better energy efficiency than CPUs.This unlocks a new wave of AI services previous impossible due to latency limitation.
NVIDIA Tesla P4
Integer Operations (INT8)
22 TOPS* (Tera-Operations per Second)
Low-Profile PCI Express Form Factor
Enhanced Programmability with Page Migration Engine
Server-Optimized for Data Center Deployment
Hardware-Accelerated Video Engine
1x Decode Engine, 2x Encode Engine