Support for the Intel® Xeon Phi™ Processor Codenamed Knights Landing (KNL) including General Exploration, Memory Access,HPC Performance Characterization analysis and PMU event reference.
Disk Input and Output analysis(Preview) that monitors utilization of the disk subsystem, CPU and PCIe buses, helps identify long latency of I/O requests and imbalance between I/O and compute operations.
Memory Access analysis improvements:
Automatic detection of maximum system DRAM bandwidth characteristics. This option helps understand how you utilize the available DRAM bandwidth.
Support for custom memory allocators via Memory Allocation API that help correctly determine memory objects
HPC workloads profiling improvements:
HPC Performance Characterization analysis that explores the following performance aspects of the application scalability: CPU utilization with parallel efficiency for MPI and OpenMP*, memory access
efficiency and FPU utilization with basic vectorization metrics
MPI analysis extended with the event-based sampling collection supported for multiple ranks per node with an arbitrary MPI launcher and natural syntax. Arbitrary targets command line configuration extended with MPI launcher options. You can now use the Copy Command Line to Clipboard dialog box to automatically generate a command line for MPI analysis from GUI.
An option enabling/disabling the OpenMP* regions analysis added to selected analysis configurations
An option controlling result finalization, finalization-mode, that enables you to perform a full finalization on the target, defer or skip the finalization. The deferred finalization mode is especially useful on target platforms with a single-thread performance lower than on the host. In this mode, the VTune™ Amplifier calculates a binary checksum to match the binaries for finalization on the host machine.
More languages support:
Python* applications profiling with Basic Hotspots analysis running via the Launch Application or Attach to Process mode
Go* applications profiling with hardware event-based analysis types
GPU analysis improvements:
GPU Hotspots analysis targeted for GPU-bound applications and providing options to analyze execution of OpenCL™ kernels and Intel Media SDK tasks
GPU analysis Summary introducing a set of metrics to estimate GPU utilization per engine, identify stalled or idle Execution Units and explore the most typical problems with low occupancy or frequent
Navigation from the Hottest GPU computing tasks summary to the details provided in the Graphics tab
Support for the Attach to Process target analysis for Intel® Media SDK and OpenCL™ programs
Detection of the OpenCL™ 2.0 Shared Virtual Memory (SVM) usage types per kernel instance
Support for the Attach to Process target analysis with the event-based sampling for low privileged Java* daemons on Linux*
Event selection mechanism for custom hardware event-based sampling analysis extended with filtering options
Arbitrary target GUI configuration to generate a command line for performance analysis on a system that is not accessible from the current host
UI improvements for the grid views and identification of performance issues
Intel Performance Snapshot (Preview) introducing the following tools as part of the VTune™ Amplifier:
Application Performance Snapshot tool provides a quick look at your application performance and helps you understand whether your application will benefit from tuning. It identifies how effectively your
application uses the hardware platform and displays basic performance enhancement opportunities.
Storage Performance Snapshot tool analyzes your system's storage, CPU, memory, and network usage and displays basic performance enhancement opportunities for systems using Intel hardware.
Support for Windows 10 RS1 (Redstone) and Windows Server 2016
Support for Microsoft Visual Studio* 2015 Update 2