Note
This type of analysis is supported by the Intel® VTune™ Amplifier XE only.
The following figure shows the basic steps required to analyze an application running on Intel® Xeon Phi™ coprocessors based on the Intel® Many Integrated Core Architecture (Intel® MIC Architecture) or to perform a system-wide analysis. You can run one of the predefined analysis types (Advanced Hotspots, Bandwidth, or General Exploration) or create a custom analysis type.
Prerequisites:
Build the target on the host with full optimizations, which is recommended for performance analysis.
When using an offload or cross compiler, make sure to manually install the binary utilities (Binutils) included in the Intel Xeon Phi installation zip package. For installation instructions, see the Intel Compiler documentation.
1. Install the sampling server and driver on the Intel Xeon Phi coprocessor card to be sampled.
3. Specify and configure your analysis target from the host system.
4. From the performance analysis tree in the Analysis Type window, choose and configure an analysis type. Click Start to run the analysis.
5. Intel® VTune™ Amplifier generates a data collection result and, by default, opens it in the default viewpoint. Switch between available viewpoints to identify code regions that took most of the CPU time and experienced potentially significant architectural bottlenecks.