To access this pane:
Click the New Analysis button on the Intel® VTune™ Amplifier toolbar.
The New Amplifier Result tab opens with the Analysis Type window active.
Select the Algorithm Analysis > Basic Hotspots analysis type from the analysis tree on the left pane.
The Basic Hotspots pane opens on the right.
Use this pane to explore and edit the Basic Hotspots analysis type configuration. The Basic Hotspots analysis type uses the user-mode sampling and tracing collection to analyze where your application is spending time.
Use This | To Do This |
---|---|
CPU sampling interval, ms spin box | Specify an interval between CPU samples. |
Analyze user tasks check box | Analyze tasks in your code specified via the Task API. |
Analyze Processor Graphics hardware events drop-down menu | Analyze performance data from Processor Graphics based on the predefined groups of GPU metrics. This option is available only for supported platforms with the Intel® graphics driver installed. |
Analyze DirectX pipeline events check box | Analyze GPU usage based on the data provided by DirectX* and identify whether your application is GPU or CPU bound. |
Trace OpenCL kernels on Processor Graphics check box | Capture the execution time of OpenCL™ kernels on a GPU, identify performance-critical GPU computing tasks, and analyze the performance of OpenCL kernels per GPU hardware metrics. |
Details button | Expand/collapse a section listing the default non-editable settings used for this analysis type. If you want to modify the settings for the Basic Hotspots analysis, you may right-click the Basic Hotspots entry in the analysis tree and select Duplicate from Existing Configuration from the context menu. VTune Amplifier creates an editable copy of this analysis type configuration and locates it under the Custom Analysis branch in the analysis tree. |
The Details section provides information on the following collection settings used for the Basic Hotspots analysis:
Use This Option | To Do This | Default Basic Hotspots Value |
---|---|---|
CPU sampling interval, ms | Set the interval between collected CPU samples in milliseconds. | 10 |
Collect highly accurate CPU time | Obtain more accurate CPU time data. This option causes more runtime overhead and increases result size. Administrator privileges are required. | Yes |
Collect CPU sampling data | Enable sampling and include stack unwinding, that is, the respective result windows and panes contain information about function call stacks. | With stacks |
Collect signalling API data | Identify synchronization transitions in the timeline and signaling call stacks for associated waits. The collector instruments signaling APIs, which causes higher overhead and increases result size. The specified option value enables stack unwinding for signaling calls, that is, the respective result windows and panes contain information about calling sequences for signaling calls. | No |
Collect synchronization API data | Identify where threads are waiting, or compute thread concurrency. The collector instruments APIs, which causes higher overhead and increases result size. The specified option value enables stack unwinding for synchronization wait calls, that is, the respective result windows and panes contain information about calling sequences for synchronization wait calls. | No |
Collect I/O API data | Identify where threads are waiting, or compute thread concurrency. The collector instruments APIs, which causes higher overhead and increases result size. The specified option value enables stack unwinding for I/O calls, that is, the respective result windows and panes contain information about calling sequences for I/O calls. | No |
Analyze user tasks | Analyze tasks in your code specified via Task API. This option causes higher overhead and increases result size. | No |
Analyze Intel runtimes and user synchronization | Analyze thread synchronization by profiling User synchronization API used by Intel runtimes like OpenMP* and Intel® Threading Building Blocks (Intel TBB) or by the user. This option causes higher overhead and increases result size. | No |
Analyze Processor Graphics hardware events | Analyze performance data from Processor Graphics based on the predefined groups of GPU metrics. This option is available only for supported platforms with the Intel® graphics driver installed. | None |
GPU sampling interval, us | Specify an interval between GPU samples in microseconds. | 1000 |
Analyze DirectX pipeline events | Analyze GPU usage and frame rate based on the data provided by DirectX* and identify whether your application is GPU or CPU bound. | No |
Trace OpenCL kernels on Processor Graphics | Capture the execution time of OpenCL kernels on a GPU, identify performance-critical GPU computing tasks, and analyze the performance of OpenCL kernels per GPU hardware metrics. | No |
Stack unwinding mode | Enable stack unwinding after collection finishes (offline mode). Offline mode reduces analysis overhead and is typically recommended. | After collection |
Stitch stacks | For applications using Intel Threading Building Blocks (Intel TBB) or OpenMP* with Intel runtime libraries, restructure the call flow to attach stacks to a point introducing a parallel workload. | Yes |
Collect timeline data | Enable collecting and retaining overhead data to display the Timeline pane. This mode increases result size. | Yes |
Collect frequency data | Collect data about processor frequency changes. This type of data collection is supported only on Linux* systems based on Intel Xeon® processors. | No |
Collect sleep data | Analyze when and what causes the hardware to wake up from a sleep state. This type of data collection is supported only on Linux* systems based on Intel Xeon processors. | No |
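
To get a feel for what the default sampling intervals in the table mean in practice, the following Python sketch estimates how many samples a run might produce. It is a rough illustration only: the function name `expected_samples` is hypothetical, and the model (one sample per interval per continuously busy thread) is a simplifying assumption, not a description of how the collector actually schedules samples.

```python
# Default intervals taken from the table above, expressed in microseconds.
CPU_INTERVAL_US = 10 * 1000  # CPU sampling interval: 10 ms
GPU_INTERVAL_US = 1000       # GPU sampling interval: 1000 us


def expected_samples(duration_s: int, interval_us: int, busy_threads: int = 1) -> int:
    """Estimate the sample count for a run (hypothetical helper).

    Assumes one sample per interval for each thread that is busy
    for the whole run; real counts will differ.
    """
    if interval_us <= 0:
        raise ValueError("interval_us must be positive")
    duration_us = duration_s * 1_000_000
    return (duration_us // interval_us) * busy_threads


# A 5-second run with one busy thread at the default CPU interval.
print(expected_samples(5, CPU_INTERVAL_US))
# The same run with four busy threads.
print(expected_samples(5, CPU_INTERVAL_US, busy_threads=4))
# GPU samples over the same 5 seconds at the default GPU interval.
print(expected_samples(5, GPU_INTERVAL_US))
```

Under this model, a shorter interval gives finer resolution for short-lived functions, at the cost of more samples and therefore higher overhead and a larger result.
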
Note
You may copy the command line for this configuration using the Command Line... button at the bottom and run this analysis remotely.
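
If you script remote runs, a small helper can assemble such a command line from a set of options. The Python sketch below is a minimal illustration, not the exact output of the Command Line... button: the tool name `amplxe-cl`, the `-collect hotspots` analysis name, the `-knob` syntax, and the `sampling-interval` knob name are assumptions here, and `build_command` and `./my_app` are hypothetical. Treat the string copied from the GUI as authoritative.

```python
import shlex


def build_command(app, knobs, tool="amplxe-cl", analysis="hotspots"):
    """Assemble an argument list for a collection run (hypothetical helper).

    app   -- target application and its arguments, as a list
    knobs -- mapping of knob name to value; names are assumptions and
             should be checked against the command line copied from the GUI
    """
    args = [tool, "-collect", analysis]
    for name, value in knobs.items():
        if isinstance(value, bool):
            # Render Python booleans in lowercase form.
            value = "true" if value else "false"
        args += ["-knob", f"{name}={value}"]
    # "--" separates the tool's options from the target application.
    args += ["--", *app]
    return args


cmd = build_command(
    ["./my_app", "--input", "data file.txt"],
    {"sampling-interval": 10},  # assumed knob name, default value from the table
)
# shlex.join quotes arguments that contain spaces, so the line is safe to paste.
print(shlex.join(cmd))
```

Building the command as a list and quoting it only when printing avoids quoting mistakes when the target application takes arguments that contain spaces.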
See Also
Supplemental documentation specific to a particular Intel Studio may be available at <install-dir>\<studio>\documentation\.