Quantcast
Viewing all articles
Browse latest Browse all 1853

Getting Started with Intel® VTune™ Amplifier 2016 for Systems

Intel® VTune™ Amplifier for Systems Performance Profiler with Intel® Energy Profiler helps you speed up and optimize execution of your code on Linux* embedded platforms, Android*, or Windows* systems providing the following types of analysis:

  • Performance analysis: Find serial and parallel code bottlenecks, analyze algorithm choices, GPU engines usage, and understand where and how your application can benefit from available hardware resources.

  • Intel Energy Profiler analysis: Analyze power events and identify those that waste energy.

VTune Amplifier for Systems is available as part of the Intel System Studio.

Key Features

ALGORITHM ANALYSIS

  • Run Basic Hotspots analysis type to understand application flow and identify sections of code that get a lot of execution time (hotspots).

    See the tutorial for Linux host - Android target | Windows host - Android target

  • Run Concurrency analysis to estimate parallelization in your code and understand how effectively your application uses available cores.

  • Run Locks and Waits analysis to identify synchronization objects preventing effective utilization of processor resources.

  • Use the algorithm Advanced Hotspots analysis to extend Basic Hotspots analysis by collecting call stacks, context switches and analyze CPI (Cycles Per Instructions) metric.

    See the tutorial for Linux host - embedded Linux target system

Image may be NSFW.
Clik here to view.
Basic Hotspots Analysis

MICROARCHITECTURE ANALYSIS

  • Run General Exploration analysis to triage hardware issues in your application. This type collects a complete list of events for analyzing a typical client application.

  • Narrow down your hardware analysis by focusing on specific hardware issues, such as ineffective memory accesses, low bandwidth, and so on. VTune Amplifier analyzes your code, identifies issues and provides recommendations.

  • Use Memory Access analysis to identify memory-related issues, like NUMA problems and bandwidth-limited accesses, and attribute performance events to memory objects (data structures), which is provided due to instrumentation of memory allocations/de-allocations and getting static/global variables from symbol information.

  • For systems with Intel Software Guard Extensions (Intel SGX) feature enabled, run SGX Hotspots analysis to identify performance-critical program units inside security enclaves. This analysis type uses the INST_RETIRED.PREC_DIST hardware event that emulates precise clockticks which is mandatory for the analysis on the systems with the Intel SGX enabled.

Image may be NSFW.
Clik here to view.
General Exploration Analysis

PLATFORM ANALYSIS

  • Run System Overview analysis to review general behavior of a target Linux* or Android* system and correlate power and performance metrics with the interrupt request (IRQ).

  • Collect data on Ftrace* events on Android and Linux targets and Atrace* events on Android targets.

  • Run CPU/GPU Concurrency analysis to identify code regions where your application is CPU or GPU bound.

  • For GPU-bound applications running on Intel HD Graphics, collect GPU hardware events to estimate how effectively the Processor Graphics are used.

  • Analyze hot Intel® Media SDK programs and OpenCL™ kernels running on a GPU. For OpenCL application analysis, use the Architecture Diagram to explore GPU hardware metrics per GPU architecture blocks.

Image may be NSFW.
Clik here to view.
GPU Analysis

SOURCE ANALYSIS

  • Double click a hotspot function to drill down to the source code and analyze performance per source line or assembler instruction. By default, the hottest line is highlighted.

  • For help on an assembly instruction, right-click the instruction in the Assembly pane and select Instruction Reference from the context menu.

Image may be NSFW.
Clik here to view.
Source View

CUSTOM ANALYSIS

  • Select Custom Analysis branch in the analysis tree to create your own analysis configurations using any of the available VTune Amplifier data collectors.

  • Run your own custom collector from the VTune Amplifier to get the aggregated performance data, from your custom collection and VTune Amplifier analysis, in the same result.

  • Import performance data collected by your own or third-party collector into the VTune Amplifier result collected in parallel with your external collection. Use the Import from CSV button to integrate the external data to the *.amplxe result.

  • Collect data from a remote virtual machine by configuring KVM guest OS profiling, which makes use of the Linux Perf KVM feature. Select Analyze KVM guest OS from the Advanced options.

Image may be NSFW.
Clik here to view.
Custom Analysis

ENERGY ANALYSIS

Use the Intel Energy Profiler to investigate high energy consumption on a platform with its summaries of CPU and GPU state residency, CPU and GPU processor frequency use, and correlated timelines of metrics that reflect platform energy use, such as thermals and bandwidth. The CPU sleep state timeline is annotated with wakeup reasons.

See the tutorial for Linux host - Android target | Windows host - Android target | Windows host - Windows target.

Image may be NSFW.
Clik here to view.

For the detailed list of product features, see Intel VTune Amplifier Help:Windows Host | Linux Host.

Remote Collection Modes

You can collect data on your Linux, Windows, or Android system using any of the following modes:

  • (Linux and Android targets) Remote analysis via SSH/ADB communication with VTune Amplifier graphical and command line interface (amplxe-cl) installed on the host and VTune Amplifier target package installed on the remote target system. Recommended for resource-constrained embedded platforms (with insufficient disk space, memory, or CPU power).

    See the tutorial for Linux host - Android target | Windows host - Android target | Linux host - embedded Linux target system

  • (Android targets) Disconnected analysis via SSH/ADB communication with VTune Amplifier installed on the host and the VTune Amplifier target package installed on the remote Android system. The analysis is initiated from the host system, but data collection does not begin until the device is unplugged from the host system. The results are finalized after the device is reconnected to the host system.

  • (Linux and Windows targets) Native performance analysis with the VTune Amplifier graphical or command line interface installed on the target system. Analysis is started directly on the target system.

  • (Linux and Windows targets) Native hardware event-based sampling analysis with the VTune Amplifier's Sampling Enabling Product (SEP) installed on the target embedded system.

Legal Information

Intel, VTune and the Intel logo are trademarks of Intel Corporation in the U.S. and/or other countries.

* Other names and brands may be claimed as the property of others.

Microsoft, Windows, and the Windows logo are trademarks, or registered trademarks of Microsoft Corporation in the United States and/or other countries.

Copyright © 2016, Intel Corporation. All rights reserved.


Viewing all articles
Browse latest Browse all 1853

Trending Articles



<script src="https://jsc.adskeeper.com/r/s/rssing.com.1596347.js" async> </script>