
How to analyze Intel® Xeon Phi™ coprocessor applications using Intel® VTune™ Amplifier XE 2015


Introduction

 

Intel® VTune™ Amplifier XE 2015 includes new capabilities for analyzing Intel® Xeon Phi™ coprocessor applications. This article steps through such an analysis on an Intel® Xeon Phi™ coprocessor and outlines some of the new capabilities.

 

Compiling and running the application

The application used here is one of the samples included with VTune Amplifier. It is located at /opt/intel/vtune_amplifier_xe_2015/samples/en/C++/matrix_vtune_amp_xe.tgz. To build the application on Linux*:

  1. First source the environment for the Intel® Compiler you are using.
    1. source /opt/intel/compiler_xe_2015/compilervars.sh intel64
  2. Untar the sample in a directory where you have write permission
    1. tar xvzf matrix_vtune_amp_xe.tgz
  3. By default the sample does not use OpenMP*. You will need to modify the Makefile
    1. cd matrix/linux
    2. Edit the Makefile
    3. Comment the default PARAMODEL and uncomment the OpenMP PARAMODEL.
  4. Build the application to run native on the Intel® Xeon Phi™ coprocessor
    1. cd matrix/linux
    2. make mic
  5. The make command from step #4 creates a native Intel Xeon Phi executable, matrix.mic, and copies it to mic0:/tmp.
  6. Verify the libiomp5.so library is available on your Intel Xeon Phi coprocessor.
  7. Run the application
    1. /tmp/matrix.mic
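Steps 6 and 7 can also be driven from the host over ssh. The following is a minimal sketch, not part of the sample: the function names, the DRY_RUN switch, and the /lib64 library location are assumptions for illustration, and mic0 is taken from the copy destination in step 5. Adjust them to match your system.

```shell
#!/bin/sh
# Sketch: check for libiomp5.so on the coprocessor, then run the sample.
# CARD and LIB_DIR are assumptions; change them to match your setup.
CARD=${CARD:-mic0}
LIB_DIR=${LIB_DIR:-/lib64}
DRY_RUN=${DRY_RUN:-0}

# Run a command on the card over ssh, or only print it when DRY_RUN=1.
on_card() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "ssh $CARD $*"
    else
        ssh "$CARD" "$@"
    fi
}

# Run the sample only if the OpenMP runtime library is present on the card.
check_and_run() {
    if on_card ls "$LIB_DIR/libiomp5.so"; then
        on_card /tmp/matrix.mic
    else
        echo "libiomp5.so not found in $LIB_DIR on $CARD" >&2
        return 1
    fi
}
```

Call check_and_run after sourcing the script. With DRY_RUN=1 it only prints the ssh commands, which is a convenient way to review them before touching the card. If the library is missing, copy it to the card from the MIC library directory of your Intel compiler installation.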

Running /tmp/matrix.mic prints output similar to the following:

Addr of buf1 = 0x7fec2b054010
Offs of buf1 = 0x7fec2b054180
Addr of buf2 = 0x7fec23fd3010
Offs of buf2 = 0x7fec23fd31c0
Addr of buf3 = 0x7fec1cf52010
Offs of buf3 = 0x7fec1cf52100
Addr of buf4 = 0x7fec15ed1010
Offs of buf4 = 0x7fec15ed1140
Threads #: 240 OpenMP threads
Matrix size: 3840
Using multiply kernel: multiply1
Freq = 1.090908 GHz
Execution time = 23.866 seconds
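When comparing runs, for example before and after a tuning change, it can help to pull just the execution time out of this output. A minimal sketch, assuming the "Execution time = N seconds" line format shown above; the function name exec_time is hypothetical:

```shell
#!/bin/sh
# Print the numeric value from the "Execution time = N seconds" line
# read on standard input. Prints nothing if no such line is present.
exec_time() {
    awk '/^Execution time =/ { print $4 }'
}
```

One possible use from the host is ssh mic0 /tmp/matrix.mic | exec_time, which prints only the number of seconds.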

Running the application using VTune Amplifier

  1. Set up the VTune Amplifier environment
    1. source /opt/intel/vtune_amplifier_xe_2015/amplxe-vars.sh
  2. Start the VTune Amplifier GUI
    1. amplxe-gui
  3. Create a VTune Amplifier project
    1. File->New->Project
    2. The “Target System” pull-down menu contains several new options.
  4. Select the menu item Intel Xeon Phi coprocessor (native)
  5. Select the "Launch Application" menu item
  6. Specify the application name /tmp/matrix.mic
    1. Note: This application is located on the Intel Xeon Phi coprocessor's file system.
  7. Click OK
  8. To analyze your application
    1. Click on "New Analysis"
    2. Click on "Advanced Hotspots"
    3. Click Start
    4. VTune Amplifier will launch the application and then finalize the result

Summary

VTune Amplifier XE 2015 adds significant improvements for analyzing Intel Xeon Phi coprocessor applications. This article explained how to launch a native application under VTune Amplifier using the new GUI options in the 2015 release. You can analyze offloaded applications in much the same way by selecting the “Intel Xeon Phi coprocessor (host launch)” item in the “Target System” menu. The 2015 release also changes the command-line interface, amplxe-cl; look for a separate article covering those changes.

 
