- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hi, there is a tool or a Visual Studio plugin that give me how many SSE/AVX instructions there are in my code? How many SIMD Instruction I've performed? ecc. ecc.
- Tags:
- Intel® Advanced Vector Extensions (Intel® AVX)
- Intel® Streaming SIMD Extensions
- Parallel Computing
Link Copied
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
One of the best tools for profiling SSE/AVX code is Intel VTune. You can also look at disassembled code and check for AVX machine code instructions.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
@Innocenzo,
Have a look at this Intel presentation: http://www.gdcvault.com/play/1014645
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Can you read this paper: https://software.intel.com/sites/default/files/article/394181/using-intel-vtune-amplifier-xe-on-4th-generation-intel-core-processors.pdf
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
The floating-point arithmetic counters are disabled on Haswell processors -- perhaps because of accuracy problems with these events on Sandy Bridge and Ivy Bridge cores. One of these days I will find a Broadwell or Skylake system to check the new floating-point counters that have been added on those processors.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Theoretically you can count AVX machine code instructions which are executed and that way you asses how many GFLOPs were achieved. Of course this will be crude approximation of real result.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Yes I know that. For simple loops like accumulation of variable it will work, but for complex nested loops with embedded control statement it will be hard and not accurate task.
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page