Hi all,
It turns out that one of the loops I was interested in offloading to the MIC was, according to the optimization report, both auto-vectorized and auto-parallelized.
What I want is for only this loop to be auto-parallelized.
According to -vec-report, all loops were vectorized and parallelized.
This is important to me because I want to measure the percentage reduction in my application's run time that comes from reducing the time spent in this loop alone.
How do I do this? Is there a way to do this?
Do you mean: is it OK to set #pragma novector on a test loop? Remember that MIC performance depends strongly on vectorization.
No,
I want vectorization and thread-level parallelization only for the one loop I am interested in, with default options for the rest of the application.
That's because I want to know how much faster my application is after optimizing only that loop.
Default options involve fairly aggressive vectorization. You might try adding the -no-vec option together with #pragma simd on the test loop. For selective parallelism, OpenMP is the likely answer.
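To make the suggestion concrete, here is a minimal sketch in C of what per-loop control might look like. It uses the OpenMP 4.0 combined construct `#pragma omp parallel for simd` in place of the Intel-specific `#pragma simd`, which is my substitution. The function names `hot_loop` and `checksum`, the arrays, and the loop body are hypothetical stand-ins for the real application. The assumption is that the file is built with OpenMP enabled and auto-vectorization switched off globally, so that only the annotated loop is threaded and vectorized.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical loop of interest: explicitly requests threading and SIMD.
 * If OpenMP is not enabled at compile time the pragma is ignored and the
 * loop simply runs serially, producing the same result. */
void hot_loop(float *restrict y, const float *restrict x, float a, int n)
{
    assert(x != NULL && y != NULL);

    #pragma omp parallel for simd
    for (int i = 0; i < n; ++i) {
        y[i] = a * x[i] + y[i];
    }
}

/* Hypothetical "rest of the application": no pragmas here, so whatever
 * the global compiler options say (for example, vectorization disabled)
 * applies to this loop. */
float checksum(const float *y, int n)
{
    float sum = 0.0f;
    for (int i = 0; i < n; ++i) {
        sum += y[i];
    }
    return sum;
}
```

Timing the application once with the pragma in place and once with it commented out should then isolate the contribution of that single loop, provided nothing else changes between the two builds.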
