- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Has anyone experimented with the workshare directive on whole array operations and ifort? I played a bit with this on gfortran, but found that it wasn't giving me any speedup, and I really need to resort to parallel do/end do loops to get a boost out of openmp.
Link Copied
3 Replies
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Quoting - grs2103@columbia.edu
Has anyone experimented with the workshare directive on whole array operations and ifort? I played a bit with this on gfortran, but found that it wasn't giving me any speedup, and I really need to resort to parallel do/end do loops to get a boost out of openmp.
You might note that you need a recent gfortran with options -mtune=barcelona -msse4 to get full benefit of vectorization on recent CPUs. On some of the older CPUs, there is less benefit from combining vectorization and OpenMP, but a more common reason why people forgo vectorization is to be able to start from a lower base when bragging about parallel performance scaling.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Quoting - grs2103@columbia.edu
Has anyone experimented with the workshare directive on whole array operations and ifort? I played a bit with this on gfortran, but found that it wasn't giving me any speedup, and I really need to resort to parallel do/end do loops to get a boost out of openmp.
Currently with Intel Fortran, the OpenMP WORKSHARE directive is implemented with a SINGLE construct, and so no parallel code is generated. Weare consideringa threaded implementation limited to parallelizing a single FORALL construct, single WHERE construct, or a single block of F90 array assignments.
For reference, we are tracking this as compilerfeature requestDPD200045053.
Patrick Kennedy
Intel Compiler Lab
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Certain OpenMP* WORKSHARE constructs now parallelize with Intel® Fortran Compiler 15.0. Our implementation is described here.
Patrick

Reply
Topic Options
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page