- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
On my system, running a code parallelised with OpenMP and setting OMP_NUM_THREADS=2, OMP_DYNAMIC=false, OMP_PROC_BIND=true, we observe 10 threads, as per screenshot. Please can you explain why?
And what is "orted", "ucs_async_thread_func", "progress_engine" and "listen_thread" that we observe. DLPOLY.Z is the name of our executable which contains MPI calls so was compiled with mpif90 but we are (implicitly) using only 1 MPI process).
We launch our collection:
vtune -collect hotspots --app-working-dir=$(pwd) -- ${EXE_PATH}/DLPOLY.Z
Yours, Michael @mkbane_hec
Link Copied
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hi,
Good day to you.
Thanks for posting in Intel Communities.
Could you please share the following details so that we could investigate further:
1. Sample reproducer code.
2. VTune version.
3. Exact steps and the commands used.
4. OS and Processor details.
Regards,
Sreedevi
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hi,
We haven't heard back from you. Could you please confirm if your issue is resolved or not?
Regards,
Sreedevi
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hi,
Good day to you.
We have not heard back from you. This thread will no longer be monitored by Intel. If you need further assistance, please post a new question.
Regards,
Sreedevi
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page