Intel® Integrated Performance Primitives

IPP's OpenMP in a multi-threaded performance app

cratiuixiacom_com

Hello,

I have an issue with the threads created by IPP's OpenMP implementation. My app has 4 worker threads, each running on its own core. Each worker thread does heavy-lifting work involving FFTs. The problem is that IPP creates 4 additional threads that compete with my threads for CPU. Looking through the documentation, I found the OMP_NUM_THREADS environment variable and the ippSetNumThreads function. I've tried setting both to 1, but neither seems to work: the threads are still created.

I link with the dynamic version of IPP and the part of the app with the worker threads is loaded with dlopen.

Some more info: when I reduce the number of my worker threads to 2, everything seems fine: 2 more IPP threads are created from libguile.so, but each thread gets a core of its own and everyone is happy. Still, I'd like to have some control and see what happens if the FFTs are not parallelized and everything runs on the same core, serially.

Thanks,

Cosmin.

Vladimir_Dudnik
Employee
After you call ippSetNumThreads(1), IPP will use only a single thread. This is easy to check by looking at the performance of any threaded IPP function.
Regards,
Vladimir

cratiuixiacom_com

Thanks. Meanwhile, I discovered that I wasn't actually calling that function before the first chunk of work caused the threads to spawn.
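For anyone hitting the same ordering problem, here is a minimal sketch of one way to guarantee that the thread limit is applied before any worker does its first chunk of work. It uses POSIX pthread_once so the limit runs exactly once, whichever worker gets there first. Everything named here is hypothetical and only for illustration: limit_library_threads, do_fft_chunk, run_workers and the g_library_threads counter are stand-ins so the sketch compiles without IPP. In a real build, the body of limit_library_threads would be the ippSetNumThreads(1) call discussed above.

```c
#include <assert.h>
#include <pthread.h>

#define MAX_WORKERS 16

/* Hypothetical stand-in for the library's thread setting. In a real IPP
   build, limit_library_threads() would call ippSetNumThreads(1) instead. */
static int g_library_threads = 4;            /* pretend default: one per core */

static void limit_library_threads(void)
{
    g_library_threads = 1;
}

static pthread_once_t g_limit_once = PTHREAD_ONCE_INIT;

/* Stand-in for one chunk of FFT work. Returns the thread count the
   library would use for this chunk. */
static int do_fft_chunk(void)
{
    /* Runs limit_library_threads() exactly once, before any work is done. */
    pthread_once(&g_limit_once, limit_library_threads);
    assert(g_library_threads == 1);
    return g_library_threads;
}

static void *worker(void *arg)
{
    *(int *)arg = do_fft_chunk();
    return NULL;
}

/* Starts up to MAX_WORKERS workers and returns how many of them saw the
   single-thread setting during their first chunk of work. */
int run_workers(int n)
{
    pthread_t tid[MAX_WORKERS];
    int seen[MAX_WORKERS];
    int created = 0, ok = 0;

    if (n > MAX_WORKERS) n = MAX_WORKERS;
    for (int i = 0; i < n; i++) {
        if (pthread_create(&tid[created], NULL, worker, &seen[created]) != 0)
            break;
        created++;
    }
    for (int i = 0; i < created; i++) {
        pthread_join(tid[i], NULL);
        if (seen[i] == 1) ok++;
    }
    return ok;
}
```

With this arrangement, run_workers(4) should report that all 4 workers saw the single-thread setting, regardless of which one started first. Since the worker code in the original setup is loaded with dlopen, an alternative would be to make the call once in the loading code, right after dlopen and before any worker thread is started.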
