Application Acceleration With FPGAs
Programmable Acceleration Cards (PACs), DCP, DLA, Software Stack, and Reference Designs
Announcements
The Intel sign-in experience has changed to support enhanced security controls. If you sign in, click here for more information.
446 Discussions

What might be wrong when we increase the number of computing units of the kernel (OpenCL implementation) and the execution time increases instead of decreases?!

sraha
Beginner
726 Views
 
0 Kudos
1 Reply
HRZ
Valued Contributor III
253 Views

When using Compute Unit Replication, each compute unit will have its own set of memory ports. This results in a high amount of contention on the memory bus and if there are too many ports going to the memory interface, then it is very much possible that performance will start degrading.

Reply