OpenCL* for CPU
Ask questions and share information on Intel® SDK for OpenCL™ Applications and OpenCL™ implementations for Intel® CPU.
Announcements
This forum covers OpenCL* for CPU only. OpenCL* for GPU questions can be asked in the GPU Compute Software forum. Intel® FPGA SDK for OpenCL™ questions can be asked in the FPGA Intel® High Level Design forum.

More flexibility in kernel analysis (Code Builder, local sizes)

Marc_M_
Beginner
347 Views

Hi folks,

I'm having a problem with work-group size definitions during a kernel analysis session. I'd like to know whether it is possible to benchmark every possible combination of local work sizes.

For example, to test the combinations of the local sizes 1, 25, 50 and 100, I enter these values:

sizes.PNG

And I get:

combinations.PNG

There is no analysis for (25,25), for example, or for (50,100). How should I enter the values? Is this possible?

Thanks!

2 Replies
Arik_Z_Intel1
Employee

Hi Marc,

If the product of the local sizes exceeds CL_KERNEL_WORK_GROUP_SIZE, then that configuration is omitted.

For example, 25 x 25 = 625 and 50 x 100 = 5000 are both larger than 512, which is the CL_KERNEL_WORK_GROUP_SIZE on the GPU.

On the CPU it is 8192, so those configurations should appear.

You can check the spec at https://www.khronos.org/registry/cl/specs/opencl-2.0.pdf, page 222.
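To make the rule above concrete, here is a minimal sketch in Python of the filtering as described in this reply. It is not Code Builder's actual implementation: the function name `valid_local_sizes` is hypothetical, and the limits 512 and 8192 are simply the values quoted in this thread. On a real device the limit would be queried with `clGetKernelWorkGroupInfo` and `CL_KERNEL_WORK_GROUP_SIZE`.

```python
from itertools import product
from math import prod


def valid_local_sizes(candidates, max_work_group_size, dims=2):
    """Return every local-size tuple whose total work-item count fits the limit.

    candidates: per-dimension local sizes to combine (e.g. [1, 25, 50, 100]).
    max_work_group_size: assumed value of CL_KERNEL_WORK_GROUP_SIZE.
    dims: number of NDRange dimensions (2 in the thread's example).
    """
    kept = []
    for combo in product(candidates, repeat=dims):
        # A work-group holds prod(combo) work-items; drop it if over the limit.
        if prod(combo) <= max_work_group_size:
            kept.append(combo)
    return kept


candidates = [1, 25, 50, 100]
gpu_combos = valid_local_sizes(candidates, 512)   # limit quoted for the GPU
cpu_combos = valid_local_sizes(candidates, 8192)  # limit quoted for the CPU

print((25, 25) in gpu_combos)  # False: 625 > 512
print((25, 25) in cpu_combos)  # True: 625 <= 8192
```

Under these assumptions, only combinations that include a 1 survive with the GPU limit, while with the CPU limit every pair except (100,100) is kept. A real tool may apply further constraints that this sketch ignores.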

Are you running on the CPU or GPU?

Marc_M_
Beginner

Thank you for your explanation, Arik. Now I understand why those combinations don't appear.

This was run on the GPU. On the CPU, (25,25) appears as expected.

Thanks again!
