GPU Compute Software
Ask questions about Intel® Graphics Compute software technologies, such as OpenCL* GPU driver and oneAPI Level Zero
360 Discussions

Is it possible to force compiler to map one thread instead of eight threads per VXE?

pauljurczak
Beginner
424 Views

"oneAPI GPU Optimization Guide" reads: 

 

"Each VE is a multi-threaded SIMD processor. The compiler generates SIMD code to map several work-items to be executed simultaneously within a given hardware thread. The SIMD-width for a kernel is a heuristic driven compiler choice. Common SIMD-width examples are SIMD-8, SIMD-16, and SIMD-32."

 

In case of a highly divergent kernel, is it possible to force the compiler to SIMD-1, i.e. only one work item using the whole XVE SIMD16 engine at a time? The reason would be to avoid exponential overhead when each work item's control flow is different.

Labels (1)
0 Kudos
0 Replies
Reply