Please give us more details regarding your use-case.
Regarding support of SIMD if the hardware supports then subgroup is the way to achieve SIMD, as subgroups map to SIMD hardware.
Thanks. I have found some documents to introduce the SIMD of oneAPI. But I still have some questions, could you please help to clarify, Thanks.
- Does oneAPI try to enable SIMD as possible since the SIMD width is 16 when I run the vtune to diagnose the GPU performance even I don't write any subgroup code?
- How to specify the SIMD width manually?
- What are the possible reasons to stop a higher SIMD width? the number of EU registers?