OpenCL* for CPU
Ask questions and share information on Intel® SDK for OpenCL™ Applications and OpenCL™ implementations for Intel® CPU
Announcements
This forum covers OpenCL* for CPU only. OpenCL* for GPU questions can be asked in the GPU Compute Software forum. Intel® FPGA SDK for OpenCL™ questions can be ask in the FPGA Intel® High Level Design forum.
1663 Discussions

Auto-vectorization constraints...

janez-makovsek
New Contributor I
129 Views
Hi!

Currently auto-vectorization will become enabled only if WorkSize is a multiple of 4. Is it maybe possible to implement optimization which allows vectorization up to (WorkSize/4 * 4) and does only the remainder as scalar? (and/or only the begining thus removing also the constraint for 256 byte alignment).

Thanks!
Atmapuri
0 Kudos
0 Replies
Reply