OpenCL* for CPU
Ask questions and share information on Intel® SDK for OpenCL™ Applications and OpenCL™ implementations for Intel® CPU
Announcements
This forum covers OpenCL* for CPU only. OpenCL* for GPU questions can be asked in the GPU Compute Software forum. Intel® FPGA SDK for OpenCL™ questions can be ask in the FPGA Intel® High Level Design forum.
1663 Discussions

HD4000: Can it load 4 floats at one clock ?

zvivered
Beginner
166 Views

Hello,

I'm running OpenCL on Intel's HD4000 GPU.

It seems that this GPU has no vector calculations.

According to the query:

CL_DEVICE_PREFERRED_VECTOR_WIDTH_FLOAT

The vector width is 1. Which means that float4 calculations are executed as 4 float scalars.

But what about the size of the loaded vector from global memory ?

Can HD4000 load 4 floats (or even 8) at one clock ?

Can you send a document describing those features ?

I did not find the relevant document in google.

Thanks,

Z.V

0 Kudos
1 Reply
Raghupathi_M_Intel
165 Views

Hi Z.V.,

IVB GPU can actually execute 8 SP floats / cycle / EU. With MAD that's 2X so peak is 16 SP float ops / cycle / EU. I will try to find the public doc (if one is available) and post the link here.

Raghu

Reply