Graphics
Intel® graphics drivers and software, compatibility, troubleshooting, performance, and optimization
22667 토론

bandwidth of different clImage format and efficient access pattern

chris_pn
초급자
4,394 조회수

Hi,

    Per OpenCL Developer Guide (as shown below), I find that max bandwidth of each sampler is 64 Byte/cyc:

https://software.intel.com/content/www/us/en/develop/documentation/iocl-opg/top/coding-for-the-intel-processor-graphics/memory-hierarchy.html

    My questions are:

1. Which clImage format with (efficient access pattern) can achieve max bandwidth?

2. Does different clImage format has a max texel/cyc when read by sampler? If yes, where can I find the data for common-used format?

3. What's efficient access pattern to read clImage by Sampler?

    Thanks!

레이블 (1)
0 포인트
1 솔루션
Dunni_A_Intel
중재자
4,137 조회수

Hi Chris,


Apologies for the delay in providing you with a timely response.


  1. The maximum bandwidth may only be achieved when all accesses hit the cache. In general, accesses to smaller image formats will yield better performance than accesses to larger image formats.
  2. Yes, performance varies with different image formats. However, I am unable to locate any public documents with specific performance numbers.
  3. 2D accesses patterns will be efficient based on how images are actually laid out in memory.


Please let me know if this answers your questions and if you have any additional questions.



원본 게시물의 솔루션 보기

0 포인트
7 응답
VidyalathaB_Intel
중재자
4,368 조회수

Hi Chris,

Thanks for reaching out to us

we are working on your issue. we will get back to you soon

Regards,

Vidya


0 포인트
chris_pn
초급자
4,353 조회수

Thanks Vidya. For clImage write operation, I also has the same question. So many thanks If you could also provide answer for clImage write operation. Thanks again!

0 포인트
chris_pn
초급자
4,286 조회수

Hi Vidya,

    Can you answer my questions now or could you tell me how long should I wait for the answer? Thanks.

0 포인트
chris_pn
초급자
4,236 조회수

Hi Vidya,

    Do you have any update? I'm looking forward to your answer. Thanks!

0 포인트
Dunni_A_Intel
중재자
4,138 조회수

Hi Chris,


Apologies for the delay in providing you with a timely response.


  1. The maximum bandwidth may only be achieved when all accesses hit the cache. In general, accesses to smaller image formats will yield better performance than accesses to larger image formats.
  2. Yes, performance varies with different image formats. However, I am unable to locate any public documents with specific performance numbers.
  3. 2D accesses patterns will be efficient based on how images are actually laid out in memory.


Please let me know if this answers your questions and if you have any additional questions.



0 포인트
chris_pn
초급자
4,093 조회수

Hi Dunni,

    Thanks for your answer. It can be better if there is detailed explaination and samples. Current answer related to clImage provided by Intel official document is too short and rough to understand. Image is also a important memory object of OpenCL for user. Thanks anyway.

0 포인트
Dunni_A_Intel
중재자
4,115 조회수

Hi Chris,


We will no longer be monitoring this thread. Please feel free to make a new post if you have additional questions.


0 포인트
응답