<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic gemm sample program is black out in OpenCL* for CPU</title>
    <link>https://community.intel.com/t5/OpenCL-for-CPU/gemm-sample-program-is-black-out/m-p/1053637#M4088</link>
    <description>&lt;P&gt;I tried General Matrix Multiply (GEMM) sample (https://software.intel.com/en-us/articles/gemm) on my pc.&lt;/P&gt;

&lt;P&gt;But, if the matrix size is more than 2048 x 2048 program stops with black out.&lt;/P&gt;

&lt;P&gt;Black out back in a few seconds and .&lt;/P&gt;

&lt;P&gt;I think program stops at EnqueNDRange.&lt;/P&gt;

&lt;P&gt;&lt;SPAN class="short_text" id="result_box" lang="en"&gt;&lt;SPAN class="hps"&gt;According to&lt;/SPAN&gt; &lt;SPAN class="hps"&gt;my calculations&lt;/SPAN&gt;&lt;/SPAN&gt;, there is no problem on the memory size, work item size, work group size, etc...&lt;/P&gt;

&lt;P&gt;pc specifications&lt;/P&gt;

&lt;P&gt;Core i5-4440&lt;/P&gt;

&lt;P&gt;memory 4GB&lt;/P&gt;

&lt;P&gt;Windows 7 64bit&lt;/P&gt;

&lt;P&gt;Why can I increase matrix size?&lt;/P&gt;</description>
    <pubDate>Sun, 15 Mar 2015 02:15:49 GMT</pubDate>
    <dc:creator>naoki_o_</dc:creator>
    <dc:date>2015-03-15T02:15:49Z</dc:date>
    <item>
      <title>gemm sample program is black out</title>
      <link>https://community.intel.com/t5/OpenCL-for-CPU/gemm-sample-program-is-black-out/m-p/1053637#M4088</link>
      <description>&lt;P&gt;I tried General Matrix Multiply (GEMM) sample (https://software.intel.com/en-us/articles/gemm) on my pc.&lt;/P&gt;

&lt;P&gt;But, if the matrix size is more than 2048 x 2048 program stops with black out.&lt;/P&gt;

&lt;P&gt;Black out back in a few seconds and .&lt;/P&gt;

&lt;P&gt;I think program stops at EnqueNDRange.&lt;/P&gt;

&lt;P&gt;&lt;SPAN class="short_text" id="result_box" lang="en"&gt;&lt;SPAN class="hps"&gt;According to&lt;/SPAN&gt; &lt;SPAN class="hps"&gt;my calculations&lt;/SPAN&gt;&lt;/SPAN&gt;, there is no problem on the memory size, work item size, work group size, etc...&lt;/P&gt;

&lt;P&gt;pc specifications&lt;/P&gt;

&lt;P&gt;Core i5-4440&lt;/P&gt;

&lt;P&gt;memory 4GB&lt;/P&gt;

&lt;P&gt;Windows 7 64bit&lt;/P&gt;

&lt;P&gt;Why can I increase matrix size?&lt;/P&gt;</description>
      <pubDate>Sun, 15 Mar 2015 02:15:49 GMT</pubDate>
      <guid>https://community.intel.com/t5/OpenCL-for-CPU/gemm-sample-program-is-black-out/m-p/1053637#M4088</guid>
      <dc:creator>naoki_o_</dc:creator>
      <dc:date>2015-03-15T02:15:49Z</dc:date>
    </item>
    <item>
      <title>Hi Naoki,</title>
      <link>https://community.intel.com/t5/OpenCL-for-CPU/gemm-sample-program-is-black-out/m-p/1053638#M4089</link>
      <description>&lt;P&gt;Hi Naoki,&lt;/P&gt;

&lt;P&gt;Could you please specify the following:&lt;/P&gt;

&lt;P&gt;1. The version of the graphics driver you are using&lt;/P&gt;

&lt;P&gt;2. the exact parameters you feed to the program when you experience the blackout&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;

&lt;P&gt;Thanks!&lt;/P&gt;</description>
      <pubDate>Mon, 16 Mar 2015 17:03:55 GMT</pubDate>
      <guid>https://community.intel.com/t5/OpenCL-for-CPU/gemm-sample-program-is-black-out/m-p/1053638#M4089</guid>
      <dc:creator>Robert_I_Intel</dc:creator>
      <dc:date>2015-03-16T17:03:55Z</dc:date>
    </item>
    <item>
      <title>Actually our GEMM sample is</title>
      <link>https://community.intel.com/t5/OpenCL-for-CPU/gemm-sample-program-is-black-out/m-p/1053639#M4090</link>
      <description>&lt;P&gt;Actually our GEMM sample is not fully optimized for our GPU. The kernel nn/nt and default tiling parameters are CPU-optimized, thus the execution of the default matrix size (3968x3968) with default tiling parameters is very slow.&lt;/P&gt;

&lt;P&gt;Try to run kernel nn with --tile-size-N 4. It reduces the private memory usage and improves the performance on GPU.&lt;/P&gt;

&lt;P&gt;Thanks, Lingyi&lt;/P&gt;</description>
      <pubDate>Tue, 17 Mar 2015 02:28:39 GMT</pubDate>
      <guid>https://community.intel.com/t5/OpenCL-for-CPU/gemm-sample-program-is-black-out/m-p/1053639#M4090</guid>
      <dc:creator>Lingyi_K_Intel</dc:creator>
      <dc:date>2015-03-17T02:28:39Z</dc:date>
    </item>
    <item>
      <title> </title>
      <link>https://community.intel.com/t5/OpenCL-for-CPU/gemm-sample-program-is-black-out/m-p/1053640#M4091</link>
      <description>&lt;P&gt;&amp;nbsp;&lt;/P&gt;

&lt;P&gt;Graphics driver version is 10.18.10.3907&lt;/P&gt;

&lt;P&gt;I had tried default parameters and change matrix size.&lt;/P&gt;

&lt;P&gt;I tried to run kernel nn with -tile-size-N 4, but I got same result. I think memory is enough.&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 17 Mar 2015 04:45:13 GMT</pubDate>
      <guid>https://community.intel.com/t5/OpenCL-for-CPU/gemm-sample-program-is-black-out/m-p/1053640#M4091</guid>
      <dc:creator>naoki_o_</dc:creator>
      <dc:date>2015-03-17T04:45:13Z</dc:date>
    </item>
    <item>
      <title>Dear Naoki,</title>
      <link>https://community.intel.com/t5/OpenCL-for-CPU/gemm-sample-program-is-black-out/m-p/1053641#M4092</link>
      <description>&lt;P&gt;Dear Naoki,&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;

&lt;P&gt;It appears your driver version is fairly outdated. Please update to 10.18.10.4061 or above: &lt;A href="https://downloadcenter.intel.com/search?keyword=3rd+generation+intel+processors"&gt;https://downloadcenter.intel.com/search?keyword=3rd+generation+intel+processors&lt;/A&gt;&lt;/P&gt;

&lt;P&gt;Let me know whether that worked for you.&lt;/P&gt;</description>
      <pubDate>Tue, 17 Mar 2015 15:52:06 GMT</pubDate>
      <guid>https://community.intel.com/t5/OpenCL-for-CPU/gemm-sample-program-is-black-out/m-p/1053641#M4092</guid>
      <dc:creator>Robert_I_Intel</dc:creator>
      <dc:date>2015-03-17T15:52:06Z</dc:date>
    </item>
    <item>
      <title>Thank you for pointing it out</title>
      <link>https://community.intel.com/t5/OpenCL-for-CPU/gemm-sample-program-is-black-out/m-p/1053642#M4093</link>
      <description>&lt;P&gt;Thank you for pointing it out.&lt;/P&gt;

&lt;P&gt;This GPU is HD graphics 4600, so I installed version 10.18.10.4156&lt;/P&gt;

&lt;P&gt;&lt;A href="https://downloadcenter.intel.com/download/24785/Intel-Iris-and-HD-Graphics-Driver-for-Windows-7-8-8-1-64-bit" target="_blank"&gt;https://downloadcenter.intel.com/download/24785/Intel-Iris-and-HD-Graphics-Driver-for-Windows-7-8-8-1-64-bit&lt;/A&gt;&lt;/P&gt;

&lt;P&gt;But, it does not improve.&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 18 Mar 2015 02:31:25 GMT</pubDate>
      <guid>https://community.intel.com/t5/OpenCL-for-CPU/gemm-sample-program-is-black-out/m-p/1053642#M4093</guid>
      <dc:creator>naoki_o_</dc:creator>
      <dc:date>2015-03-18T02:31:25Z</dc:date>
    </item>
    <item>
      <title>On my HD 4400 GPU, 3968x3968</title>
      <link>https://community.intel.com/t5/OpenCL-for-CPU/gemm-sample-program-is-black-out/m-p/1053643#M4094</link>
      <description>&lt;P&gt;On my HD 4400 GPU, &lt;SPAN style="font-size: 12px; line-height: 14.3999996185303px;"&gt;3968x3968 takes over 26 sec to finish with default tile-size-N = 128. During the execution, the system seems to be "hanging".&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="font-size: 12px; line-height: 14.3999996185303px;"&gt;With tile-size-N = 4, it takes 9 sec to finish. Not sure the behavior on HD 4600, but it should be able to finish... &lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="font-size: 12px; line-height: 14.3999996185303px;"&gt;Again, the performance of this kernel is not that good on GPU, and we have done some optimization work for GEMM on GPU and it performs such better than the sample.&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 18 Mar 2015 06:09:47 GMT</pubDate>
      <guid>https://community.intel.com/t5/OpenCL-for-CPU/gemm-sample-program-is-black-out/m-p/1053643#M4094</guid>
      <dc:creator>Lingyi_K_Intel</dc:creator>
      <dc:date>2015-03-18T06:09:47Z</dc:date>
    </item>
    <item>
      <title>Ummm....</title>
      <link>https://community.intel.com/t5/OpenCL-for-CPU/gemm-sample-program-is-black-out/m-p/1053644#M4095</link>
      <description>&lt;P&gt;&lt;SPAN class="st"&gt;Ummm....&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;When I start 3968x3968, black out back after a few seconds and back&lt;SPAN lang="en"&gt; &lt;SPAN class="hps"&gt;to the normal&lt;/SPAN&gt; &lt;SPAN class="hps"&gt;after a few seconds&lt;/SPAN&gt;&lt;SPAN&gt;.&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;&lt;SPAN lang="en"&gt;&lt;SPAN&gt;But&lt;/SPAN&gt; &lt;SPAN class="hps"&gt;the program&lt;/SPAN&gt; &lt;SPAN class="hps"&gt;has stopped.&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;This sample repeats calculation 10 times.&lt;/P&gt;

&lt;P&gt;&lt;SPAN id="result_box" lang="en"&gt;&lt;SPAN class="hps"&gt;Sometimes&lt;/SPAN&gt; &lt;SPAN class="hps"&gt;calculation&lt;/SPAN&gt; &lt;SPAN class="hps"&gt;is finished one or two times. At that time&lt;/SPAN&gt;&lt;SPAN&gt;,&lt;/SPAN&gt; it &lt;SPAN class="hps"&gt;takes&lt;/SPAN&gt; &lt;SPAN class="hps"&gt;about&lt;/SPAN&gt; &lt;SPAN class="hps"&gt;8 seconds to finish once of calculation.&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;&lt;SPAN id="result_box" lang="en"&gt;&lt;SPAN class="hps"&gt;However&lt;/SPAN&gt;&lt;SPAN&gt;,&lt;/SPAN&gt; &lt;SPAN class="hps"&gt;never&lt;/SPAN&gt;&amp;nbsp;&lt;SPAN class="hps"&gt;finished&lt;/SPAN&gt; &lt;SPAN class="hps"&gt;to the end&lt;/SPAN&gt; &lt;SPAN class="hps"&gt;of the loop&lt;/SPAN&gt;&lt;/SPAN&gt;.&lt;/P&gt;

&lt;P&gt;When CPU is selected, it runs &lt;SPAN class="short_text" id="result_box" lang="en"&gt;&lt;SPAN class="hps alt-edited"&gt;normal.&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 18 Mar 2015 10:14:39 GMT</pubDate>
      <guid>https://community.intel.com/t5/OpenCL-for-CPU/gemm-sample-program-is-black-out/m-p/1053644#M4095</guid>
      <dc:creator>naoki_o_</dc:creator>
      <dc:date>2015-03-18T10:14:39Z</dc:date>
    </item>
    <item>
      <title>I took a  video of the</title>
      <link>https://community.intel.com/t5/OpenCL-for-CPU/gemm-sample-program-is-black-out/m-p/1053645#M4096</link>
      <description>&lt;P&gt;&lt;SPAN class="short_text" id="result_box" lang="en"&gt;&lt;SPAN class="hps"&gt;I&lt;/SPAN&gt; &lt;SPAN class="hps"&gt;took &lt;/SPAN&gt;a &lt;SPAN class="hps"&gt; video of the operation.&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 18 Mar 2015 10:23:53 GMT</pubDate>
      <guid>https://community.intel.com/t5/OpenCL-for-CPU/gemm-sample-program-is-black-out/m-p/1053645#M4096</guid>
      <dc:creator>naoki_o_</dc:creator>
      <dc:date>2015-03-18T10:23:53Z</dc:date>
    </item>
    <item>
      <title>You could always add "-cl-mad</title>
      <link>https://community.intel.com/t5/OpenCL-for-CPU/gemm-sample-program-is-black-out/m-p/1053646#M4097</link>
      <description>&lt;P&gt;You could always&amp;nbsp;add "-cl-mad-enable"&amp;nbsp;to make multiply+add operations faster, and also try to use&amp;nbsp;"-cl-fast-relaxed-math" to get performance gains but you may lose control of numeric accuracy. Try them in build program option.&lt;/P&gt;

&lt;P&gt;Below kernel performs much better than current nn/nt kernels on GPU,&amp;nbsp;and need following changes in host code.&lt;/P&gt;

&lt;P&gt;1. Update kernel args and add "-DSIZE=3968" in build program option.&lt;/P&gt;

&lt;P&gt;2. Update global size to {3968 / 4, 3968 / 8} because each work items calculates 8x4 data.&lt;/P&gt;

&lt;P&gt;3. Update verification code to check the output in row-major order.&lt;/P&gt;

&lt;PRE class="brush:cpp;"&gt;// C := alpha*A*B + beta*C
// A is in row-major form
// B is in row-major form
// C is in row-major form
__kernel void gemm_8x4(__global const float4 * restrict A,
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; __global const float4 * restrict B,
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; __global float4 * restrict C,
		&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; float alpha,
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 		float beta)
{
&amp;nbsp;&amp;nbsp;&amp;nbsp; int x = get_global_id(0);
&amp;nbsp;   int y = get_global_id(1);

&amp;nbsp;&amp;nbsp;&amp;nbsp; float4 sum[8] = {(float4)(0)};
&amp;nbsp;   float4 tempA;
&amp;nbsp;   float4 tempB[4];

&amp;nbsp;&amp;nbsp;&amp;nbsp; for(uint i = 0; i &amp;lt; SIZE; i=i+4)
&amp;nbsp;&amp;nbsp;&amp;nbsp; {
&amp;nbsp;&amp;nbsp;	for(uint k = 0; k &amp;lt; 4; k++)
&amp;nbsp;&amp;nbsp;	{
&amp;nbsp;&amp;nbsp;&amp;nbsp;		tempB&lt;K&gt; = B[x + (i + k) * (SIZE / 4)];
&amp;nbsp;&amp;nbsp;	}
&amp;nbsp;&amp;nbsp;
&amp;nbsp;&amp;nbsp;	for (uint k = 0; k &amp;lt; 8; k++)
&amp;nbsp;&amp;nbsp;	{
&amp;nbsp;&amp;nbsp;&amp;nbsp;		tempA = A[i/4 + ((y * 8) + k) * (SIZE / 4)];
&amp;nbsp;&amp;nbsp;&amp;nbsp;		sum&lt;K&gt; += tempB[0] * tempA.x;
&amp;nbsp;&amp;nbsp;&amp;nbsp;		sum&lt;K&gt; += tempB[1] * tempA.y;
&amp;nbsp;&amp;nbsp;&amp;nbsp;		sum&lt;K&gt; += tempB[2] * tempA.z;
&amp;nbsp;&amp;nbsp;&amp;nbsp;		sum&lt;K&gt; += tempB[3] * tempA.w;
&amp;nbsp;&amp;nbsp;	}
&amp;nbsp;&amp;nbsp;	//improve cache accesses
&amp;nbsp;&amp;nbsp;	barrier(CLK_LOCAL_MEM_FENCE); 
&amp;nbsp;&amp;nbsp;&amp;nbsp; }
&amp;nbsp;   for(uint i = 0; i &amp;lt; 8; i++)
&amp;nbsp;   {
&amp;nbsp;&amp;nbsp;	C[x + (y * 8 + i) * (SIZE / 4)] = alpha * sum&lt;I&gt; + beta * C[x + (y * 8 + i) * (SIZE / 4)];
&amp;nbsp;   } 
}&lt;/I&gt;&lt;/K&gt;&lt;/K&gt;&lt;/K&gt;&lt;/K&gt;&lt;/K&gt;&lt;/PRE&gt;

&lt;P&gt;BTW, another optimization is to use local memory. In each work group after loading sub-matrix of input matrix&amp;nbsp;into local memory, do the synchronization, and then read the data from local memory and do the calculation.&amp;nbsp;With this solution we should get&amp;nbsp;further performance gains.&lt;/P&gt;

&lt;P&gt;Please have a try and let me the result. Thanks, Lingyi&lt;/P&gt;</description>
      <pubDate>Thu, 19 Mar 2015 02:30:51 GMT</pubDate>
      <guid>https://community.intel.com/t5/OpenCL-for-CPU/gemm-sample-program-is-black-out/m-p/1053646#M4097</guid>
      <dc:creator>Lingyi_K_Intel</dc:creator>
      <dc:date>2015-03-19T02:30:51Z</dc:date>
    </item>
    <item>
      <title>I've tried it, but it was the</title>
      <link>https://community.intel.com/t5/OpenCL-for-CPU/gemm-sample-program-is-black-out/m-p/1053647#M4098</link>
      <description>&lt;DIV class="g-unit" id="gt-res-c"&gt;
	&lt;DIV id="gt-res-p"&gt;
		&lt;DIV id="gt-res-data"&gt;
			&lt;DIV id="gt-res-wrap"&gt;
				&lt;DIV class="almost_half_cell" id="gt-res-content"&gt;
					&lt;DIV dir="ltr" style="zoom:1"&gt;&lt;SPAN class="short_text" id="result_box" lang="en"&gt;&lt;SPAN class="hps"&gt;I&lt;/SPAN&gt;&lt;SPAN&gt;'ve tried&lt;/SPAN&gt; &lt;SPAN class="hps"&gt;it&lt;/SPAN&gt;&lt;/SPAN&gt;, but it was the same &lt;SPAN class="short_text" id="result_box" lang="en"&gt;&lt;SPAN class="hps"&gt;behavior&lt;/SPAN&gt;&lt;/SPAN&gt;.&lt;/DIV&gt;

					&lt;DIV dir="ltr" style="zoom:1"&gt;&amp;nbsp;&lt;/DIV&gt;

					&lt;DIV dir="ltr" style="zoom:1"&gt;If a program isn't optimized, is there a case &lt;SPAN id="result_box" lang="en"&gt;&lt;SPAN class="hps"&gt;that the program&lt;/SPAN&gt; &lt;SPAN class="hps"&gt;would stop?&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/DIV&gt;
				&lt;/DIV&gt;
			&lt;/DIV&gt;
		&lt;/DIV&gt;
	&lt;/DIV&gt;
&lt;/DIV&gt;</description>
      <pubDate>Mon, 23 Mar 2015 08:59:15 GMT</pubDate>
      <guid>https://community.intel.com/t5/OpenCL-for-CPU/gemm-sample-program-is-black-out/m-p/1053647#M4098</guid>
      <dc:creator>naoki_o_</dc:creator>
      <dc:date>2015-03-23T08:59:15Z</dc:date>
    </item>
    <item>
      <title>If the case causes graphic</title>
      <link>https://community.intel.com/t5/OpenCL-for-CPU/gemm-sample-program-is-black-out/m-p/1053648#M4099</link>
      <description>&lt;P&gt;If the case causes graphic driver reset, then the execution would stop. Do you see the graphics driver stopped responding and recovered in your case?&lt;/P&gt;</description>
      <pubDate>Fri, 27 Mar 2015 05:35:45 GMT</pubDate>
      <guid>https://community.intel.com/t5/OpenCL-for-CPU/gemm-sample-program-is-black-out/m-p/1053648#M4099</guid>
      <dc:creator>Lingyi_K_Intel</dc:creator>
      <dc:date>2015-03-27T05:35:45Z</dc:date>
    </item>
    <item>
      <title>Yes.</title>
      <link>https://community.intel.com/t5/OpenCL-for-CPU/gemm-sample-program-is-black-out/m-p/1053649#M4100</link>
      <description>&lt;P&gt;Yes.&lt;/P&gt;

&lt;P&gt;After recovery, it's displayed that the graphics driver stopped responding and recovered.&lt;/P&gt;</description>
      <pubDate>Fri, 27 Mar 2015 10:32:24 GMT</pubDate>
      <guid>https://community.intel.com/t5/OpenCL-for-CPU/gemm-sample-program-is-black-out/m-p/1053649#M4100</guid>
      <dc:creator>naoki_o_</dc:creator>
      <dc:date>2015-03-27T10:32:24Z</dc:date>
    </item>
    <item>
      <title>Ok, this shall be the problem</title>
      <link>https://community.intel.com/t5/OpenCL-for-CPU/gemm-sample-program-is-black-out/m-p/1053650#M4101</link>
      <description>&lt;P&gt;Ok, this shall be&amp;nbsp;the problem. GPU stops/resets&amp;nbsp;for large workload.&lt;/P&gt;

&lt;P&gt;You could try a longer TDR time in regedit (HKEY_LOCAL_MACHINE\SYSTEM|ControlSet001\Control\GraphicsDrivers\TdrDelay) to see if larger delay time could make your case finish.&lt;/P&gt;</description>
      <pubDate>Tue, 31 Mar 2015 02:30:28 GMT</pubDate>
      <guid>https://community.intel.com/t5/OpenCL-for-CPU/gemm-sample-program-is-black-out/m-p/1053650#M4101</guid>
      <dc:creator>Lingyi_K_Intel</dc:creator>
      <dc:date>2015-03-31T02:30:28Z</dc:date>
    </item>
    <item>
      <title>It went well.</title>
      <link>https://community.intel.com/t5/OpenCL-for-CPU/gemm-sample-program-is-black-out/m-p/1053651#M4102</link>
      <description>&lt;P&gt;It went well.&lt;/P&gt;

&lt;P&gt;The problem was resolved.&lt;/P&gt;

&lt;P&gt;Thank you very much!&lt;BR /&gt;
	&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 03 Apr 2015 03:49:56 GMT</pubDate>
      <guid>https://community.intel.com/t5/OpenCL-for-CPU/gemm-sample-program-is-black-out/m-p/1053651#M4102</guid>
      <dc:creator>naoki_o_</dc:creator>
      <dc:date>2015-04-03T03:49:56Z</dc:date>
    </item>
  </channel>
</rss>

