<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Some suggestions! in OpenCL* for CPU</title>
    <link>https://community.intel.com/t5/OpenCL-for-CPU/Some-suggestions/m-p/778580#M323</link>
    <description>&lt;DIV&gt;Hi,&lt;/DIV&gt;&lt;DIV&gt;Some suggestions for improving this good release:&lt;/DIV&gt;&lt;DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;For completeness:&lt;/DIV&gt;&lt;DIV&gt;*Add cl_khr_3d_image_writes support, as the AMD GPU backend supports it and I have a demo that uses it.&lt;/DIV&gt;&lt;DIV&gt;*Add D3D10 interop (cl_khr_d3d10_sharing), similar to the OpenGL interop, so that some Nvidia/AMD samples work too.&lt;/DIV&gt;&lt;DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;Compared to AMD:&lt;/DIV&gt;&lt;DIV&gt;*Add cl_ext_device_fission so we can expose multiple concurrent kernels, etc.&lt;/DIV&gt;&lt;DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;More ambitious:&lt;/DIV&gt;&lt;DIV&gt;Add next-gen computing features (as featured in CUDA 3.x):&lt;/DIV&gt;&lt;DIV&gt;*Support for non-inlined functions with a stack, which brings function pointers and recursion. Believe it or not, the Nvidia OCL GPU backend supports at least recursion, and function pointers fail only at build time. (The GPU Ocelot CPU backend (PTX-&amp;gt;LLVM) also supports this right now!)&lt;/DIV&gt;&lt;DIV&gt;*Similar to printf, expose malloc and free (new in CUDA 3.2).&lt;/DIV&gt;&lt;DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;It also seems AMD is working on some C++ support (templatized kernels).&lt;/DIV&gt;&lt;DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;What do you think?&lt;/DIV&gt;&lt;DIV&gt;Thanks.&lt;/DIV&gt;</description>
    <pubDate>Wed, 24 Nov 2010 03:03:39 GMT</pubDate>
    <dc:creator>rtfss1gmail_com</dc:creator>
    <dc:date>2010-11-24T03:03:39Z</dc:date>
    <item>
      <title>Some suggestions!</title>
      <link>https://community.intel.com/t5/OpenCL-for-CPU/Some-suggestions/m-p/778580#M323</link>
      <description>&lt;DIV&gt;Hi,&lt;/DIV&gt;&lt;DIV&gt;Some suggestions for improving this good release:&lt;/DIV&gt;&lt;DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;For completeness:&lt;/DIV&gt;&lt;DIV&gt;*Add cl_khr_3d_image_writes support, as the AMD GPU backend supports it and I have a demo that uses it.&lt;/DIV&gt;&lt;DIV&gt;*Add D3D10 interop (cl_khr_d3d10_sharing), similar to the OpenGL interop, so that some Nvidia/AMD samples work too.&lt;/DIV&gt;&lt;DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;Compared to AMD:&lt;/DIV&gt;&lt;DIV&gt;*Add cl_ext_device_fission so we can expose multiple concurrent kernels, etc.&lt;/DIV&gt;&lt;DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;More ambitious:&lt;/DIV&gt;&lt;DIV&gt;Add next-gen computing features (as featured in CUDA 3.x):&lt;/DIV&gt;&lt;DIV&gt;*Support for non-inlined functions with a stack, which brings function pointers and recursion. Believe it or not, the Nvidia OCL GPU backend supports at least recursion, and function pointers fail only at build time. (The GPU Ocelot CPU backend (PTX-&amp;gt;LLVM) also supports this right now!)&lt;/DIV&gt;&lt;DIV&gt;*Similar to printf, expose malloc and free (new in CUDA 3.2).&lt;/DIV&gt;&lt;DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;It also seems AMD is working on some C++ support (templatized kernels).&lt;/DIV&gt;&lt;DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;What do you think?&lt;/DIV&gt;&lt;DIV&gt;Thanks.&lt;/DIV&gt;</description>
      <pubDate>Wed, 24 Nov 2010 03:03:39 GMT</pubDate>
      <guid>https://community.intel.com/t5/OpenCL-for-CPU/Some-suggestions/m-p/778580#M323</guid>
      <dc:creator>rtfss1gmail_com</dc:creator>
      <dc:date>2010-11-24T03:03:39Z</dc:date>
    </item>
    <item>
      <title>Some suggestions!</title>
      <link>https://community.intel.com/t5/OpenCL-for-CPU/Some-suggestions/m-p/778581#M324</link>
      <description>&lt;P&gt;Another one:&lt;/P&gt;&lt;P&gt;*Allow an asm("") function so that x86 assembly code can be inserted in kernels.&lt;/P&gt;&lt;P&gt;CUDA allows asm statements containing PTX code inside CUDA device functions.&lt;/P&gt;</description>
      <pubDate>Wed, 24 Nov 2010 04:00:08 GMT</pubDate>
      <guid>https://community.intel.com/t5/OpenCL-for-CPU/Some-suggestions/m-p/778581#M324</guid>
      <dc:creator>rtfss1gmail_com</dc:creator>
      <dc:date>2010-11-24T04:00:08Z</dc:date>
    </item>
    <item>
      <title>Some suggestions!</title>
      <link>https://community.intel.com/t5/OpenCL-for-CPU/Some-suggestions/m-p/778582#M325</link>
      <description>&lt;P&gt;Hi,&lt;BR /&gt;We would like to thank you for your suggestions. &lt;BR /&gt;Some of these suggestions have been raised internally as well, and are being considered for the next versions of the SDK. &lt;BR /&gt;&lt;BR /&gt;Regarding the specific proposal to allow asm functions directly inside kernels, I do not believe that we will want to go in this direction. Intel's direction is to promote the cross-device approach of OpenCL, and this proposal goes against it. The preferred direction is to improve the compiler, making sure that the mapping to assembler instructions is efficient. I believe that the additions made to the OCL C language improve the compiler's ability to reach this goal. &lt;BR /&gt;However, the direction of adding new built-in functions that map well to SSE instructions is interesting. We do see cases where a code sequence can be efficiently replaced by an SSE instruction, and the method that we prefer is to expose it as a built-in function. This preserves the approach of the C language and is also forward compatible: on a future ISA, the JIT compiler can map this built-in to a new instruction. &lt;/P&gt;</description>
      <pubDate>Thu, 25 Nov 2010 08:41:46 GMT</pubDate>
      <guid>https://community.intel.com/t5/OpenCL-for-CPU/Some-suggestions/m-p/778582#M325</guid>
      <dc:creator>Ofer_Rosenberg__Inte</dc:creator>
      <dc:date>2010-11-25T08:41:46Z</dc:date>
    </item>
    <item>
      <title>Some suggestions!</title>
      <link>https://community.intel.com/t5/OpenCL-for-CPU/Some-suggestions/m-p/778583#M326</link>
      <description>Hi,&lt;DIV&gt;Thanks for your insight. As you say, perhaps asm("") is not a good approach, but I think the others are still interesting! Really looking forward to seeing how this excellent SDK evolves!&lt;/DIV&gt;&lt;DIV&gt;Thanks.&lt;/DIV&gt;</description>
      <pubDate>Wed, 01 Dec 2010 08:23:45 GMT</pubDate>
      <guid>https://community.intel.com/t5/OpenCL-for-CPU/Some-suggestions/m-p/778583#M326</guid>
      <dc:creator>rtfss1gmail_com</dc:creator>
      <dc:date>2010-12-01T08:23:45Z</dc:date>
    </item>
  </channel>
</rss>

