<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Fat binary with PTX backend in Intel® oneAPI DPC++/C++ Compiler</title>
    <link>https://community.intel.com/t5/Intel-oneAPI-DPC-C-Compiler/Fat-binary-with-PTX-backend/m-p/1287121#M1265</link>
    <description>&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Hi,&lt;/P&gt;
&lt;P&gt;&lt;BR /&gt;To compiler SYCL code for a specific NVIDIA device, I've used to following:&lt;/P&gt;
&lt;LI-CODE lang="markup"&gt;clang++ \
  -fsycl \
  -fsycl-targets=nvptx64-nvidia-cuda-sycldevice \ 
  -fsycl-unnamed-lambda \
  -Xsycl-target-backend "--cuda-gpu-arch=sm_35" \
  test.cpp&lt;/LI-CODE&gt;
&lt;P&gt;Is there a way to generate a fat binary containing several sm_* computes ?&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I have gone through the manual with no avail.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks &lt;/P&gt;</description>
    <pubDate>Fri, 04 Jun 2021 08:16:39 GMT</pubDate>
    <dc:creator>Viet-Duc</dc:creator>
    <dc:date>2021-06-04T08:16:39Z</dc:date>
    <item>
      <title>Fat binary with PTX backend</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-DPC-C-Compiler/Fat-binary-with-PTX-backend/m-p/1287121#M1265</link>
      <description>&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Hi,&lt;/P&gt;
&lt;P&gt;&lt;BR /&gt;To compiler SYCL code for a specific NVIDIA device, I've used to following:&lt;/P&gt;
&lt;LI-CODE lang="markup"&gt;clang++ \
  -fsycl \
  -fsycl-targets=nvptx64-nvidia-cuda-sycldevice \ 
  -fsycl-unnamed-lambda \
  -Xsycl-target-backend "--cuda-gpu-arch=sm_35" \
  test.cpp&lt;/LI-CODE&gt;
&lt;P&gt;Is there a way to generate a fat binary containing several sm_* computes ?&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I have gone through the manual with no avail.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks &lt;/P&gt;</description>
      <pubDate>Fri, 04 Jun 2021 08:16:39 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-DPC-C-Compiler/Fat-binary-with-PTX-backend/m-p/1287121#M1265</guid>
      <dc:creator>Viet-Duc</dc:creator>
      <dc:date>2021-06-04T08:16:39Z</dc:date>
    </item>
    <item>
      <title>Re: Fat binary with PTX backend</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-DPC-C-Compiler/Fat-binary-with-PTX-backend/m-p/1287674#M1267</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Could you try the following command and let us know?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;LI-CODE lang="markup"&gt;clang++ -fsycl -fsycl-targets=nvptx64-nvidia-cuda-sycldevice sample.cpp -o sample&lt;/LI-CODE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;IMO, the above command should work on any SM architecture.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks,&lt;/P&gt;
&lt;P&gt;Rahul&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 07 Jun 2021 10:39:20 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-DPC-C-Compiler/Fat-binary-with-PTX-backend/m-p/1287674#M1267</guid>
      <dc:creator>RahulV_intel</dc:creator>
      <dc:date>2021-06-07T10:39:20Z</dc:date>
    </item>
    <item>
      <title>Re: Fat binary with PTX backend</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-DPC-C-Compiler/Fat-binary-with-PTX-backend/m-p/1287740#M1271</link>
      <description>&lt;P&gt;Thanks for suggestions.&lt;/P&gt;
&lt;P&gt;But if you remove '-Xsycl-target-backend', the program will fail at runtime, for instance on Tesla K40&lt;/P&gt;
&lt;LI-CODE lang="markup"&gt;PI CUDA ERROR:
        Value:           209
        Name:            CUDA_ERROR_NO_BINARY_FOR_GPU
        Description:     no kernel image is available for execution on the device
        Function:        build_program
        Source Location: .../apps/src/llvm/unstable/sycl/plugins/cuda/pi_cuda.cpp:516


PI CUDA ERROR:
        Value:           400
        Name:            CUDA_ERROR_INVALID_HANDLE
        Description:     invalid resource handle
        Function:        cuda_piProgramRelease
        Source Location: .../apps/src/llvm/unstable/sycl/plugins/cuda/pi_cuda.cpp:2938

The program was built for 1 devices
Build program log for 'Tesla K40m':
 -999 (Unknown OpenCL error code)&lt;/LI-CODE&gt;
&lt;P&gt;If I force sm35, it will not work on other gpus such as V100.&lt;/P&gt;
&lt;P&gt;Since the manual is quite terse, I have asked the question in case I overlooked something.&lt;/P&gt;</description>
      <pubDate>Mon, 07 Jun 2021 14:46:15 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-DPC-C-Compiler/Fat-binary-with-PTX-backend/m-p/1287740#M1271</guid>
      <dc:creator>Viet-Duc</dc:creator>
      <dc:date>2021-06-07T14:46:15Z</dc:date>
    </item>
    <item>
      <title>Re:Fat binary with PTX backend</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-DPC-C-Compiler/Fat-binary-with-PTX-backend/m-p/1288352#M1277</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;As per the documentation, SM-50 and above architectures are supported. &lt;/P&gt;&lt;P&gt;&lt;A href="https://github.com/intel/llvm/blob/sycl/sycl/doc/GetStartedGuide.md" target="_blank"&gt;https://github.com/intel/llvm/blob/sycl/sycl/doc/GetStartedGuide.md&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Please note that only the Github version of DPC++ supports the CUDA backend. If you have further queries, please raise a new issue in the below link:&lt;/P&gt;&lt;P&gt;&lt;A href="https://github.com/intel/llvm/issues" target="_blank"&gt;https://github.com/intel/llvm/issues&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;Rahul&lt;/P&gt;&lt;BR /&gt;</description>
      <pubDate>Wed, 09 Jun 2021 06:31:04 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-DPC-C-Compiler/Fat-binary-with-PTX-backend/m-p/1288352#M1277</guid>
      <dc:creator>RahulV_intel</dc:creator>
      <dc:date>2021-06-09T06:31:04Z</dc:date>
    </item>
    <item>
      <title>Re: Fat binary with PTX backend</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-DPC-C-Compiler/Fat-binary-with-PTX-backend/m-p/1288432#M1280</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;You are right.&lt;/P&gt;
&lt;P&gt;For CUDA-related question, I should have asked the intel-llvm developers instead.&lt;/P&gt;
&lt;P&gt;I will raise the issue through github page. Nevertheless, thanks for your time.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Regards.&lt;/P&gt;</description>
      <pubDate>Wed, 09 Jun 2021 12:18:56 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-DPC-C-Compiler/Fat-binary-with-PTX-backend/m-p/1288432#M1280</guid>
      <dc:creator>Viet-Duc</dc:creator>
      <dc:date>2021-06-09T12:18:56Z</dc:date>
    </item>
    <item>
      <title>Re:Fat binary with PTX backend</title>
      <link>https://community.intel.com/t5/Intel-oneAPI-DPC-C-Compiler/Fat-binary-with-PTX-backend/m-p/1289587#M1298</link>
      <description>&lt;P&gt;Intel will no longer monitor this thread. Further discussions on this thread will be considered community only.&lt;/P&gt;&lt;BR /&gt;</description>
      <pubDate>Mon, 14 Jun 2021 10:03:36 GMT</pubDate>
      <guid>https://community.intel.com/t5/Intel-oneAPI-DPC-C-Compiler/Fat-binary-with-PTX-backend/m-p/1289587#M1298</guid>
      <dc:creator>RahulV_intel</dc:creator>
      <dc:date>2021-06-14T10:03:36Z</dc:date>
    </item>
  </channel>
</rss>

