<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Pinpointed the problem: We in Software Archive</title>
    <link>https://community.intel.com/t5/Software-Archive/SCIF-connection-refused/m-p/925181#M13699</link>
    <description>&lt;P&gt;Pinpointed the problem: We use a slightly customized system for user management on the MICs and due to that the 'micuser' user was missing during mpssd and ofed-mic initialization. I now added the user and offloading seems to work again. Suggestion: It would be nice to have a sanity check for this. &lt;/P&gt;
&lt;P&gt;Olli-Pekka&lt;/P&gt;</description>
    <pubDate>Sun, 14 Apr 2013 13:15:35 GMT</pubDate>
    <dc:creator>Olli-Pekka_L_</dc:creator>
    <dc:date>2013-04-14T13:15:35Z</dc:date>
    <item>
      <title>SCIF connection refused</title>
      <link>https://community.intel.com/t5/Software-Archive/SCIF-connection-refused/m-p/925180#M13698</link>
      <description>&lt;P&gt;For some reason the SCIF interface in my compute nodes is refusing connections. Any ideas on what's wrong or where to start investigating:&lt;/P&gt;
&lt;P&gt;The node has a Mellanox ConnectX-3 HCA with the latest Gold Update 2 MPSS and everything else set up "by the book". All the IB services and modules load nicely and seem to work and I can ssh into the MIC and run natively.&lt;/P&gt;
&lt;P&gt;However, if I try to run an offload (LEO or OpenCL) application it hangs. Doing an strace reveals the following:&lt;/P&gt;
&lt;P&gt;[plain]&lt;/P&gt;
&lt;P&gt;mmap(NULL, 10489856, PROT_READ|PROT_WRITE|PROT_EXEC, MAP_PRIVATE|MAP_ANONYMOUS|MAP_STACK, -1, 0) = 0x7f737396e000&lt;BR /&gt;mprotect(0x7f737396e000, 4096, PROT_NONE) = 0&lt;BR /&gt;clone(child_stack=0x7f737436dfd0, flags=CLONE_VM|CLONE_FS|CLONE_FILES|CLONE_SIGHAND|CLONE_THREAD|CLONE_SYSVSEM|CLONE_SETTLS|CLONE_PARENT_SETTID|CLONE_CHILD_CLEARTID, parent_tidptr=0x7f737436e9d0, tls=0x7f737436e700, child_tidptr=0x7f737436e9d0) = 26801&lt;BR /&gt;open("/dev/mic/scif", O_RDWR)&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; = 5&lt;BR /&gt;fcntl(5, F_SETFD, FD_CLOEXEC)&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; = 0&lt;BR /&gt;ioctl(5, 0xc0087303, 0x7fffa02d2710)&amp;nbsp;&amp;nbsp;&amp;nbsp; = 0&lt;BR /&gt;futex(0x7f737436e9d0, FUTEX_WAIT, 26801, NULL) = 0&lt;BR /&gt;close(4)&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; = 0&lt;BR /&gt;ioctl(3, 0xc0087303, 0x7fffa02d27d0)&amp;nbsp;&amp;nbsp;&amp;nbsp; = -1 ECONNREFUSED (Connection refused)&lt;BR /&gt;nanosleep({0, 10000000}, NULL)&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; = 0&lt;BR /&gt;ioctl(3, 0xc0087303, 0x7fffa02d27d0)&amp;nbsp;&amp;nbsp;&amp;nbsp; = -1 ECONNREFUSED (Connection refused)&lt;BR /&gt;nanosleep({0, 20000000}, NULL)&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; = 0&lt;BR /&gt;ioctl(3, 0xc0087303, 0x7fffa02d27d0)&amp;nbsp;&amp;nbsp;&amp;nbsp; = -1 ECONNREFUSED (Connection refused)&lt;BR /&gt;nanosleep({0, 40000000}, NULL)&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; = 0&lt;BR /&gt;[/plain]&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 09 Apr 2013 14:16:17 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Archive/SCIF-connection-refused/m-p/925180#M13698</guid>
      <dc:creator>oplehto</dc:creator>
      <dc:date>2013-04-09T14:16:17Z</dc:date>
    </item>
    <item>
      <title>Pinpointed the problem: We</title>
      <link>https://community.intel.com/t5/Software-Archive/SCIF-connection-refused/m-p/925181#M13699</link>
      <description>&lt;P&gt;Pinpointed the problem: We use a slightly customized system for user management on the MICs and due to that the 'micuser' user was missing during mpssd and ofed-mic initialization. I now added the user and offloading seems to work again. Suggestion: It would be nice to have a sanity check for this. &lt;/P&gt;
&lt;P&gt;Olli-Pekka&lt;/P&gt;</description>
      <pubDate>Sun, 14 Apr 2013 13:15:35 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Archive/SCIF-connection-refused/m-p/925181#M13699</guid>
      <dc:creator>Olli-Pekka_L_</dc:creator>
      <dc:date>2013-04-14T13:15:35Z</dc:date>
    </item>
  </channel>
</rss>

