<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Unable to access the KNC card in Software Archive</title>
    <link>https://community.intel.com/t5/Software-Archive/Unable-to-access-the-KNC-card/m-p/1116618#M74297</link>
    <description>&lt;P&gt;Dear all,&lt;/P&gt;

&lt;P&gt;I have a server running&amp;nbsp;CentOS Linux release 7.2.1511, equipped with a KNC MIC card,&lt;BR /&gt;
	using MPSS version 3.3 in order to access and manage the card. The server has been&lt;BR /&gt;
	running for about 2 years without issues. Recently however, I am unable to ssh into the&lt;BR /&gt;
	MIC card. The problem persists after rebooting the entire node. Trying to reboot just the&lt;BR /&gt;
	card gives the following output:&lt;/P&gt;

&lt;PRE class="brush:bash;"&gt;$ sudo micctrl --reboot
Error getting SCIF driver version 
  [Error] mic0 failed to shutdown: card state (null)
  [Error] mic0: cannot wait for non existent MIC device
  [Error] mic0: Boot aborted - Setting kernel command line failed
  [Error] mic0: cannot wait for non existent MIC device
$ sudo micctrl --status
Error getting SCIF driver version 
  [Error] mic0: cannot find state of non existent MIC device&lt;/PRE&gt;

&lt;P&gt;&lt;SPAN style="font-size: 1em;"&gt;The MPSS service, however, seems to be working ok:&lt;/SPAN&gt;&lt;/P&gt;

&lt;PRE class="brush:bash;"&gt;$ sudo service mpss status
mpss is running&lt;/PRE&gt;

&lt;P&gt;The card is also recognized as existing on the PCI bus:&lt;/P&gt;

&lt;PRE class="brush:bash;"&gt;$ sudo lspci -v
...
02:00.0 Co-processor: Intel Corporation Xeon Phi coprocessor 3120 series (rev 20)
    Subsystem: Intel Corporation Device 3608
    Flags: fast devsel, IRQ 11
    Memory at 380c00000000 (64-bit, prefetchable) [disabled] [size=8G]
    Memory at fb600000 (64-bit, non-prefetchable) [disabled] [size=128K]
    Capabilities: [44] Power Management version 3
    Capabilities: [4c] Express Endpoint, MSI 00
    Capabilities: [88] MSI: Enable- Count=1/16 Maskable- 64bit+
    Capabilities: [98] MSI-X: Enable- Count=16 Masked-
    Capabilities: [100] Advanced Error Reporting
...&lt;/PRE&gt;

&lt;P&gt;Finally, the miccheck command reports a driver-related issue:&lt;/P&gt;

&lt;PRE class="brush:bash;"&gt;$ sudo miccheck
MicCheck 3.3-r1
Copyright 2013 Intel Corporation All Rights Reserved
Executing default tests for host
  Test 0: Check number of devices the OS sees in the system ... pass
  Test 1: Check mic driver is loaded ... fail
    mic driver not loaded
Status: FAIL
Failure: mic driver not loaded&lt;/PRE&gt;

&lt;P&gt;&lt;SPAN style="font-size: 1em;"&gt;It is obviously a driver-related issue; however, we have not "touched" the driver in a&lt;BR /&gt;
	very long time and MPSS seems to be working fine. Isn't the SCIF driver part of the&lt;BR /&gt;
	MPSS service? There seems to be an issue with this driver in particular.&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;We could definitely solve this issue by re-installing MPSS as a whole and updating&lt;BR /&gt;
	it as well (3.3 is quite old by now), however this is a time-consuming and potentially&lt;BR /&gt;
	volatile process, backwards-compatibility being an issue. Is there some other way&lt;BR /&gt;
	of solving this?&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="font-size: 1em;"&gt;Thank you for your time reading this post.&lt;BR /&gt;
	I eagerly await your responses,&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="font-size: 1em;"&gt;George Ch.&lt;/SPAN&gt;&lt;/P&gt;</description>
    <pubDate>Mon, 17 Oct 2016 10:19:35 GMT</pubDate>
    <dc:creator>George_C_5</dc:creator>
    <dc:date>2016-10-17T10:19:35Z</dc:date>
    <item>
      <title>Unable to access the KNC card</title>
      <link>https://community.intel.com/t5/Software-Archive/Unable-to-access-the-KNC-card/m-p/1116618#M74297</link>
      <description>&lt;P&gt;Dear all,&lt;/P&gt;

&lt;P&gt;I have a server running&amp;nbsp;CentOS Linux release 7.2.1511, equipped with a KNC MIC card,&lt;BR /&gt;
	using MPSS version 3.3 in order to access and manage the card. The server has been&lt;BR /&gt;
	running for about 2 years without issues. Recently however, I am unable to ssh into the&lt;BR /&gt;
	MIC card. The problem persists after rebooting the entire node. Trying to reboot just the&lt;BR /&gt;
	card gives the following output:&lt;/P&gt;

&lt;PRE class="brush:bash;"&gt;$ sudo micctrl --reboot
Error getting SCIF driver version 
  [Error] mic0 failed to shutdown: card state (null)
  [Error] mic0: cannot wait for non existent MIC device
  [Error] mic0: Boot aborted - Setting kernel command line failed
  [Error] mic0: cannot wait for non existent MIC device
$ sudo micctrl --status
Error getting SCIF driver version 
  [Error] mic0: cannot find state of non existent MIC device&lt;/PRE&gt;

&lt;P&gt;&lt;SPAN style="font-size: 1em;"&gt;The MPSS service, however, seems to be working ok:&lt;/SPAN&gt;&lt;/P&gt;

&lt;PRE class="brush:bash;"&gt;$ sudo service mpss status
mpss is running&lt;/PRE&gt;

&lt;P&gt;The card is also recognized as existing on the PCI bus:&lt;/P&gt;

&lt;PRE class="brush:bash;"&gt;$ sudo lspci -v
...
02:00.0 Co-processor: Intel Corporation Xeon Phi coprocessor 3120 series (rev 20)
    Subsystem: Intel Corporation Device 3608
    Flags: fast devsel, IRQ 11
    Memory at 380c00000000 (64-bit, prefetchable) [disabled] [size=8G]
    Memory at fb600000 (64-bit, non-prefetchable) [disabled] [size=128K]
    Capabilities: [44] Power Management version 3
    Capabilities: [4c] Express Endpoint, MSI 00
    Capabilities: [88] MSI: Enable- Count=1/16 Maskable- 64bit+
    Capabilities: [98] MSI-X: Enable- Count=16 Masked-
    Capabilities: [100] Advanced Error Reporting
...&lt;/PRE&gt;

&lt;P&gt;Finally, the miccheck command reports a driver-related issue:&lt;/P&gt;

&lt;PRE class="brush:bash;"&gt;$ sudo miccheck
MicCheck 3.3-r1
Copyright 2013 Intel Corporation All Rights Reserved
Executing default tests for host
  Test 0: Check number of devices the OS sees in the system ... pass
  Test 1: Check mic driver is loaded ... fail
    mic driver not loaded
Status: FAIL
Failure: mic driver not loaded&lt;/PRE&gt;

&lt;P&gt;&lt;SPAN style="font-size: 1em;"&gt;It is obviously a driver-related issue; however, we have not "touched" the driver in a&lt;BR /&gt;
	very long time and MPSS seems to be working fine. Isn't the SCIF driver part of the&lt;BR /&gt;
	MPSS service? There seems to be an issue with this driver in particular.&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;We could definitely solve this issue by re-installing MPSS as a whole and updating&lt;BR /&gt;
	it as well (3.3 is quite old by now), however this is a time-consuming and potentially&lt;BR /&gt;
	volatile process, backwards-compatibility being an issue. Is there some other way&lt;BR /&gt;
	of solving this?&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="font-size: 1em;"&gt;Thank you for your time reading this post.&lt;BR /&gt;
	I eagerly await your responses,&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="font-size: 1em;"&gt;George Ch.&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 17 Oct 2016 10:19:35 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Archive/Unable-to-access-the-KNC-card/m-p/1116618#M74297</guid>
      <dc:creator>George_C_5</dc:creator>
      <dc:date>2016-10-17T10:19:35Z</dc:date>
    </item>
    <item>
      <title>mpss 3.3 is out of support</title>
      <link>https://community.intel.com/t5/Software-Archive/Unable-to-access-the-KNC-card/m-p/1116619#M74298</link>
      <description>&lt;P&gt;mpss 3.3 is out of support for a very long time; my guess is that you got lucky that the mic driver was working with the recent kernel updates. What happens when you do a&lt;/P&gt;

&lt;PRE class="brush:bash;"&gt;modprobe mic&lt;/PRE&gt;

&lt;P&gt;?&amp;nbsp; What do you see with 'dmesg' after that?&lt;/P&gt;
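
&lt;P&gt;To check whether the driver actually got loaded after the modprobe, something like the following might help. It is only a sketch, assuming a Linux host where /proc/modules lists the loaded modules; the helper name module_loaded and its optional second argument are made up for illustration and are not part of MPSS.&lt;/P&gt;

```shell
# Sketch: report whether a kernel module is currently loaded.
# "module_loaded" and its optional second argument are hypothetical names;
# /proc/modules is the standard list of loaded modules on Linux.
module_loaded() {
    name="$1"
    list="${2:-/proc/modules}"
    # Each line of /proc/modules starts with the module name followed by a space.
    grep -q "^${name} " "$list"
}

# Usage (on the host):
#   if module_loaded mic; then echo "mic is loaded"; else echo "mic is NOT loaded"; fi
#   dmesg | grep -i mic
```

&lt;P&gt;If it reports that mic is not loaded, the dmesg output right after the modprobe attempt is the next thing to look at.&lt;/P&gt;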

&lt;P&gt;Other than that, upgrading the mpss stack to 3.7.2 is fairly painless - but it depends on your setup, of course. If you're installing k1om rpms on the mic then remember to upgrade them as well.&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 18 Oct 2016 08:21:37 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Archive/Unable-to-access-the-KNC-card/m-p/1116619#M74298</guid>
      <dc:creator>JJK</dc:creator>
      <dc:date>2016-10-18T08:21:37Z</dc:date>
    </item>
    <item>
      <title>Executing the following</title>
      <link>https://community.intel.com/t5/Software-Archive/Unable-to-access-the-KNC-card/m-p/1116620#M74299</link>
      <description>&lt;P&gt;Executing the following command:&lt;/P&gt;

&lt;P&gt;&lt;SPAN style="font-family: Consolas, &amp;quot;Lucida Console&amp;quot;, Menlo, Monaco, &amp;quot;DejaVu Sans Mono&amp;quot;, monospace, sans-serif; font-size: 1em;"&gt;modprobe mic&lt;/SPAN&gt;&lt;/P&gt;

&lt;P&gt;results in the following output:&lt;/P&gt;

&lt;PRE class="brush:;"&gt;$ modprobe mic
modprobe: FATAL: Module mic not found.&lt;/PRE&gt;

&lt;P&gt;&lt;SPAN style="font-size: 1em;"&gt;The dmesg command prints a&amp;nbsp;&lt;STRONG&gt;lot&lt;/STRONG&gt;&amp;nbsp;of output, but a simple grep finds nothing&lt;BR /&gt;
	that seems relevant to the Phi:&lt;/SPAN&gt;&lt;/P&gt;

&lt;PRE class="brush:;"&gt;$ dmesg | grep Phi
$ dmesg | grep coprocessor&lt;/PRE&gt;

&lt;P&gt;&lt;SPAN style="font-size: 1em;"&gt;Upgrading to a newer version of mpss seems unavoidable.&lt;/SPAN&gt;&lt;BR /&gt;
	&lt;SPAN style="font-size: 1em;"&gt;I apologize if I sound ignorant, but how do I check for my k1om installation?&lt;BR /&gt;
	I do seem to have a relevant directory:&lt;/SPAN&gt;&lt;/P&gt;

&lt;PRE class="brush:;"&gt;/opt/mpss/3.3/sysroots/x86_64-mpsssdk-linux/usr/libexec/k1om-mpss-linux&lt;/PRE&gt;

&lt;P&gt;&lt;SPAN style="font-size: 1em;"&gt;Can you give some tips concerning the update of k1om (and perhaps mpss as well)?&lt;BR /&gt;
	Thank you for your reply!&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 18 Oct 2016 10:13:40 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Archive/Unable-to-access-the-KNC-card/m-p/1116620#M74299</guid>
      <dc:creator>George_C_5</dc:creator>
      <dc:date>2016-10-18T10:13:40Z</dc:date>
    </item>
    <item>
      <title>most likely your newer kernel</title>
      <link>https://community.intel.com/t5/Software-Archive/Unable-to-access-the-KNC-card/m-p/1116621#M74300</link>
      <description>&lt;P&gt;most likely your newer kernel does not have a link to the mic.ko module; you can try adding a weak-updates link for the mic.ko module using&lt;/P&gt;

&lt;PRE class="brush:bash;"&gt;cd /lib/modules/`uname -r`/weak-updates
ln -fs /lib/modules/3.10.0-123.el7.x86_64/extra/mic.ko
depmod -ae

modprobe mic
&lt;/PRE&gt;

&lt;P&gt;if that does not work then you'll have to upgrade to mpss 3.7.2 ; if you did not install any MIC-side RPMs (which is specified in the mic configuration files in /etc/mpss/....) then you do not need to worry about the k1om rpms.&lt;/P&gt;
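
&lt;P&gt;To see which kernel trees actually contain a mic.ko (and therefore which path to link from), a rough sketch along these lines may be useful. The helper name find_module_files is hypothetical, and /lib/modules is assumed as the default root.&lt;/P&gt;

```shell
# Sketch: list every copy of a module file under a modules root, so you can
# see which kernel release it was installed for.
# "find_module_files" is a hypothetical helper name, not an MPSS tool.
find_module_files() {
    module="$1"
    root="${2:-/lib/modules}"
    # -name also matches symlinks such as those under weak-updates.
    find "$root" -name "${module}.ko" 2>/dev/null | sort
}

# Usage (on the host):
#   find_module_files mic
#   uname -r     # compare the running kernel with the directories listed above
```

&lt;P&gt;If the running kernel's directory does not show up in the list, modprobe has nothing to find for that kernel, which is what the weak-updates link is meant to fix.&lt;/P&gt;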

&lt;P&gt;&amp;nbsp;&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 19 Oct 2016 06:50:17 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Archive/Unable-to-access-the-KNC-card/m-p/1116621#M74300</guid>
      <dc:creator>JJK</dc:creator>
      <dc:date>2016-10-19T06:50:17Z</dc:date>
    </item>
    <item>
      <title>Dear JJK,</title>
      <link>https://community.intel.com/t5/Software-Archive/Unable-to-access-the-KNC-card/m-p/1116622#M74301</link>
      <description>&lt;P&gt;Dear JJK,&lt;/P&gt;

&lt;P&gt;I am very thankful for your help. Establishing the weak link, as you proposed, did not help much. Thus, I moved on to the long-term solution of upgrading my mpss software. I tried to follow this guide by the book:&amp;nbsp;&lt;A href="http://registrationcenter-download.intel.com/akdlm/irc_nas/9669/readme.txt"&gt;http://registrationcenter-download.intel.com/akdlm/irc_nas/9669/readme.txt&lt;/A&gt;&lt;/P&gt;

&lt;P&gt;&lt;BR /&gt;
	Now, this process did not go exactly as planned. I was completely unable to locate several important files that should have been included in the mpss version that was installed - such as the uninstall script and some module files that needed recompiling. I attributed this to having an even older mpss version than 3.3.5 - seemingly a 3.3.1 version. My solution to this problem was actually downloading the online 3.3.5 mpss distribution, re-compiling those mpss modules and running that uninstall script. My solution seemingly worked.&lt;/P&gt;

&lt;P&gt;I then continued with the installation of the new 3.7.2 mpss software. I followed every step of the algorithm applicable for Red Hat 7.2. The software installed without any issues, following the guide that I mentioned religiously. I arrived at the step of actually checking the mic card before updating its flash files. The problem, unfortunately, persists:&lt;/P&gt;

&lt;PRE class="brush:;"&gt;$ modprobe mic
modprobe: FATAL: Module mic not found.
$ sudo micctrl --initdefaults
$ micctrl -s
  [Error] mic0: State failed - non existent MIC device
$ sudo micctrl -rw
[sudo] password for georgec: 
  [Error] mic0: Reset aborted - non existent MIC device
  [Error] mic0: Wait failed - non existent MIC device
&lt;/PRE&gt;

&lt;P&gt;Thus, updating the driver, at least host-side, did not solve my issues. Furthermore, I cannot proceed with the completion of the upgrade on the side of Phi.&lt;/P&gt;</description>
      <pubDate>Wed, 19 Oct 2016 15:22:00 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Archive/Unable-to-access-the-KNC-card/m-p/1116622#M74301</guid>
      <dc:creator>George_C_5</dc:creator>
      <dc:date>2016-10-19T15:22:00Z</dc:date>
    </item>
    <item>
      <title>did you install the mpss</title>
      <link>https://community.intel.com/t5/Software-Archive/Unable-to-access-the-KNC-card/m-p/1116623#M74302</link>
      <description>&lt;P&gt;did you install the mpss-modules rpm for the RHEL 7.2 kernel? is there a module present in /lib/modules/3.10.0-327.el7/extra ?&lt;/P&gt;

&lt;P&gt;what happens if you try to insert that module directly:&lt;/P&gt;

&lt;PRE class="brush:bash;"&gt;insmod /lib/modules/3.10.0-327.el7/extra/mic.ko&lt;/PRE&gt;

&lt;P&gt;And what does&lt;/P&gt;

&lt;PRE class="brush:bash;"&gt;modinfo mic
&lt;/PRE&gt;

&lt;P&gt;return?&amp;nbsp;&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 20 Oct 2016 08:48:59 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Archive/Unable-to-access-the-KNC-card/m-p/1116623#M74302</guid>
      <dc:creator>JJK</dc:creator>
      <dc:date>2016-10-20T08:48:59Z</dc:date>
    </item>
    <item>
      <title>I removed the previous</title>
      <link>https://community.intel.com/t5/Software-Archive/Unable-to-access-the-KNC-card/m-p/1116624#M74303</link>
      <description>&lt;P&gt;I removed the previous installation of MPSS (3.3) via the uninstall.sh script that I had to download from&amp;nbsp;&lt;A href="https://software.intel.com/en-us/articles/intel-manycore-platform-software-stack-mpss#lx37rel"&gt;&lt;/A&gt;&lt;A href="https://software.intel.com/en-us/articles/intel-manycore-platform-software-stack-mpss#lx37rel" target="_blank"&gt;https://software.intel.com/en-us/articles/intel-manycore-platform-software-stack-mpss#lx37rel&lt;/A&gt;&lt;BR /&gt;
	I&amp;nbsp;then downloaded, extracted, copied and installed MPSS 3.7 modules:&lt;/P&gt;

&lt;PRE class="brush:bash;"&gt;$ tar xvf mpss-3.7.2-linux.tar
$ cd mpss-3.7.2
$ uname -r
3.10.0-327.28.3.el7.x86_64
$ cp modules/mpss-modules-3.10.0-327.el7.x86_64-3.7.2-1.x86_64.rpm .
$ cp modules/mpss-modules-dev-3.10.0-327.el7.x86_64-3.7.2-1.x86_64.rpm .
$ sudo yum install *.rpm&lt;/PRE&gt;

&lt;P&gt;As for the module that you asked about:&lt;/P&gt;

&lt;PRE class="brush:bash;"&gt;$ ls /lib/modules/3.10.0-327.el7.x86_64/extra/
mic.ko&lt;/PRE&gt;

&lt;P&gt;Now for the good news: trying your insertion command fixed everything!&lt;/P&gt;

&lt;PRE class="brush:bash;"&gt;$ sudo insmod /lib/modules/3.10.0-327.el7.x86_64/extra/mic.ko
[sudo] password for georgec: 
$&lt;/PRE&gt;

&lt;P&gt;The command worked and I was able to continue the installation (updating flash and SMC).&lt;BR /&gt;
	Everything is working as it should be now and I am able to ssh into the card.&lt;BR /&gt;
	There is only one small detail that maybe you can shed some light onto:&lt;BR /&gt;
	I need to run the insertion command every time I boot the host, otherwise the mic card remains undetected.&lt;BR /&gt;
	Any ideas on how to fix this last aspect of the issue in a permanent fashion? I could just include the command on&lt;BR /&gt;
	a boot script, but is there some other way to fix it?&lt;/P&gt;

&lt;P&gt;Thank you JJK, you have been immensely helpful.&lt;/P&gt;</description>
      <pubDate>Thu, 20 Oct 2016 09:42:35 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Archive/Unable-to-access-the-KNC-card/m-p/1116624#M74303</guid>
      <dc:creator>George_C_5</dc:creator>
      <dc:date>2016-10-20T09:42:35Z</dc:date>
    </item>
    <item>
      <title>Good, now we're getting</title>
      <link>https://community.intel.com/t5/Software-Archive/Unable-to-access-the-KNC-card/m-p/1116625#M74304</link>
      <description>&lt;P&gt;Good, now we're getting somewhere.&lt;/P&gt;

&lt;P&gt;The next step is to ensure that the mic.ko module is found at boot time. Try the following&lt;/P&gt;

&lt;PRE class="brush:bash;"&gt;cd /lib/modules/`uname -r`/weak-updates
ln -fs /lib/modules/3.10.0-327.el7.x86_64/extra/mic.ko
depmod -ae

modinfo mic&lt;/PRE&gt;

&lt;P&gt;This will create the right "weak-updates" link for the mic.ko module. Afterwards, the mic.ko module should be loadable using&lt;/P&gt;

&lt;PRE class="brush:bash;"&gt;rmmod mic
modprobe mic
&lt;/PRE&gt;

&lt;P&gt;and the mpss stack will automatically insert this module at startup.&lt;/P&gt;
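
&lt;P&gt;The linking step above could also be wrapped in a small function, for example to repeat it after a later kernel update. This is only a sketch: the function name link_weak_update and its third argument (the modules root) are hypothetical, and it assumes the module was built for a kernel that is compatible with the target one.&lt;/P&gt;

```shell
# Sketch: create a weak-updates symlink for a module built for another kernel.
# "link_weak_update" and the optional modules-root argument are hypothetical.
link_weak_update() {
    src="$1"                      # e.g. /lib/modules/3.10.0-327.el7.x86_64/extra/mic.ko
    release="$2"                  # target kernel release, e.g. "$(uname -r)"
    root="${3:-/lib/modules}"
    if [ ! -f "$src" ]; then
        echo "module not found: $src"
        return 1
    fi
    if [ ! -d "$root/$release" ]; then
        echo "no module tree for kernel: $release"
        return 1
    fi
    mkdir -p "$root/$release/weak-updates"
    # -f replaces a stale link, -s makes it symbolic (same flags as above).
    ln -fs "$src" "$root/$release/weak-updates/$(basename "$src")"
}

# Usage (as root), followed by rebuilding the module dependency map:
#   link_weak_update /lib/modules/3.10.0-327.el7.x86_64/extra/mic.ko "$(uname -r)"
#   depmod -a
#   modprobe mic
```

&lt;P&gt;The function only creates the link; depmod and modprobe still have to be run afterwards, as in the commands above.&lt;/P&gt;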

&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 20 Oct 2016 19:08:46 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Archive/Unable-to-access-the-KNC-card/m-p/1116625#M74304</guid>
      <dc:creator>JJK</dc:creator>
      <dc:date>2016-10-20T19:08:46Z</dc:date>
    </item>
    <item>
      <title>Thank you JJK, the process</title>
      <link>https://community.intel.com/t5/Software-Archive/Unable-to-access-the-KNC-card/m-p/1116626#M74305</link>
      <description>&lt;P&gt;Thank you JJK, the process you described removed the need to manually call insmod at server booting.&lt;/P&gt;

&lt;PRE class="brush:bash;"&gt;$ modinfo mic
filename:       /lib/modules/3.10.0-327.28.3.el7.x86_64/weak-updates/mic.ko
license:        GPL
build_scmver:   2659671e21e0814014e442998fbdc8ff37d1c68e
build_ondate:   2016-08-09 15:39:20 -0400
build_bywhom:   qb_user@sid-bld06.pdx.intel.com
build_number:   0
license:        GPL
license:        GPL
rhelversion:    7.2
srcversion:     35F362554621AC4E6F74424
depends:        
vermagic:       3.10.0-327.el7.x86_64 SMP mod_unload modversions 
parm:           vnet:Vnet operating mode, one of: poll intr dma (vnetmode)
parm:           vnet_num_buffers:Number of buffers used by the VNET driver (int)
parm:           vnet_addr:Vnet driver host ring address (ulong)
parm:           ulimit:SCIF ulimit check (bool)
parm:           reg_cache:SCIF registration caching (bool)
parm:           huge_page:SCIF Huge Page Support (bool)
parm:           p2p:SCIF peer-to-peer (bool)
parm:           p2p_proxy:SCIF peer-to-peer proxy DMA support (bool)
parm:           watchdog:SCIF Watchdog (bool)
parm:           watchdog_auto_reboot:SCIF Watchdog auto reboot (bool)
parm:           msi:bool
parm:           mic_msi_enable:To enable MSIx in the driver.
parm:           pm_qos_cpu_dma_lat:int
parm:           mic_pm_qos_cpu_dma_lat:PM QoS CPU DMA latency in usecs.
parm:           ramoops_count:Maximum frame count for the ramoops driver. (int)
parm:           crash_dump:bool
parm:           mic_crash_dump_enabled:MIC Crash Dump enabled.
parm:           psmi:Enable/disable mic psmi (bool)
&lt;/PRE&gt;
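
&lt;P&gt;In the output above, the vermagic line names 3.10.0-327.el7.x86_64 while the running kernel reported earlier by uname -r is 3.10.0-327.28.3.el7.x86_64, which is consistent with the module being picked up through weak-updates. A small sketch for making that comparison; the function name vermagic_matches is hypothetical, and it assumes the kernel release is the first word of the vermagic string, as it is in this output:&lt;/P&gt;

```shell
# Sketch: compare the kernel release a module was built for (first word of
# its vermagic string) with a given kernel release.
# "vermagic_matches" is a hypothetical helper name.
vermagic_matches() {
    vermagic="$1"                 # e.g. output of: modinfo -F vermagic mic
    running="$2"                  # e.g. output of: uname -r
    built_for="${vermagic%% *}"   # keep only the text before the first space
    [ "$built_for" = "$running" ]
}

# Usage (on the host):
#   if vermagic_matches "$(modinfo -F vermagic mic)" "$(uname -r)"; then
#       echo "mic.ko was built for the running kernel"
#   else
#       echo "mic.ko was built for a different kernel release"
#   fi
```

&lt;P&gt;A mismatch here is not an error in itself; it just shows that the module comes from another kernel's tree.&lt;/P&gt;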

&lt;P&gt;Thank you for your help :)&lt;/P&gt;</description>
      <pubDate>Fri, 21 Oct 2016 08:31:00 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Archive/Unable-to-access-the-KNC-card/m-p/1116626#M74305</guid>
      <dc:creator>George_C_5</dc:creator>
      <dc:date>2016-10-21T08:31:00Z</dc:date>
    </item>
  </channel>
</rss>

