<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Quote:Taylor Kidd (Intel) in Software Archive</title>
    <link>https://community.intel.com/t5/Software-Archive/How-to-debug-mic-boot-problem/m-p/1008326#M32933</link>
    <description>&lt;P&gt;&lt;/P&gt;&lt;BLOCKQUOTE&gt;Taylor Kidd (Intel) wrote:&lt;BR /&gt;&lt;P&gt;&lt;/P&gt;

&lt;P&gt;Tommi,&lt;/P&gt;

&lt;P&gt;The experts want to know what you have done to isolate the problem.&amp;nbsp;&lt;/P&gt;

&lt;UL&gt;
	&lt;LI&gt;Are the cards rebooted the same way each time - for example, by issuing "service mpss restart"? If not, what is the process?&lt;/LI&gt;
&lt;/UL&gt;

&lt;P&gt;Yes, service mpss restart&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;

&lt;UL&gt;
	&lt;LI&gt;Is the problem reproducible if the card hasn't been touched? For example, once the coprocessors are both working, can you restart mpss 10 times in a row and get at least one failure?&lt;/LI&gt;
&lt;/UL&gt;

&lt;P&gt;Yes, I'd say it's over 50% probability that boot fails.&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;

&lt;UL&gt;
	&lt;LI&gt;Is this reproducible on multiple hosts? (if not, have the cards been re-seated....?)&lt;/LI&gt;
&lt;/UL&gt;

&lt;P&gt;Yes, I have 45 nodes with 2 mics each, and it's not a host-specific issue. Another user is facing the same issue:&lt;/P&gt;

&lt;P&gt;&lt;A href="https://software.intel.com/en-us/forums/topic/508661#comment-1787816"&gt;https://software.intel.com/en-us/forums/topic/508661#comment-1787816&lt;/A&gt;&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;

&lt;UL&gt;
	&lt;LI&gt;Is it sensitive to multi-card installs - can it be reproduced with only 1 card installed?&lt;/LI&gt;
&lt;/UL&gt;

&lt;P&gt;We have "only" multi-card nodes, and they are water cooled, so it's not possible to remove a card from a node.&lt;/P&gt;

&lt;P&gt;Is it possible to have mpss start only one mic at a time?&lt;/P&gt;

&lt;P&gt;Or insert some delay between mic startups?&lt;BR /&gt;
	&amp;nbsp;&lt;/P&gt;

&lt;P&gt;&lt;/P&gt;&lt;/BLOCKQUOTE&gt;&lt;P&gt;&lt;/P&gt;</description>
    <pubDate>Thu, 15 May 2014 08:24:46 GMT</pubDate>
    <dc:creator>Tommi_T_</dc:creator>
    <dc:date>2014-05-15T08:24:46Z</dc:date>
    <item>
      <title>How to debug mic boot problem?</title>
      <link>https://community.intel.com/t5/Software-Archive/How-to-debug-mic-boot-problem/m-p/1008322#M32929</link>
      <description>&lt;P&gt;Since MPSS 3.2 the mics do not boot reliably. The following error appears on the console: "Initramfs unpacking failed: junk in compressed archive"&lt;/P&gt;

&lt;P&gt;There are two 7120 cards on the host, and the boot failure occurs quite often. If I restart the mpss service, the problem may disappear and both cards work fine. After another mpss restart, the "Initramfs unpacking failed" error is suddenly back in the logs. It may be mic0 or mic1 that fails, never both.&lt;/P&gt;

&lt;P&gt;If I enable VerboseLogging, the "Initramfs unpacking failed" error message disappears but the problem does not.&lt;/P&gt;

&lt;P&gt;[&amp;nbsp;&amp;nbsp; 82.940477] System halted.&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp; 82.942345] mic_shutdown: system state 2 dbreg 0x80000002&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp;&amp;nbsp; 0.000000] SFI: Entering sfi_map_memory, phys = e0000, size = 131071&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp;&amp;nbsp; 0.000000] SFI: Entering sfi_map_memory, phys = ef180, size = 32&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp;&amp;nbsp; 0.000000] SFI: Entering sfi_map_table, pa = 92000&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp;&amp;nbsp; 0.000000] SFI: Entering sfi_map_memory, phys = 92000, size = 24&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp;&amp;nbsp; 0.000000] SFI: sfi_map_table, th = ffffffffff4ba000&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp;&amp;nbsp; 0.000000] SFI: Entering sfi_map_memory, phys = 92000, size = 1000&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp;&amp;nbsp; 0.000000] SFI: Entering sfi_map_table, pa = ef1c8&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp;&amp;nbsp; 0.000000] SFI: sfi_map_table, th = ffffffffff47a1c8&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp;&amp;nbsp; 0.000000] SFI: Entering sfi_map_table, pa = ef000&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp;&amp;nbsp; 0.000000] SFI: sfi_map_table, th = ffffffffff47a000&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp;&amp;nbsp; 0.000000] SFI: Entering sfi_map_memory, phys = ef000, size = 312&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp;&amp;nbsp; 0.000000] SFI: Entering sfi_map_table, pa = 92000&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp;&amp;nbsp; 0.000000] SFI: Entering sfi_map_memory, phys = 92000, size = 24&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp;&amp;nbsp; 0.000000] SFI: sfi_map_table, th = ffffffffff4ba000&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp;&amp;nbsp; 0.000000] SFI: Entering sfi_map_memory, phys = 92000, size = 1000&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp;&amp;nbsp; 0.000000] SFI: Entering sfi_map_table, pa = 92000&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp;&amp;nbsp; 0.000000] SFI: Entering sfi_map_memory, phys = 92000, size = 24&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp;&amp;nbsp; 0.000000] SFI: sfi_map_table, th = ffffffffff4ba000&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp;&amp;nbsp; 0.000000] SFI: Entering sfi_map_memory, phys = 92000, size = 1000&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp;&amp;nbsp; 0.000000] SFI: Entering sfi_map_table, pa = ef1c8&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp;&amp;nbsp; 0.000000] SFI: sfi_map_table, th = ffffffffff47a1c8&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp;&amp;nbsp; 0.000000] PCI: Warning: Cannot find a gap in the 32bit address range&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp;&amp;nbsp; 0.000000] PCI: Unassigned devices with 32bit resource registers may break!&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp;&amp;nbsp; 0.010000] SFI: Entering sfi_map_memory, phys = ef180, size = 48&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp;&amp;nbsp; 0.010000] SFI: Entering sfi_map_table, pa = 92000&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp;&amp;nbsp; 0.010000] SFI: Entering sfi_map_memory, phys = 92000, size = 24&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp;&amp;nbsp; 0.010000] SFI: sfi_map_table, th = ffffc90000000000&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp;&amp;nbsp; 0.010000] SFI: Entering sfi_map_memory, phys = 92000, size = 1000&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp;&amp;nbsp; 0.010000] SFI: Entering sfi_map_table, pa = ef1c8&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp;&amp;nbsp; 0.010000] SFI: sfi_map_table, th = ffff8800000ef1c8&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp;&amp;nbsp; 0.010000] SFI: Entering sfi_map_table, pa = ef000&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp;&amp;nbsp; 0.010000] SFI: sfi_map_table, th = ffff8800000ef000&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp;&amp;nbsp; 0.010000] SFI: Entering sfi_map_memory, phys = ef000, size = 312&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp;&amp;nbsp; 0.010000] SFI: Entering sfi_map_table, pa = 92000&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp;&amp;nbsp; 0.010000] SFI: Entering sfi_map_memory, phys = 92000, size = 24&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp;&amp;nbsp; 0.010000] SFI: sfi_map_table, th = ffffc90000000000&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp;&amp;nbsp; 0.010000] SFI: Entering sfi_map_memory, phys = 92000, size = 1000&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp;&amp;nbsp; 0.010000] SFI: Entering sfi_map_table, pa = ef1c8&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp;&amp;nbsp; 0.010000] SFI: sfi_map_table, th = ffff8800000ef1c8&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp;&amp;nbsp; 0.010000] SFI: Entering sfi_map_table, pa = ef000&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp;&amp;nbsp; 0.010000] SFI: sfi_map_table, th = ffff8800000ef000&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp;&amp;nbsp; 0.010000] SFI: Entering sfi_map_memory, phys = ef000, size = 312&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp;&amp;nbsp; 2.634597] Initramfs unpacking failed: junk in compressed archive&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp;&amp;nbsp; 4.061908] i8042: Can't read CTR while initializing i8042&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp;&amp;nbsp; 6.704162] Have you set virtblk file?&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp; 13.239568] [ pm_scif_init : 348 ]:==&amp;gt; pm_scif_init&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp; 13.239590] [ pm_scif_init : 349 ]:pm_scif insmoded&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp; 13.239643] [ pm_scif_init : 377 ]: scif_bind successfull. Local port number = 1089, ep =&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp; 13.240538] [ pm_recv_from_host : 182 ]:==&amp;gt; pm_recv_from_host&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp; 13.240574] [ pm_handle_open : 88 ]:==&amp;gt; pm_handle_open&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp; 13.240707] [ pm_recv_from_host : 182 ]:==&amp;gt; pm_recv_from_host&lt;/P&gt;

&lt;P&gt;Intel MIC Platform Software Stack (Built by Poky 7.0) 3.2.1 m40-mic0 hvc0&lt;/P&gt;

&lt;P&gt;Here is the boot log from a failed boot with VerboseLogging enabled:&lt;/P&gt;

&lt;P&gt;Unmounting local filesystems...&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp; 96.880582] Preparing to shutdown kernel&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp; 96.880612] md: stopping all md devices.&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp; 98.013776] card: scif node 1 exiting&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp; 98.018264] Deregistered interrupt handler for node 0, for IRQ = 17,handle = 0&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp; 98.072678] Back from notifier call&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp; 98.073544] System halted.&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp; 98.074980] mic_shutdown: system state 2 dbreg 0x80000002&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp;&amp;nbsp; 0.000000] Initializing cgroup subsys cpuset&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp;&amp;nbsp; 0.000000] Initializing cgroup subsys cpu&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp;&amp;nbsp; 0.000000] Linux version 2.6.38.8+mpss3.2.1 (build@yocto-182-71) (gcc version 4.7.0 20110509 (experimental) (GCC) ) #1 SMP Wed Apr 2 08:52:20 PDT 2014&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp;&amp;nbsp; 0.000000] Command line: card=0 vnet=dma scif_id=1 scif_addr=0x8474fae780 vnet_addr=0x84761c0118 vcons_hdr_addr=0x8474fa5440 virtio_addr=[&amp;nbsp;&amp;nbsp;&amp;nbsp; 7.420826] RAS.init: module operational&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp; 13.061375] Module mpssboot loaded at 0xffffffffa0003000&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp; 13.069231] MPSSBOOT Time of day sycned with host&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp; 13.096002] Module pm_scif loaded at 0xffffffffa0016000&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp; 13.101490] [ pm_scif_init : 348 ]:==&amp;gt; pm_scif_init&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp; 13.101516] [ pm_scif_init : 349 ]:pm_scif insmoded&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp; 13.101559] [ pm_scif_init : 377 ]: scif_bind successfull. Local port number = 1089, ep =&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp; 13.102659] [ pm_recv_from_host : 182 ]:==&amp;gt; pm_recv_from_host&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp; 13.102695] [ pm_handle_open : 88 ]:==&amp;gt; pm_handle_open&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp; 13.102792] [ pm_recv_from_host : 182 ]:==&amp;gt; pm_recv_from_host&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp; 13.688581] Module blcr_imports loaded at 0xffffffffa0009000&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp; 13.712853] Module blcr loaded at 0xffffffffa009b000&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp; 13.734372] blcr: vmadump: (from bproc-"4.0.0pre8") Erik Hendriks &amp;lt;erik@hendriks.cx&amp;gt;&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp; 13.734405] blcr: vmadump: Modified for blcr 0.8.5 &amp;lt;http://ftg.lbl.gov/checkpoint&amp;gt;&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp; 13.734427] blcr: Berkeley Lab Checkpoint/Restart (BLCR) module version 0.8.5.&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp; 13.734448] blcr:&amp;nbsp;&amp;nbsp; Parameter cr_io_max = 0x4000000&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp; 13.734461] blcr:&amp;nbsp;&amp;nbsp; Supports kernel interface version 0.10.3.&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp; 13.734477] blcr:&amp;nbsp;&amp;nbsp; Supports context file format versions 8 though 9.&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp; 13.734495] blcr: &lt;A href="http://ftg.lbl.gov/checkpoint" target="_blank"&gt;http://ftg.lbl.gov/checkpoint&lt;/A&gt;&lt;BR /&gt;
	[&amp;nbsp;&amp;nbsp; 13.809241] MPSSBOOT Boot acknowledged&lt;/P&gt;

&lt;P&gt;Intel MIC Platform Software Stack (Built by Poky 7.0) 3.2.1 m40-mic0 hvc0&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 05 May 2014 14:53:35 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Archive/How-to-debug-mic-boot-problem/m-p/1008322#M32929</guid>
      <dc:creator>Tommi_T_</dc:creator>
      <dc:date>2014-05-05T14:53:35Z</dc:date>
    </item>
    <item>
      <title>[root@m40 ~]# miccheck</title>
      <link>https://community.intel.com/t5/Software-Archive/How-to-debug-mic-boot-problem/m-p/1008323#M32930</link>
      <description>&lt;P&gt;[root@m40 ~]# miccheck&lt;BR /&gt;
	MicCheck 3.2.1-r1&lt;BR /&gt;
	Copyright 2013 Intel Corporation All Rights Reserved&lt;/P&gt;

&lt;P&gt;Executing default tests for host&lt;BR /&gt;
	&amp;nbsp; Test 0: Check number of devices the OS sees in the system ... pass&lt;BR /&gt;
	&amp;nbsp; Test 1: Check mic driver is loaded ... pass&lt;BR /&gt;
	&amp;nbsp; Test 2: Check number of devices driver sees in the system ... pass&lt;BR /&gt;
	&amp;nbsp; Test 3: Check mpssd daemon is running ... pass&lt;BR /&gt;
	Executing default tests for device: 0&lt;BR /&gt;
	&amp;nbsp; Test 4 (mic0): Check device is in online state and its postcode is FF ... pass&lt;BR /&gt;
	&amp;nbsp; Test 5 (mic0): Check ras daemon is available in device ... pass&lt;BR /&gt;
	&amp;nbsp; Test 6 (mic0): Check running flash version is correct ... pass&lt;BR /&gt;
	Executing default tests for device: 1&lt;BR /&gt;
	&amp;nbsp; Test 7 (mic1): Check device is in online state and its postcode is FF ... pass&lt;BR /&gt;
	&amp;nbsp; Test 8 (mic1): Check ras daemon is available in device ... pass&lt;BR /&gt;
	&amp;nbsp; Test 9 (mic1): Check running flash version is correct ... pass&lt;/P&gt;

&lt;P&gt;Status: OK&lt;/P&gt;

&lt;P&gt;[root@m40 ~]# micctrl --config&lt;/P&gt;

&lt;P&gt;mic0:&lt;BR /&gt;
	=============================================================&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp;&amp;nbsp; Config Version: 1.1&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp; Linux Kernel:&amp;nbsp;&amp;nbsp; /usr/share/mpss/boot/bzImage-knightscorner&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp;&amp;nbsp; BootOnStart:&amp;nbsp;&amp;nbsp;&amp;nbsp; Enabled&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp;&amp;nbsp; Shutdowntimeout: 300 seconds&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp; ExtraCommandLine: highres=off&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp;&amp;nbsp; PowerManagment: cpufreq_on;corec6_on;pc3_on;pc6_on&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp; Root Device:&amp;nbsp;&amp;nbsp; Dynamic Ram Filesystem /var/mpss/mic0.image.gz from:&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Base:&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; CPIO /usr/share/mpss/boot/initramfs-knightscorner.cpio.gz&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Overlay&amp;nbsp;&amp;nbsp;&amp;nbsp; Filelist /opt/intel/mic/ofed/ /opt/intel/mic/ofed/ofed.filelist on&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Overlay&amp;nbsp;&amp;nbsp;&amp;nbsp; RPM /opt/intel/mic/filesystem on&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; CommonDir: Directory /var/mpss/common&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Micdir:&amp;nbsp;&amp;nbsp;&amp;nbsp; Directory /var/mpss/mic0&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp; Network:&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Static bridge br0&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; MIC IP:&amp;nbsp;&amp;nbsp;&amp;nbsp; 10.10.5.40&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Host IP:&amp;nbsp;&amp;nbsp; 10.10.4.40&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Net Bits:&amp;nbsp; 16&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; NetMask:&amp;nbsp;&amp;nbsp; 255.255.0.0&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; MtuSize:&amp;nbsp;&amp;nbsp; 1500&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Hostname:&amp;nbsp; m40-mic0&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; MIC MAC:&amp;nbsp;&amp;nbsp; 4c:79:ba:4c:01:18&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Host MAC:&amp;nbsp; 4c:79:ba:4c:01:19&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp; Cgroup:&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Memory:&amp;nbsp;&amp;nbsp;&amp;nbsp; Enabled&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp; Console:&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; hvc0&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp;&amp;nbsp; VerboseLogging: Enabled&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp;&amp;nbsp; CrashDump:&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; /var/crash/mic 16GB&lt;/P&gt;

&lt;P&gt;mic1:&lt;BR /&gt;
	=============================================================&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp;&amp;nbsp; Config Version: 1.1&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp; Linux Kernel:&amp;nbsp;&amp;nbsp; /usr/share/mpss/boot/bzImage-knightscorner&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp;&amp;nbsp; BootOnStart:&amp;nbsp;&amp;nbsp;&amp;nbsp; Enabled&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp;&amp;nbsp; Shutdowntimeout: 300 seconds&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp; ExtraCommandLine: highres=off&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp;&amp;nbsp; PowerManagment: cpufreq_on;corec6_on;pc3_on;pc6_on&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp; Root Device:&amp;nbsp;&amp;nbsp; Dynamic Ram Filesystem /var/mpss/mic1.image.gz from:&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Base:&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; CPIO /usr/share/mpss/boot/initramfs-knightscorner.cpio.gz&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Overlay&amp;nbsp;&amp;nbsp;&amp;nbsp; Filelist /opt/intel/mic/ofed/ /opt/intel/mic/ofed/ofed.filelist on&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Overlay&amp;nbsp;&amp;nbsp;&amp;nbsp; RPM /opt/intel/mic/filesystem on&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; CommonDir: Directory /var/mpss/common&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Micdir:&amp;nbsp;&amp;nbsp;&amp;nbsp; Directory /var/mpss/mic1&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp; Network:&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Static bridge br0&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; MIC IP:&amp;nbsp;&amp;nbsp;&amp;nbsp; 10.10.6.40&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Host IP:&amp;nbsp;&amp;nbsp; 10.10.4.40&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Net Bits:&amp;nbsp; 16&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; NetMask:&amp;nbsp;&amp;nbsp; 255.255.0.0&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; MtuSize:&amp;nbsp;&amp;nbsp; 1500&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Hostname:&amp;nbsp; m40-mic1&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; MIC MAC:&amp;nbsp;&amp;nbsp; 4c:79:ba:4c:00:be&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Host MAC:&amp;nbsp; 4c:79:ba:4c:00:bf&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp; Cgroup:&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Memory:&amp;nbsp;&amp;nbsp;&amp;nbsp; Enabled&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp; Console:&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; hvc0&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp;&amp;nbsp; VerboseLogging: Enabled&lt;BR /&gt;
	&amp;nbsp;&amp;nbsp;&amp;nbsp; CrashDump:&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; /var/crash/mic 16GB&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 05 May 2014 14:55:59 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Archive/How-to-debug-mic-boot-problem/m-p/1008323#M32930</guid>
      <dc:creator>Tommi_T_</dc:creator>
      <dc:date>2014-05-05T14:55:59Z</dc:date>
    </item>
    <item>
      <title>Tommi,</title>
      <link>https://community.intel.com/t5/Software-Archive/How-to-debug-mic-boot-problem/m-p/1008324#M32931</link>
      <description>&lt;P&gt;Tommi,&lt;/P&gt;

&lt;P&gt;We are working on this issue for you. As soon as I get an update, I'll let you know.&lt;/P&gt;

&lt;P&gt;Regards&lt;BR /&gt;
	--&lt;BR /&gt;
	Taylor&lt;BR /&gt;
	&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 08 May 2014 17:49:34 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Archive/How-to-debug-mic-boot-problem/m-p/1008324#M32931</guid>
      <dc:creator>TaylorIoTKidd</dc:creator>
      <dc:date>2014-05-08T17:49:34Z</dc:date>
    </item>
    <item>
      <title>Tommi,</title>
      <link>https://community.intel.com/t5/Software-Archive/How-to-debug-mic-boot-problem/m-p/1008325#M32932</link>
      <description>&lt;P&gt;Tommi,&lt;/P&gt;

&lt;P&gt;The experts want to know what you have done to isolate the problem.&amp;nbsp;&lt;/P&gt;

&lt;UL&gt;
	&lt;LI&gt;Are the cards rebooted the same way each time - for example, by issuing "service mpss restart"? If not, what is the process?&lt;/LI&gt;
	&lt;LI&gt;Is the problem reproducible if the card hasn't been touched? For example, once the coprocessors are both working, can you restart mpss 10 times in a row and get at least one failure?&lt;/LI&gt;
	&lt;LI&gt;Is this reproducible on multiple hosts? (if not, have the cards been re-seated....?)&lt;/LI&gt;
	&lt;LI&gt;Is it sensitive to multi-card installs - can it be reproduced with only 1 card installed?&lt;/LI&gt;
&lt;/UL&gt;
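
&lt;P&gt;One way to collect that number is to script the restarts. The sketch below rests on several assumptions: count_boot_failures is a made-up name, BootOnStart is assumed to be enabled so that restarting mpss boots the cards, and a card is treated as failed when an ssh probe to it does not succeed.&lt;/P&gt;

```shell
#!/bin/bash
# Hypothetical test loop: restart mpss RUNS times and count how many restarts
# leave at least one card that does not answer an ssh probe.
# Usage: count_boot_failures RUNS CARD...
count_boot_failures() {
  local runs="$1"
  shift
  local failures=0
  local run=0
  local mic bad
  while [ "$run" -lt "$runs" ]; do
    run=$((run + 1))
    service mpss restart
    bad=0
    for mic in "$@"; do
      micctrl -w "$mic"              # wait until the card has finished booting
      if ! ssh "$mic" uname -r; then
        bad=1                        # assumed failure signal: the ssh probe fails
      fi
    done
    failures=$((failures + bad))
  done
  echo "$failures of $runs restarts had a failed card"
}
```

&lt;P&gt;For example, count_boot_failures 10 mic0 mic1 would cover the ten restarts asked about above.&lt;/P&gt;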

&lt;P&gt;Regards&lt;BR /&gt;
	--&lt;BR /&gt;
	Taylor&lt;BR /&gt;
	&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 14 May 2014 23:11:21 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Archive/How-to-debug-mic-boot-problem/m-p/1008325#M32932</guid>
      <dc:creator>TaylorIoTKidd</dc:creator>
      <dc:date>2014-05-14T23:11:21Z</dc:date>
    </item>
    <item>
      <title>Quote:Taylor Kidd (Intel)</title>
      <link>https://community.intel.com/t5/Software-Archive/How-to-debug-mic-boot-problem/m-p/1008326#M32933</link>
      <description>&lt;P&gt;&lt;/P&gt;&lt;BLOCKQUOTE&gt;Taylor Kidd (Intel) wrote:&lt;BR /&gt;&lt;P&gt;&lt;/P&gt;

&lt;P&gt;Tommi,&lt;/P&gt;

&lt;P&gt;The experts want to know what you have done to isolate the problem.&amp;nbsp;&lt;/P&gt;

&lt;UL&gt;
	&lt;LI&gt;Are the cards rebooted the same way each time - for example, by issuing "service mpss restart"? If not, what is the process?&lt;/LI&gt;
&lt;/UL&gt;

&lt;P&gt;Yes, service mpss restart&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;

&lt;UL&gt;
	&lt;LI&gt;Is the problem reproducible if the card hasn't been touched? For example, once the coprocessors are both working, can you restart mpss 10 times in a row and get at least one failure?&lt;/LI&gt;
&lt;/UL&gt;

&lt;P&gt;Yes, I'd say it's over 50% probability that boot fails.&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;

&lt;UL&gt;
	&lt;LI&gt;Is this reproducible on multiple hosts? (if not, have the cards been re-seated....?)&lt;/LI&gt;
&lt;/UL&gt;

&lt;P&gt;Yes, I have 45 nodes with 2 mics each, and it's not a host-specific issue. Another user is facing the same issue:&lt;/P&gt;

&lt;P&gt;&lt;A href="https://software.intel.com/en-us/forums/topic/508661#comment-1787816"&gt;https://software.intel.com/en-us/forums/topic/508661#comment-1787816&lt;/A&gt;&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;

&lt;UL&gt;
	&lt;LI&gt;Is it sensitive to multi-card installs - can it be reproduced with only 1 card installed?&lt;/LI&gt;
&lt;/UL&gt;

&lt;P&gt;We have "only" multi-card nodes, and they are water cooled, so it's not possible to remove a card from a node.&lt;/P&gt;

&lt;P&gt;Is it possible to have mpss start only one mic at a time?&lt;/P&gt;

&lt;P&gt;Or insert some delay between mic startups?&lt;BR /&gt;
	&amp;nbsp;&lt;/P&gt;

&lt;P&gt;&lt;/P&gt;&lt;/BLOCKQUOTE&gt;&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 15 May 2014 08:24:46 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Archive/How-to-debug-mic-boot-problem/m-p/1008326#M32933</guid>
      <dc:creator>Tommi_T_</dc:creator>
      <dc:date>2014-05-15T08:24:46Z</dc:date>
    </item>
    <item>
      <title>micctrl -r mic0 sleep 2 micctrl</title>
      <link>https://community.intel.com/t5/Software-Archive/How-to-debug-mic-boot-problem/m-p/1008327#M32934</link>
      <description>&lt;P&gt;micctrl -r mic0&lt;/P&gt;
&lt;P&gt;sleep 2&lt;/P&gt;
&lt;P&gt;micctrl -r mic1&lt;/P&gt;
&lt;P&gt;sleep 2&lt;/P&gt;
&lt;P&gt;micctrl -b mic0&lt;/P&gt;
&lt;P&gt;micctrl -w mic0 # waits until boot 0 is done&lt;/P&gt;
&lt;P&gt;micctrl -b mic1&lt;/P&gt;
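
&lt;P&gt;The same sequence could be wrapped in a small function so it works for any number of cards. This is only a sketch: boot_mics_serially is a made-up name, the 2-second pause is copied from the commands above, and the final wait on the last card is an addition.&lt;/P&gt;

```shell
#!/bin/bash
# Hypothetical wrapper around the manual sequence: reset every card first,
# then boot the cards one at a time, waiting for each before starting the next.
boot_mics_serially() {
  local mic
  for mic in "$@"; do
    micctrl -r "$mic"   # reset the card
    sleep 2             # short pause, as in the manual sequence
  done
  for mic in "$@"; do
    micctrl -b "$mic"   # start booting this card
    micctrl -w "$mic"   # wait until its boot is done
  done
}
```

&lt;P&gt;It would be called as boot_mics_serially mic0 mic1.&lt;/P&gt;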
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 15 May 2014 14:26:02 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Archive/How-to-debug-mic-boot-problem/m-p/1008327#M32934</guid>
      <dc:creator>Michael_H_Intel1</dc:creator>
      <dc:date>2014-05-15T14:26:02Z</dc:date>
    </item>
    <item>
      <title>I forgot - first edit:  /etc</title>
      <link>https://community.intel.com/t5/Software-Archive/How-to-debug-mic-boot-problem/m-p/1008328#M32935</link>
      <description>&lt;P&gt;I forgot - first edit:&lt;/P&gt;
&lt;P&gt;&amp;nbsp; /etc/mpss/mic[0,1].conf&lt;/P&gt;
&lt;P&gt;set&lt;/P&gt;
&lt;P&gt;&amp;nbsp; BootOnStart Enabled&lt;/P&gt;
&lt;P&gt;to&lt;/P&gt;
&lt;P&gt;&amp;nbsp; BootOnStart Disabled&lt;/P&gt;
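
&lt;P&gt;If there are many nodes to change, the edit could be scripted. The sketch below assumes the line in the file reads "BootOnStart Enabled" as written above; disable_boot_on_start is a made-up helper name, and sed keeps a .bak copy of each file it touches.&lt;/P&gt;

```shell
#!/bin/bash
# Hypothetical helper: switch BootOnStart from Enabled to Disabled in the given
# config files. Assumes the setting is written as "BootOnStart Enabled".
disable_boot_on_start() {
  local conf
  for conf in "$@"; do
    # -i.bak edits the file in place and leaves the original as FILE.bak
    sed -i.bak 's/^BootOnStart[[:space:]][[:space:]]*Enabled/BootOnStart Disabled/' "$conf"
  done
}
```

&lt;P&gt;For the two cards here that would be disable_boot_on_start /etc/mpss/mic0.conf /etc/mpss/mic1.conf, run as root.&lt;/P&gt;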
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 15 May 2014 14:27:39 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Archive/How-to-debug-mic-boot-problem/m-p/1008328#M32935</guid>
      <dc:creator>Michael_H_Intel1</dc:creator>
      <dc:date>2014-05-15T14:27:39Z</dc:date>
    </item>
    <item>
      <title>Thanks for the hint, I</title>
      <link>https://community.intel.com/t5/Software-Archive/How-to-debug-mic-boot-problem/m-p/1008329#M32936</link>
      <description>&lt;P&gt;Thanks for the hint. I changed the configuration so that the mics boot serially, but it did not help :-(&lt;/P&gt;

&lt;P&gt;I rebooted 28 nodes and here are the results:&lt;/P&gt;

&lt;P&gt;9 nodes had both mics up.&lt;/P&gt;

&lt;P&gt;6 nodes had only mic0 up correctly.&lt;/P&gt;

&lt;P&gt;9 nodes had only mic1 up correctly.&lt;/P&gt;

&lt;P&gt;4 nodes had both mics fail.&lt;/P&gt;

&lt;P&gt;I reimaged all nodes, so there are no configuration differences between nodes.&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 15 May 2014 16:46:21 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Archive/How-to-debug-mic-boot-problem/m-p/1008329#M32936</guid>
      <dc:creator>Tommi_T_</dc:creator>
      <dc:date>2014-05-15T16:46:21Z</dc:date>
    </item>
    <item>
      <title>Tommi,</title>
      <link>https://community.intel.com/t5/Software-Archive/How-to-debug-mic-boot-problem/m-p/1008330#M32937</link>
      <description>&lt;P&gt;Tommi,&lt;/P&gt;

&lt;P&gt;Here are the experts' requests:&lt;/P&gt;

&lt;UL&gt;
	&lt;LI&gt;&lt;EM&gt;The line “[&amp;nbsp;&amp;nbsp; 13.809241] MPSSBOOT Boot acknowledged” in the output indicates it did boot.&amp;nbsp; Tell us the card's state using “micctrl -s”.&amp;nbsp;&amp;nbsp;&lt;/EM&gt;&lt;/LI&gt;
	&lt;LI&gt;&lt;EM&gt;Generally when this occurs it indicates a network setup issue.&amp;nbsp; I would start by using minicom to log into the card using the virtual console and looking at the network config.&amp;nbsp;&amp;nbsp;&lt;/EM&gt;&lt;/LI&gt;
	&lt;LI&gt;&lt;EM&gt;Do a “mkdir unpack; cd unpack; zcat /var/mpss/mic0.image.gz | (cpio -iv; cpio -iv)” and see if the initrd image unpacks on the host or not. &amp;nbsp;&lt;/EM&gt;&lt;/LI&gt;
	&lt;LI&gt;&lt;EM&gt;Send me the initrd image created by mpssd and defined by the RootDevice parameter (usually /var/mpss/mic0.image.gz).&amp;nbsp;&amp;nbsp;&lt;/EM&gt;&lt;/LI&gt;
&lt;/UL&gt;

&lt;P&gt;You can send the image to me in a private message. You might have to change its name so it makes it past the virus filters. If it still doesn't work, let me know and we can do so via email.&lt;/P&gt;

&lt;P&gt;Regards&lt;BR /&gt;
	--&lt;BR /&gt;
	Taylor&lt;BR /&gt;
	&amp;nbsp;&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 15 May 2014 17:00:34 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Archive/How-to-debug-mic-boot-problem/m-p/1008330#M32937</guid>
      <dc:creator>TaylorIoTKidd</dc:creator>
      <dc:date>2014-05-15T17:00:34Z</dc:date>
    </item>
    <item>
      <title>Quote:Taylor Kidd (Intel)</title>
      <link>https://community.intel.com/t5/Software-Archive/How-to-debug-mic-boot-problem/m-p/1008331#M32938</link>
      <description>&lt;P&gt;&lt;/P&gt;&lt;BLOCKQUOTE&gt;Taylor Kidd (Intel) wrote:&lt;BR /&gt;&lt;P&gt;&lt;/P&gt;

&lt;P&gt;Tommi,&lt;/P&gt;

&lt;P&gt;Here are the experts' requests:&lt;/P&gt;

&lt;UL&gt;
	&lt;LI&gt;&lt;EM&gt;The line “[&amp;nbsp;&amp;nbsp; 13.809241] MPSSBOOT Boot acknowledged” in the output indicates it did boot.&amp;nbsp; Tell us the card's state using “micctrl -s”.&amp;nbsp;&amp;nbsp;&lt;/EM&gt;&lt;/LI&gt;
	&lt;LI&gt;&lt;EM&gt;Generally when this occurs it indicates a network setup issue.&amp;nbsp; I would start by using minicom to log into the card using the virtual console and looking at the network config.&amp;nbsp;&amp;nbsp;&lt;/EM&gt;&lt;/LI&gt;
	&lt;LI&gt;&lt;EM&gt;Do a “mkdir unpack; cd unpack; zcat /var/mpss/mic0.image.gz | (cpio -iv; cpio -iv)” and see if the initrd image unpacks on the host or not. &amp;nbsp;&lt;/EM&gt;&lt;/LI&gt;
	&lt;LI&gt;&lt;EM&gt;Send me the initrd image created by mpssd and defined by the RootDevice parameter (usually /var/mpss/mic0.image.gz).&amp;nbsp;&amp;nbsp;&lt;/EM&gt;&lt;/LI&gt;
&lt;/UL&gt;

&lt;P&gt;You can send the image to me in a private message. You might have to change its name so it makes it past the virus filters. If it still doesn't work, let me know and we can do so via email.&lt;/P&gt;

&lt;P&gt;&amp;nbsp;
	&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;/BLOCKQUOTE&gt;&lt;P&gt;&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;

&lt;P&gt;Hi, my post was a bit unclear. The mics do boot, but because of the initramfs unpacking error a mic ends up using the "wrong" ssh_host_key. I use micctrl --hostkeys=/opt/intel/mic_host_keys/, which adds my cluster host keys to the overlay file system under /var/mpss/mic0,1/etc/ssh/. See &lt;A href="https://software.intel.com/en-us/forums/topic/508661#comment-1787816"&gt;https://software.intel.com/en-us/forums/topic/508661#comment-1787816&lt;/A&gt;&lt;/P&gt;

&lt;P&gt;I can extract /var/mpss/mic0,1/image.gz on the host without errors.&lt;/P&gt;</description>
      <pubDate>Fri, 16 May 2014 13:05:28 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Archive/How-to-debug-mic-boot-problem/m-p/1008331#M32938</guid>
      <dc:creator>Tommi_T_</dc:creator>
      <dc:date>2014-05-16T13:05:28Z</dc:date>
    </item>
    <item>
      <title>Well, I made simple script</title>
      <link>https://community.intel.com/t5/Software-Archive/How-to-debug-mic-boot-problem/m-p/1008332#M32939</link>
      <description>Well, I made a simple script that runs after the mpss service has started.
It keeps rebooting the mics until they manage to pick up the ssh keys from
the overlay filesystem.

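&lt;PRE class="brush:bash;"&gt;
#!/bin/bash
# Reboot each coprocessor until an ssh login succeeds, i.e. until the
# card has picked up the host keys from the overlay filesystem.

for mic in mic0 mic1
do
  # ssh returns non-zero when the login (or the remote command) fails
  until ssh "$mic" uname -r
  do
    micctrl --shutdown "$mic"
    micctrl -w "$mic"   # wait for the state change to finish
    micctrl -b "$mic"
    micctrl -w "$mic"   # wait for the state change to finish
  done
done&lt;/PRE&gt;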
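
&lt;P&gt;One caveat: the loop above never gives up if a card stays broken. A minimal sketch of a capped variant follows. MAX_TRIES, SSH, MICCTRL and the function name reboot_until_ssh are names made up for this example, not MPSS settings; the micctrl calls are the same ones used above.&lt;/P&gt;

&lt;PRE class="brush:bash;"&gt;
```shell
#!/bin/sh
# Hypothetical variant of the reboot loop with a cap on attempts.
# MAX_TRIES, SSH and MICCTRL are illustrative names, not MPSS settings.
MAX_TRIES=${MAX_TRIES:-5}
SSH=${SSH:-ssh}
MICCTRL=${MICCTRL:-micctrl}

# Reboot one card until ssh works.
# Returns 1 once MAX_TRIES ssh attempts have failed.
reboot_until_ssh() {
  mic=$1
  tries=0
  until "$SSH" "$mic" uname -r
  do
    tries=$((tries + 1))
    if [ "$tries" -ge "$MAX_TRIES" ]
    then
      echo "$mic: giving up after $tries failed ssh attempts"
      return 1
    fi
    "$MICCTRL" --shutdown "$mic"
    "$MICCTRL" -w "$mic"
    "$MICCTRL" -b "$mic"
    "$MICCTRL" -w "$mic"
  done
  return 0
}

# Example call, left commented out so that sourcing this file has no effect:
# for mic in mic0 mic1; do reboot_until_ssh "$mic" || exit 1; done
```
&lt;/PRE&gt;

&lt;P&gt;Keeping the command names in variables is only there so the function can be tried out with stand-ins on a machine without any cards.&lt;/P&gt;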

&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 20 May 2014 11:22:00 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Archive/How-to-debug-mic-boot-problem/m-p/1008332#M32939</guid>
      <dc:creator>Tommi_T_</dc:creator>
      <dc:date>2014-05-20T11:22:00Z</dc:date>
    </item>
    <item>
      <title>Hi Tommi,</title>
      <link>https://community.intel.com/t5/Software-Archive/How-to-debug-mic-boot-problem/m-p/1008333#M32940</link>
      <description>&lt;P&gt;Hi Tommi,&lt;/P&gt;

&lt;P&gt;We are still working on an answer. I hope to get back to you soon.&lt;/P&gt;

&lt;P&gt;Regards&lt;BR /&gt;
	---&lt;BR /&gt;
	Taylor&lt;BR /&gt;
	&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 20 May 2014 13:39:50 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Archive/How-to-debug-mic-boot-problem/m-p/1008333#M32940</guid>
      <dc:creator>TaylorIoTKidd</dc:creator>
      <dc:date>2014-05-20T13:39:50Z</dc:date>
    </item>
    <item>
      <title>I still have zero visibility</title>
      <link>https://community.intel.com/t5/Software-Archive/How-to-debug-mic-boot-problem/m-p/1008334#M32941</link>
      <description>&lt;P&gt;I still have zero visibility into what the error could be, so here are some more questions.&lt;/P&gt;
&lt;P&gt;1. Did you run the command "zcat /var/mpss/mic0.image.gz | (cpio -v; cpio -iv)" on the host? Note the double cpio.&lt;/P&gt;
&lt;P&gt;2. If so, does the output always contain your host keys, or does it change?&lt;/P&gt;
&lt;P&gt;3. How did you get visibility into the unpacking error message?&lt;/P&gt;
&lt;P&gt;4. I notice you have an RPM overlay at /opt/intel/mic/filesystem.&amp;nbsp; What is in this directory?&lt;/P&gt;
&lt;P&gt;5. Can I get access to your mic0.image.gz file so I can analyze it for errors?&lt;/P&gt;</description>
      <pubDate>Tue, 20 May 2014 17:16:32 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Archive/How-to-debug-mic-boot-problem/m-p/1008334#M32941</guid>
      <dc:creator>Johnnie_P_Intel</dc:creator>
      <dc:date>2014-05-20T17:16:32Z</dc:date>
    </item>
    <item>
      <title>Quote:Johnnie P. wrote:</title>
      <link>https://community.intel.com/t5/Software-Archive/How-to-debug-mic-boot-problem/m-p/1008335#M32942</link>
      <description>&lt;P&gt;&lt;/P&gt;&lt;BLOCKQUOTE&gt;Johnnie P. wrote:&lt;BR /&gt;&lt;P&gt;&lt;/P&gt;

&lt;P&gt;I still have zero visibility into what the error could be, so here are some more questions.&lt;/P&gt;

&lt;P&gt;1. Did you run the command "zcat /var/mpss/mic0.image.gz | (cpio -v; cpio -iv)" on the host? Note the double cpio.&lt;/P&gt;

&lt;P&gt;2. If so, does the output always contain your host keys, or does it change?&lt;/P&gt;

&lt;P&gt;3. How did you get visibility into the unpacking error message?&lt;/P&gt;

&lt;P&gt;4. I notice you have an RPM overlay at /opt/intel/mic/filesystem.&amp;nbsp; What is in this directory?&lt;/P&gt;

&lt;P&gt;5. Can I get access to your mic0.image.gz file so I can analyze it for errors?&lt;/P&gt;

&lt;P&gt;&lt;/P&gt;&lt;/BLOCKQUOTE&gt;&lt;P&gt;&lt;/P&gt;

&lt;P&gt;1. Yes, but the first cpio command is not valid; the second one extracts the image without errors:&amp;nbsp;&lt;/P&gt;

&lt;P&gt;[root@m37 asdasd]# zcat /var/mpss/mic0.image.gz | (cpio -v; cpio -i)&lt;BR /&gt;
	cpio: You must specify one of -oipt options.&lt;BR /&gt;
	Try `cpio --help' or `cpio --usage' for more information.&lt;/P&gt;

&lt;P&gt;106060 blocks&lt;BR /&gt;
	[root@m37 asdasd]# echo $?&lt;BR /&gt;
	0&lt;/P&gt;

&lt;P&gt;rpm -q cpio&lt;/P&gt;

&lt;P&gt;cpio-2.10-11.el6_3.x86_64&lt;/P&gt;

&lt;P&gt;2. ssh_host_keys are not inside the image file.&lt;/P&gt;

&lt;P&gt;3. The error message is visible on the serial console if verbose logging is disabled.&lt;/P&gt;

&lt;P&gt;4. Nothing&lt;/P&gt;

&lt;P&gt;5. Check your message box&lt;/P&gt;

&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 21 May 2014 08:26:15 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Archive/How-to-debug-mic-boot-problem/m-p/1008335#M32942</guid>
      <dc:creator>Tommi_T_</dc:creator>
      <dc:date>2014-05-21T08:26:15Z</dc:date>
    </item>
    <item>
      <title>So Got a copy of the image</title>
      <link>https://community.intel.com/t5/Software-Archive/How-to-debug-mic-boot-problem/m-p/1008336#M32943</link>
      <description>&lt;P&gt;So I got a copy of the image file, and on my Red Hat 6.2 host I cannot unpack it with cpio.&amp;nbsp; In the second cpio section I see it find the etc/rc5.d directory, and then I get the error:&lt;/P&gt;
&lt;P&gt;cpio:Substituting '.' for empty member&lt;/P&gt;
&lt;P&gt;cpio: premature end of file.&lt;/P&gt;
&lt;P&gt;I notice from your mic0.conf file that you have not upgraded to release 3.2.&amp;nbsp; The MicDir parameter still makes use of the mic0.filelist file.&amp;nbsp; To debug this further I will need to see the contents of that file.&amp;nbsp; It would be better if you sent me the whole mic0 directory so I can try to reproduce this with it.&lt;/P&gt;
&lt;P&gt;I would also suggest upgrading to the 3.2 release.&amp;nbsp; The use of the filelist file for MicDir and CommonDir has been removed.&amp;nbsp; The card's file system will be created with the files having the same user and permissions as they have on the host.&amp;nbsp; There have been a number of fixes to all areas of micctrl, and some of them may have an effect on this issue.&lt;/P&gt;</description>
      <pubDate>Thu, 22 May 2014 19:09:10 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Archive/How-to-debug-mic-boot-problem/m-p/1008336#M32943</guid>
      <dc:creator>Johnnie_P_Intel</dc:creator>
      <dc:date>2014-05-22T19:09:10Z</dc:date>
    </item>
    <item>
      <title>Sorry I may have copied the</title>
      <link>https://community.intel.com/t5/Software-Archive/How-to-debug-mic-boot-problem/m-p/1008337#M32944</link>
      <description>&lt;P&gt;Sorry, I may have copied the ramdisk image I downloaded from the puuppa site into a directory where there was already other stuff, and confused the issue.&amp;nbsp;So I need to change some of the questions.&lt;/P&gt;
&lt;P&gt;What release do you have installed?&lt;/P&gt;
&lt;P&gt;Can I get a copy of the files in the /var/mpss/mic0 directory so I can try the same thing here?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 22 May 2014 22:01:38 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Archive/How-to-debug-mic-boot-problem/m-p/1008337#M32944</guid>
      <dc:creator>Johnnie_P_Intel</dc:creator>
      <dc:date>2014-05-22T22:01:38Z</dc:date>
    </item>
    <item>
      <title>Also, have you checked the</title>
      <link>https://community.intel.com/t5/Software-Archive/How-to-debug-mic-boot-problem/m-p/1008338#M32945</link>
      <description>&lt;P&gt;Also, have you checked the log files on the host to see if there are any file system errors?&lt;/P&gt;

&lt;P&gt;An obscure fact to keep in mind is that the image files are remade every time the coprocessors are booted. So the only image file that counts when it comes to solving this problem is the one that was created during a failed boot attempt.&lt;/P&gt;</description>
      <pubDate>Thu, 22 May 2014 22:39:49 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Archive/How-to-debug-mic-boot-problem/m-p/1008338#M32945</guid>
      <dc:creator>Frances_R_Intel</dc:creator>
      <dc:date>2014-05-22T22:39:49Z</dc:date>
    </item>
    <item>
      <title>Tommi,</title>
      <link>https://community.intel.com/t5/Software-Archive/How-to-debug-mic-boot-problem/m-p/1008339#M32946</link>
      <description>&lt;P&gt;Tommi,&lt;/P&gt;

&lt;P&gt;Thank you for identifying the issue in the release notes.&lt;/P&gt;

&lt;PRE style="color: rgb(0, 0, 0); line-height: normal; word-wrap: break-word; white-space: pre-wrap;"&gt;Intel Tracking ID:  4868776
Affected OS:        All Linux
Description:        [Tools] MPSS-3.2 generates corrupted mic0.img.gz
                    file if /var/mpss/mic0/... containes softlinks to
                    files not existing in that file system tree
Notes:              Investigating&lt;/PRE&gt;

&lt;P&gt;For documentation purposes, the status is no longer "investigating". The solution is to update to MPSS-3.2.3 with OFED-1.5.4.1 instead of OFED-3.5-2-MIC-BETA.&lt;/P&gt;
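
&lt;P&gt;For anyone who cannot upgrade right away, the condition described in the release note can be checked by hand. The following is a minimal sketch, not an Intel tool. It assumes that "not existing in that file system tree" means the link target is missing once absolute targets are resolved against the root of the tree rather than the host's /, and the function name list_dangling_links is made up for illustration.&lt;/P&gt;

&lt;PRE class="brush:bash;"&gt;
```shell
#!/bin/sh
# List symlinks under a tree whose targets do not exist inside that tree.
# Absolute targets are resolved against the tree root, not the host's /.
# Sketch only: link chains are not followed, and file names containing
# newlines are not handled.
list_dangling_links() {
  root=$1
  find "$root" -type l | while IFS= read -r link
  do
    target=$(readlink "$link")
    case $target in
      /*) resolved=$root$target ;;
      *)  resolved=$(dirname "$link")/$target ;;
    esac
    # Print the link unless its target exists or is itself a symlink
    [ -e "$resolved" ] || [ -L "$resolved" ] || echo "$link"
  done
}
```
&lt;/PRE&gt;

&lt;P&gt;Running list_dangling_links /var/mpss/mic0 prints one link per line. No output would mean that no such links were found under this interpretation.&lt;/P&gt;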

&lt;P&gt;Regards&lt;BR /&gt;
	--&lt;BR /&gt;
	Taylor&lt;BR /&gt;
	&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 27 May 2014 15:54:48 GMT</pubDate>
      <guid>https://community.intel.com/t5/Software-Archive/How-to-debug-mic-boot-problem/m-p/1008339#M32946</guid>
      <dc:creator>TaylorIoTKidd</dc:creator>
      <dc:date>2014-05-27T15:54:48Z</dc:date>
    </item>
  </channel>
</rss>

