- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Dear all,
I am trouble installing ofed-mic in CentOS 7.3 with default kernel.
Linux node09 3.10.0-514.el7.x86_64 #1 SMP Tue Nov 22 16:42:41 UTC 2016 x86_64 x86_64 x86_64 GNU/Linux
I did my best to follow the installation steps in the MPSS 3.8.1 user guide and the Phi is working okay.
[root@node09 disc]# micctrl --status mic0: online (mode: linux image: /usr/share/mpss/boot/bzImage-knightscorner)
[root@node09 disc]# miccheck
MicCheck 3.8.1-1
Copyright (c) 2016, Intel Corporation.
Executing default tests for host
Test 0: Check number of devices the OS sees in the system ... pass
Test 1: Check mic driver is loaded ... pass
Test 2: Check number of devices driver sees in the system ... pass
Test 3: Check mpssd daemon is running ... pass
Executing default tests for device: 0
Test 4 (mic0): Check device is in online state and its postcode is FF ... pass
Test 5 (mic0): Check ras daemon is available in device ... pass
Test 6 (mic0): Check running flash version is correct ... pass
Test 7 (mic0): Check running SMC firmware version is correct ... pass
Status: OK
[root@node09 disc]# micinfo
MicInfo Utility Log
Created Thu Mar 2 14:07:33 2017
System Info
HOST OS : Linux
OS Version : 3.10.0-514.el7.x86_64
Driver Version : 3.8.1-1
MPSS Version : 3.8.1
Host Physical Memory : 257661 MB
Device No: 0, Device Name: mic0
Version
Flash Version : 2.1.02.0391
SMC Firmware Version : 1.17.6900
SMC Boot Loader Version : 1.8.4326
Coprocessor OS Version : 2.6.38.8+mpss3.8.1
Device Serial Number : ADKC33400518
Board
Vendor ID : 0x8086
Device ID : 0x225c
Subsystem ID : 0x7d95
Coprocessor Stepping ID : 2
PCIe Width : x16
PCIe Speed : 5 GT/s
PCIe Max payload size : 256 bytes
PCIe Max read req size : 512 bytes
Coprocessor Model : 0x01
Coprocessor Model Ext : 0x00
Coprocessor Type : 0x00
Coprocessor Family : 0x0b
Coprocessor Family Ext : 0x00
Coprocessor Stepping : C0
Board SKU : C0PRQ-7120 P/A/X/D
ECC Mode : Enabled
SMC HW Revision : Product 300W Passive CS
Cores
Total No of Active Cores : 61
Voltage : 0 uV
Frequency : 1238095 kHz
Thermal
Fan Speed Control : N/A
Fan RPM : N/A
Fan PWM : N/A
Die Temp : 45 C
GDDR
GDDR Vendor : Samsung
GDDR Version : 0x6
GDDR Density : 4096 Mb
GDDR Size : 15872 MB
GDDR Technology : GDDR5
GDDR Speed : 5.500000 GT/s
GDDR Frequency : 2750000 kHz
GDDR Voltage : 1501000 uV
I can also ssh to mic0 and back to the host by following this page https://software.intel.com/en-us/node/544138
Now, I would like to setup the OFED support on the Phi, but I found no solution so far. As suggested by the user guide, it works with OFED-3.18.2 and Mellanox MLNX-OFED 2.4.
http://registrationcenter-download.intel.com/akdlm/irc_nas/11194/mpss_users_guide.pdf
But, when I tried to compile OFED-3.18.2, it produced errors on
/var/tmp/OFED_topdir/BUILD/compat-rdma-3.18/include/linux/compat-3.16.h:25:19: error: redefinition of 'ktime_get_ns'
When I tried on MLNX-OFED 2.4, it said my OS is not supported. So, I tried the MLNX-OFED 3.4, but as I rebuild the rpms, it failed with
/root/rpmbuild/BUILD/ofed-driver/drivers/infiniband/ibp/cm/cm_server_msg.c:986:19: error: 'IB_QP_SMAC' undeclared (first use in this function)
So, I cannot enable ofed-mic service in my node.
Can anyone help?
Thanks,
Rolly
Link Copied
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hi Rolly,
I am able to reproduce the problem you saw when installing OFED-3.18-2 on a host system running RHEL 7.3.and MPSS 3.8.1 . Let me check with a MPSS/OFED expert here. I will get back to you.
Thanks
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hello Nguyen,
Thanks for your attention.
I can confirm that CentOS 7.1 works fine with OFED-3.18.2 and MPSS 3.8.1.
[qeuser@node09 ~]$ uname -a Linux node09 3.10.0-229.el7.x86_64 #1 SMP Fri Mar 6 11:36:42 UTC 2015 x86_64 x86_64 x86_64 GNU/Linux
I can the ibv_devinfo
[root@node09 qeuser]# ibv_devinfo
hca_id: scif0
transport: invalid transport (-1)
fw_ver: 0.0.1
node_guid: 4c79:baff:fe44:040d
sys_image_guid: 4c79:baff:fe44:040d
vendor_id: 0x8086
vendor_part_id: 0
hw_ver: 0x1
phys_port_cnt: 1
port: 1
state: PORT_ACTIVE (4)
max_mtu: 4096 (5)
active_mtu: 4096 (5)
sm_lid: 1
port_lid: 1000
port_lmc: 0x00
link_layer: Unknown
hca_id: mlx4_1
transport: InfiniBand (0)
fw_ver: 2.40.5030
node_guid: f452:1403:0042:f570
sys_image_guid: f452:1403:0042:f573
vendor_id: 0x02c9
vendor_part_id: 4099
hw_ver: 0x0
board_id: MT_1090110018
phys_port_cnt: 2
port: 1
state: PORT_DOWN (1)
max_mtu: 4096 (5)
active_mtu: 4096 (5)
sm_lid: 0
port_lid: 0
port_lmc: 0x00
link_layer: InfiniBand
port: 2
state: PORT_DOWN (1)
max_mtu: 4096 (5)
active_mtu: 4096 (5)
sm_lid: 0
port_lid: 0
port_lmc: 0x00
link_layer: InfiniBand
hca_id: mlx4_0
transport: InfiniBand (0)
fw_ver: 2.40.5030
node_guid: f452:1403:0042:e710
sys_image_guid: f452:1403:0042:e713
vendor_id: 0x02c9
vendor_part_id: 4099
hw_ver: 0x0
board_id: MT_1090110018
phys_port_cnt: 2
port: 1
state: PORT_DOWN (1)
max_mtu: 4096 (5)
active_mtu: 4096 (5)
sm_lid: 0
port_lid: 0
port_lmc: 0x00
link_layer: InfiniBand
port: 2
state: PORT_DOWN (1)
max_mtu: 4096 (5)
active_mtu: 4096 (5)
sm_lid: 0
port_lid: 0
port_lmc: 0x00
link_layer: InfiniBand
[root@node09 qeuser]#
I will plugin the infiniband cables and check the status again.
Best,
Rolly
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hi Rolly,
According to OFED-3.18-2 release notes (OFED-3.18-2/docs/OFED_release_notes.txt), this OFED version supports RHEL 6.5, 6.6, 6.7, 7.0, 7.1, 7.2, SLES 11 SP3 and SP4, SLES 12 and 12.1 only.
RHEL 7.3/CentOS 7.3 are fairly recent, and they are not supported by OFED-3.18-2. But CentOS 7.1 should work with OFED-3.18.2.
Thanks
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page