- Marquer comme nouveau
- Marquer
- S'abonner
- Sourdine
- S'abonner au fil RSS
- Surligner
- Imprimer
- Signaler un contenu inapproprié
Lien copié
- Marquer comme nouveau
- Marquer
- S'abonner
- Sourdine
- S'abonner au fil RSS
- Surligner
- Imprimer
- Signaler un contenu inapproprié
- Marquer comme nouveau
- Marquer
- S'abonner
- Sourdine
- S'abonner au fil RSS
- Surligner
- Imprimer
- Signaler un contenu inapproprié
I will pass on your comment about the spec files to the developers. Were there any requirements other than those listed in section 2.1 of the MPSS User's Guide?
I notice that you did not uninstall the existing dapl, libibscif and ofed-driver packages before doing the build. Before you install your rebuilt copies of those packages, it is recommended that you uninstall the old version. The same is true for the entire MPSS, although in this limited case, I suspect it is not strictly necessary. I know - not what you would expect to be asked to do for the polished open source code you are used to.
On to the errors -
Did you run this command as yourself or as root? I suspect it is right that there is no user 'build' on your system. But this does not seem to be a fatal error. When one of my teammates did the same build yesterday, he did it as root and I don't believe he got those errors.
Was your /tmp directory full or otherwise unwritable? I looked through some of the rpm source code and it looks like that message "Unable to write temp header" occurs when a temporary file that was created becomes unwritable because there is no space available.
- Marquer comme nouveau
- Marquer
- S'abonner
- Sourdine
- S'abonner au fil RSS
- Surligner
- Imprimer
- Signaler un contenu inapproprié
- Marquer comme nouveau
- Marquer
- S'abonner
- Sourdine
- S'abonner au fil RSS
- Surligner
- Imprimer
- Signaler un contenu inapproprié
- Marquer comme nouveau
- Marquer
- S'abonner
- Sourdine
- S'abonner au fil RSS
- Surligner
- Imprimer
- Signaler un contenu inapproprié
- Marquer comme nouveau
- Marquer
- S'abonner
- Sourdine
- S'abonner au fil RSS
- Surligner
- Imprimer
- Signaler un contenu inapproprié
After reading back over your original post, I have a couple questions. I want to make sure you are at the place in the install instructions that I think you are at. The documentation that comes with the MPSS can be confusing (which is why it is undergoing a complete rewrite).
Do you have the kernel-headers and kernel-devel packages installed (or kernel-default-devel, if using SuSE)?
Did you rebuild mpss-modules-3.3-1.src.rpm, following the directions in the "Warning" box in section 2.1 of the readme.txt file (not the User's Guide but the readme.txt file)? If you are rebuilding kernel modules because you are running a later kernel, that module is the first thing you should rebuild. Did you have any trouble rebuilding that module?
Did you complete the install of the basic MPSS following the directions in section 2.2 of the readme.txt file?
Are you using the Mellanox* 2.1 version of OFED? Did you complete the install of OFED, using the directions from Mellanox?
What section of the MPSS User's Guide are you referring to when you say:
I meant to say that the original command is: rpmbuild --rebuild --define "MOFED 1" src/dapl-2.0.42.2-1.glibc2.12.2.src.rpm src/libibscif-1.0.0-1.fc13.src.rpm src/ofed-driver-3.3-1.src.rpm as per the "MPSS users guide".
Can you provide a list of the packages you installed in an effort to clean up the missing prerequisite problem?
Looking back at the particular software you are having trouble with, you should actually only need the kernel-headers and kernel-devel packages, plus some headers and libraries that come with the MPSS and OFED. If you are doing a full build of OFED, you need the packages listed in section 2.1 of the User's Guide. But I believe for the pieces you are attempting to rebuild here you only need, as I said, the kernel-headers and kernel-devel packages, plus some headers and libraries that come with the MPSS and OFED.
- Marquer comme nouveau
- Marquer
- S'abonner
- Sourdine
- S'abonner au fil RSS
- Surligner
- Imprimer
- Signaler un contenu inapproprié
- Marquer comme nouveau
- Marquer
- S'abonner
- Sourdine
- S'abonner au fil RSS
- Surligner
- Imprimer
- Signaler un contenu inapproprié
OK, I don't know why this isn't working for you or why it says it needs those extra packages (some of them like libibverbs-devel, dapl-devel, dapl are in the source package you are trying to build and mpss-modules-devel was in the mpss modules source package that you built and whose output you installed previously. )
Since the make error 2 is so unhelpful, saying in effect only that make hit some error, could you perhaps try doing the make by hand and add a debug option?
I have also asked someone who knows more about building this software to take a look. I am worried that the answer will come back that you must use Mellanox OFED 2.1, that 2.0 will keep our code from building. But we will see.
- Marquer comme nouveau
- Marquer
- S'abonner
- Sourdine
- S'abonner au fil RSS
- Surligner
- Imprimer
- Signaler un contenu inapproprié
- Marquer comme nouveau
- Marquer
- S'abonner
- Sourdine
- S'abonner au fil RSS
- Surligner
- Imprimer
- Signaler un contenu inapproprié
I am told that Mellanox 2.0 will not work. It lacks necessary support for the coprocessor. So you must use at least 2.1.
mpxyd is in dapl-2.0.42.2-1.glibc2.12.2.x86_64.rpm which is supposed to be installed. dapl-2.0.42.2-1.glibc2.12.2.x86_64.rpm is also supposed to be rebuilt from the source code in mpss3.3/src. The directions say to rebuild with "MOFED 1" defined. I suspect it should be "_MELLANOX 1" instead. I am checking on this.
For the micctrl options that control networking functions, there is an option, --modhost=no that should prevent the disappearing /etc/host entries from happening.
For the conflicts between Mellanox OFED and the ofed-driver from Intel, was this with Mellanox OFED 2.0 or 2.1. If it was 2.0, please switch to 2.1. If it was 2.1 could you list what the conflicts were?
- Marquer comme nouveau
- Marquer
- S'abonner
- Sourdine
- S'abonner au fil RSS
- Surligner
- Imprimer
- Signaler un contenu inapproprié
file /etc/infiniband/connectx.conf from install of ofed-driver-3.0.76-0.11-default-3.3-1.x86_64 conflicts with file from package mlnx-ofa_kernel-2.1-OFED.2.1.197.g008fbee.rhel6u4.x86_64 file /etc/infiniband/openib.conf from install of ofed-driver-3.0.76-0.11-default-3.3-1.x86_64 conflicts with file from package mlnx-ofa_kernel-2.1-OFED.2.1.197.g008fbee.rhel6u4.x86_64 file /etc/infiniband/truescale.cmds from install of ofed-driver-3.0.76-0.11-default-3.3-1.x86_64 conflicts with file from package mlnx-ofa_kernel-2.1-OFED.2.1.197.g008fbee.rhel6u4.x86_64 file /etc/init.d/openibd from install of ofed-driver-3.0.76-0.11-default-3.3-1.x86_64 conflicts with file from package mlnx-ofa_kernel-2.1-OFED.2.1.197.g008fbee.rhel6u4.x86_64 file /etc/modprobe.d/ib_ipoib.conf from install of ofed-driver-3.0.76-0.11-default-3.3-1.x86_64 conflicts with file from package mlnx-ofa_kernel-2.1-OFED.2.1.197.g008fbee.rhel6u4.x86_64 file /etc/udev/rules.d/90-ib.rules from install of ofed-driver-3.0.76-0.11-default-3.3-1.x86_64 conflicts with file from package mlnx-ofa_kernel-2.1-OFED.2.1.197.g008fbee.rhel6u4.x86_64 file /sbin/connectx_port_config from install of ofed-driver-3.0.76-0.11-default-3.3-1.x86_64 conflicts with file from package mlnx-ofa_kernel-2.1-OFED.2.1.197.g008fbee.rhel6u4.x86_64 file /sbin/sysctl_perf_tuning from install of ofed-driver-3.0.76-0.11-default-3.3-1.x86_64 conflicts with file from package mlnx-ofa_kernel-2.1-OFED.2.1.197.g008fbee.rhel6u4.x86_64 file /usr/bin/ibdev2netdev from install of ofed-driver-3.0.76-0.11-default-3.3-1.x86_64 conflicts with file from package mlnx-ofa_kernel-2.1-OFED.2.1.197.g008fbee.rhel6u4.x86_64This is what we see in the logs at the moment when the MIC software starts being used, having just removed mlnx-ofa_kernel and installed ofed-kernel:
Sep 5 15:49:44 yellow12 kernel: mic0: Transition from state booting to online Sep 5 15:49:44 yellow12 kernel: mic1: Transition from state booting to online Sep 5 15:51:44 yellow12 kernel: ibp_server: disagrees about version of symbol ib_unregister_client Sep 5 15:51:44 yellow12 kernel: ibp_server: Unknown symbol ib_unregister_client Sep 5 15:51:44 yellow12 kernel: ibp_server: disagrees about version of symbol ib_query_ah Sep 5 15:51:44 yellow12 kernel: ibp_server: Unknown symbol ib_query_ah Sep 5 15:51:44 yellow12 kernel: ibp_server: disagrees about version of symbol ib_query_srq Sep 5 15:51:44 yellow12 kernel: ibp_server: Unknown symbol ib_query_srq Sep 5 15:51:44 yellow12 kernel: ibp_server: disagrees about version of symbol ib_dereg_mr Sep 5 15:51:44 yellow12 kernel: ibp_server: Unknown symbol ib_dereg_mr Sep 5 15:51:45 yellow12 kernel: ibp_server: disagrees about version of symbol ib_query_qp Sep 5 15:51:45 yellow12 kernel: ibp_server: Unknown symbol ib_query_qp Sep 5 15:51:45 yellow12 kernel: ibp_server: disagrees about version of symbol ib_register_event_handler Sep 5 15:51:45 yellow12 kernel: ibp_server: Unknown symbol ib_register_event_handler Sep 5 15:51:45 yellow12 kernel: ibp_server: disagrees about version of symbol ib_detach_mcast Sep 5 15:51:45 yellow12 kernel: ibp_server: Unknown symbol ib_detach_mcast Sep 5 15:51:45 yellow12 kernel: ibp_server: disagrees about version of symbol ib_unregister_event_handler Sep 5 15:51:45 yellow12 kernel: ibp_server: Unknown symbol ib_unregister_event_handler Sep 5 15:51:45 yellow12 kernel: ibp_server: disagrees about version of symbol ib_create_ah Sep 5 15:51:45 yellow12 kernel: ibp_server: Unknown symbol ib_create_ah Sep 5 15:51:45 yellow12 kernel: ibp_server: disagrees about version of symbol ib_register_client Sep 5 15:51:45 yellow12 kernel: ibp_server: Unknown symbol ib_register_client Sep 5 15:51:45 yellow12 kernel: ibp_server: disagrees about version of symbol ib_destroy_cq Sep 5 15:51:45 yellow12 kernel: ibp_server: Unknown symbol ib_destroy_cq Sep 5 15:51:45 yellow12 kernel: ibp_server: disagrees about version of symbol ib_set_client_data Sep 5 15:51:45 yellow12 kernel: ibp_server: Unknown symbol ib_set_client_data Sep 5 15:51:45 yellow12 kernel: ibp_server: disagrees about version of symbol ib_query_port Sep 5 15:51:45 yellow12 kernel: ibp_server: Unknown symbol ib_query_port Sep 5 15:51:45 yellow12 kernel: ibp_server: disagrees about version of symbol ib_get_client_data Sep 5 15:51:45 yellow12 kernel: ibp_server: Unknown symbol ib_get_client_data Sep 5 15:51:46 yellow12 kernel: ibp_server: disagrees about version of symbol ib_destroy_srq Sep 5 15:51:46 yellow12 kernel: ibp_server: Unknown symbol ib_destroy_srq Sep 5 15:51:46 yellow12 kernel: ibp_server: disagrees about version of symbol ib_query_device Sep 5 15:51:46 yellow12 kernel: ibp_server: Unknown symbol ib_query_device Sep 5 15:51:46 yellow12 kernel: ibp_server: disagrees about version of symbol ib_destroy_ah Sep 5 15:51:46 yellow12 kernel: ibp_server: Unknown symbol ib_destroy_ah Sep 5 15:51:46 yellow12 kernel: ibp_server: disagrees about version of symbol ib_dealloc_xrcd Sep 5 15:51:46 yellow12 kernel: ibp_server: Unknown symbol ib_dealloc_xrcd Sep 5 15:51:46 yellow12 kernel: ibp_server: disagrees about version of symbol ib_query_pkey Sep 5 15:51:46 yellow12 kernel: ibp_server: Unknown symbol ib_query_pkey Sep 5 15:51:46 yellow12 kernel: ibp_server: disagrees about version of symbol ib_destroy_qp Sep 5 15:51:46 yellow12 kernel: ibp_server: Unknown symbol ib_destroy_qp Sep 5 15:51:46 yellow12 kernel: ibp_server: disagrees about version of symbol ib_dealloc_pd Sep 5 15:51:46 yellow12 kernel: ibp_server: Unknown symbol ib_dealloc_pd Sep 5 15:51:46 yellow12 kernel: ibp_server: disagrees about version of symbol ib_query_gid Sep 5 15:51:46 yellow12 kernel: ibp_server: Unknown symbol ib_query_gid Sep 5 15:51:46 yellow12 kernel: ibp_server: disagrees about version of symbol ib_attach_mcast Sep 5 15:51:46 yellow12 kernel: ibp_server: Unknown symbol ib_attach_mcastNote that as much of what is provided by mlnx-ofa_kernel package is in use (and I don't know what), and the nodes are diskless, there isn't much more we can do at this end at this stage. Unless I force it, which I'd much rather not, I can't install the ofed-driver package at all without uninstalling mlnx-ofa_kernel, and then once the mics come up, these errors appear and IPoIB can't be set up.
- Marquer comme nouveau
- Marquer
- S'abonner
- Sourdine
- S'abonner au fil RSS
- Surligner
- Imprimer
- Signaler un contenu inapproprié
... $(if $(CONFIG_XEN),-D__XEN_INTERFACE_VERSION__=$(CONFIG_XEN_INTERFACE_VERSION)) \ $(if $(CONFIG_XEN),-I$(srctree)/arch/x86/include/mach-xen) \ -I$(srctree)/arch/$(hdr-arch)/include \ -Iinclude \ $(if $(KBUILD_SRC),-Iinclude2 -I$(srctree)/include) \ -I$(srctree)/arch/$(SRCARCH)/include \ ' \ modules make[1]: Entering directory `/usr/src/kernels/2.6.32-358.6.2.el6.x86_64' make[1]: *** No rule to make target `modules'. Stop. make[1]: Leaving directory `/usr/src/kernels/2.6.32-358.6.2.el6.x86_64' make: *** [kernel] Error 2 error: Bad exit status from /scratch/zan/tmp/rpm-tmp.g3Ax9t (%build)

- S'abonner au fil RSS
- Marquer le sujet comme nouveau
- Marquer le sujet comme lu
- Placer ce Sujet en tête de liste pour l'utilisateur actuel
- Marquer
- S'abonner
- Page imprimable