Summary: | OpenMosix Segmentation Faults with emerge. | ||
---|---|---|---|
Product: | Gentoo Linux | Reporter: | Bill Cavalieri <gentoo> |
Component: | [OLD] Core system | Assignee: | Michael Imhof (RETIRED) <tantive> |
Status: | RESOLVED FIXED | ||
Severity: | blocker | CC: | charles, robmoss |
Priority: | High | ||
Version: | unspecified | ||
Hardware: | x86 | ||
OS: | Linux | ||
Whiteboard: | |||
Package list: | Runtime testing required: | --- |
Description
Bill Cavalieri
2003-08-30 12:40:56 UTC
I'm having exactly the same issues on gentoo cluster (2.4.21-openmosix). I can confirm that emerge segfaults in 9 cases out of 10 if openmosix is running. Are you aware of the following? (taken from openmosix.sf.net) "What is openMosix? openMosix is a Linux kernel extension for single-system image clustering. This kernel extension turns a network of ordinary computers into a supercomputer for Linux applications." That means that openMosix is designed for a cluster of homogenous nodes. I assume that you are running an openMosix cluster that has nodes with different architectures (like p4, p3, xeon, athlon etc...) In that case you start portage (by calling emerge) on one specific node. All code on that node is optimized for that machine (and it's capabilities). When some of the processes generated by portage (like gcc calls etc) are migrated the following situation can happen: Highly optimized code (e.g. with sse2 optimizations etc.) is executed on a node which is not having/supporting those optimizations --> it'll segfault. I hope this helps and clarifies the current situation. *** Bug 22736 has been marked as a duplicate of this bug. *** tantive's response fixed my segmentation problems. All the nodes in the cluster need to be the same processor arch, I had one p4 as a node in the cluster, and everything else was p3's. Removed the p4, and segmentation fault's went away. I'm pretty familiar with OM and did not remove my segfaults. Gentoo cluster I'm running is 4-node cluster, all machines have the following: - PII 266Mhz - 64Mb RAM - Even the same hdd all of these were compiled with the same CFLAGS, all have the same optimizations, arch, and processor. I believe this is emerge specification (it does not like when it is balanced, I do not know though 'why'). In /etc/make.conf, use "-mcpu=" instead of "-march=", you won't segfault anymore. Might I just ask if you've tried setting the following in /etc/make.conf: CBUILD="i686-pc-linux-gnu" CHOST="i686-pc-linux-gnu" CFLAGS="... -march=i686 ..." Then recompiling your kernel, rebooting and doing an "emerge -e system && emerge -e world". Also, can I ask you if all the relevant hardware is running reliably? The *primary* cause of Gentoo segfaults, apart from user error, is hardware failure. I found that underclocking a P2 350 down to 333MHz stopped all my segfaults on my OM cluster. Anyway, just a couple of suggestions; I don't like 'blocker' bugs! As i don't like blocker bugs, too... ... i'll close it now. |