With gentoo-sources-2.4.22-r5, when the machine acts as a distccd server, it freezes if the distcc client machine compiles a large enough package (OpenOffice is a perfect test case).

The computer freezes completely: all processes die, or seem to (music stops, X and the console are dead, I can't log in with ssh), and the Caps Lock and Scroll Lock keyboard LEDs flash intermittently. SysRq still works; it throws Oopses when syncing and unmounting the drives, but after a reboot everything seems to be OK.

Reproducible: Always

Steps to Reproduce:
1. Set machine A as distccd server
2. Set another machine in the LAN as distcc client: distcc-config --set-hosts "A"
3. Emerge a large package (mozilla, openoffice, etc.)

Actual Results:
Machine A freezes as described in the details and prints this to the console:

kernel BUG at page_alloc.c:116!
invalid operand: 0000
CPU: 0
EIP: 0010:[<c01bc901>] Tainted: P
EFLAGS: 00010202
eax: 00000001 ebx: c1553f14 ecx: c1553f14 edx: 00000001
esi: 00000000 edi: dc143120 ebp: 00000004 esp: c013dd60
ds: 0018 es: 0018 ss: 0018
Process swaper (pid:0, stackpage=c013d000)
Stack: 01010101 c5655200 c5655200 0011cd9d 000431d1 0011cd9b 4012b894 00003bee
       00000001 dc1430c0 dc143120 00000004 c026c2ac c90c7d80 dc1430c0 dc1430f0
       c026c2e3 dc1430c0 dba36cc0 c8d70a3c c026c402 dc1430c0 c8d70a3c c8d70980
Call Trace: [<c026c2ac>] [<c026c2e3>] [<c026c402>] [<c028e3a9>] [<c029377f>] [<c028e9dc>]
            [<c0291d47>] [<c0299538>] [<c0299aec>] [<c027ed0c>] [<c027f04c>] [<f8db0878>]
            [<c02702e3>] [<c027040d>] [<c0270524>] [<c01a4b9d>] [<c018fbbf>] [<c018c100>]
            [<c0192353>] [<c018c100>] [<c018c100>] [<c018c123>] [<c018c174>]
Code: 0f 0b 74 00 2d 20 2c c0 e9 47 fd ff ff 89 f6 55 57 56 53 83
<0>Kernel panic: Aiee, killing interrupt handler!
In interrupt handler - not syncing

Expected Results:
Certainly not a crash.

The distcc client machine mounts /usr/portage as an NFS filesystem from the distccd server machine.
However, the freeze occurs only at the compile stage; the network transfer over NFS does not trigger it, at least so far.

Portage 2.0.49-r21 (default-x86-1.4, gcc-3.2.3, glibc-2.3.2-r9, 2.4.22-gentoo-r5)
=================================================================
System uname: 2.4.22-gentoo-r5 i686 AMD Athlon(tm) XP 2400+
Gentoo Base System version 1.4.3.10
distcc 2.11.1 i686-pc-linux-gnu (protocols 1 and 2) (default port 3632) [disabled]
ACCEPT_KEYWORDS="x86"
AUTOCLEAN="yes"
CFLAGS="-march=athlon-xp -mcpu=athlon-xp -O3 -pipe -fomit-frame-pointer"
CHOST="i686-pc-linux-gnu"
COMPILER="gcc3"
CONFIG_PROTECT="/etc /usr/X11R6/lib/X11/xkb /usr/kde/2/share/config /usr/kde/3.1/share/config /usr/kde/3/share/config /usr/share/config /usr/share/texmf/tex/generic/config/ /usr/share/texmf/tex/platex/config/ /var/qmail/control"
CONFIG_PROTECT_MASK="/etc/gconf /etc/env.d"
CXXFLAGS="-march=athlon-xp -mcpu=athlon-xp -O3 -pipe -fomit-frame-pointer"
DISTDIR="/usr/portage/distfiles"
FEATURES="autoaddcvs ccache fixpackages sandbox"
GENTOO_MIRRORS="http://gentoo.oregonstate.edu http://distro.ibiblio.org/pub/Linux/distributions/gentoo"
MAKEOPTS="-j2"
PKGDIR="/usr/portage/packages"
PORTAGE_TMPDIR="/var/tmp"
PORTDIR="/usr/portage"
PORTDIR_OVERLAY=""
SYNC="rsync://rsync.gentoo.org/gentoo-portage"
USE="X aalib alsa apache2 apm arts avi berkdb bonobo cdr crypt cups encode esd foomaticdb gdbm gif gnome gpm gtk gtk2 gtkhtml guile imlib java joystick jpeg kde libg++ libwww lirc mad mikmod mmx motif mozilla mpeg mysql ncurses nls oggvorbis opengl oss pam pdflib perl png python qt quicktime readline sdl slang spell ssl svga tcltk tcpd tetex truetype x86 xml2 xmms xv zlib"
Created attachment 24347 [details] Configuration file for the kernel in question
Created attachment 24348 [details] Output from ksymoops
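For readers without ksymoops at hand, the core of what it does with the bracketed addresses in the trace can be roughly sketched as a nearest-preceding-symbol lookup in a System.map-style file. This is only an illustration, not ksymoops itself: the function name resolve_addr is made up here, and it assumes the map is sorted by address and uses 8-digit lowercase hex, as on a 32-bit x86 kernel.

```shell
# Hypothetical helper (not part of ksymoops): print the symbol whose address
# is the closest one at or below the given address.
# Assumes a System.map-style file ("address type name" per line), sorted by
# address, with 8-digit lowercase hex so string order matches numeric order.
resolve_addr() {
    map_file=$1
    addr=$2
    awk -v addr="$addr" '
        # Appending "" forces a string comparison even for all-digit addresses.
        ($1 "") <= (addr "") { best = $3 }
        END { if (best != "") print best }
    ' "$map_file"
}
```

It would be called as resolve_addr /path/to/System.map c026c2ac. Addresses that belong to loaded modules (such as the f8db0878 entry in the trace) are not listed in System.map, so a lookup like this cannot resolve them correctly.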
I nearly forgot: I've been using this machine as a distccd server since October (kernel gentoo-sources-2.4.20-rsomething), and it never had problems (nearly the same kernel config, NFS, distcc, etc.). I think this happened to me with 2.4.22-r4 too, but I used that kernel for just a few days.
Can you provide the log from the distcc daemon ?
Created attachment 24351 [details] Log of distcc

I got it with:

cat /var/log/messages | grep distccd | grep "Jan 24 12"

because I've been running a lot of tests, and distccd generates lots of output. However, the last part of the log covers the most recent freeze.
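The same filter can be wrapped in a small function if the log has to be sliced for several time windows. This is just a sketch: the name distccd_log_slice is made up here, and it assumes the usual syslog layout in which every line starts with the timestamp.

```shell
# Hypothetical helper: print distccd lines from a syslog file whose
# timestamp starts with the given prefix (for example "Jan 24 12").
distccd_log_slice() {
    log_file=$1
    stamp=$2
    # -F matches "distccd" as a fixed string; the second grep anchors the
    # timestamp prefix at the start of the line.
    grep -F 'distccd' "$log_file" | grep "^$stamp"
}
```

Usage would be distccd_log_slice /var/log/messages "Jan 24 12". Anchoring the prefix at the start of the line avoids matching the same text if it happens to appear inside a message body.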
What NIC are you using? Can you show us the output of lsmod? Does this happen with vanilla-sources?
I'm using a Realtek:

lspci | grep Ethernet
00:0c.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL-8139/8139C/8139C+ (rev 10)

I moved to gentoo-dev-sources when 2.6.1 was marked "~x86", and I haven't had any more problems. This is my "work" machine, so I can't just go back and install gentoo-sources or vanilla-sources again :( Sorry I can't help more.

On the bright side, 2.6.3 seems to be rock solid: I capture video with the new BTTV drivers, use my remote control with lirc-0.7.0_pre2, and generally put the system under heavy stress (it is the master distcc server for two laptops and two older machines).
I have used 2.4.22-r7 with a Tulip driver and it has had zero problems with distcc. Were you sharing compiling data (/var/tmp/portage) or anything like that?
/usr/portage is an NFS share on the machine that freezes, but I've been using it that way for almost six months.
The issue was fixed with a kernel upgrade.
Closing. Problem is no more.