Gentoo Websites Logo
Go to: Gentoo Home Documentation Forums Lists Bugs Planet Store Wiki Get Gentoo!
Bug 39272 - gentoo-source-2.4.22-r5 freeze as distccd server
Summary: gentoo-source-2.4.22-r5 freeze as distccd server
Status: VERIFIED FIXED
Alias: None
Product: Gentoo Linux
Classification: Unclassified
Component: [OLD] Core system (show other bugs)
Hardware: x86 Linux
: High critical (vote)
Assignee: x86-kernel@gentoo.org (DEPRECATED)
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2004-01-24 11:35 UTC by Canek Peláez Valdés
Modified: 2004-04-08 22:32 UTC (History)
2 users (show)

See Also:
Package list:
Runtime testing required: ---


Attachments
Configuration file for the kernel in question (.config,24.64 KB, text/plain)
2004-01-24 11:37 UTC, Canek Peláez Valdés
Details
Output from ksymoops (ksymoops-out.txt,3.65 KB, text/plain)
2004-01-24 11:38 UTC, Canek Peláez Valdés
Details
Log of distcc (distcc.log,162.73 KB, text/plain)
2004-01-24 12:25 UTC, Canek Peláez Valdés
Details

Note You need to log in before you can comment on or make changes to this bug.
Description Canek Peláez Valdés 2004-01-24 11:35:51 UTC
gentoo-sources-2.4.22-r5; acting the machine as distccd server, it freezes if the client distcc machine compiles a package big enough (OpenOffice is a perfect test case).

The computer freezes; all process die (or seems to be; music stop, X and the console are dead, can't log in with ssh), and the keyboard leds Caps Lock and Scroll Lock flash intermittently. SysRQ works; it throws Oopses when syncing and unmounting the drives, but after reboot everything seems to be OK.

Reproducible: Always
Steps to Reproduce:
1. Set machine A as distccd server
2. Set another machine in LAN as distcc client, distcc-config --set-hosts "A"
3. Emerge a large package (mozilla, openoffice, etc.)


Actual Results:  
machine A freezes as described in the details. Throw this to console:

kernel BUG at page_alloc.c:116!
invalid operand: 0000
CPU:    0
EIP:    0010:[<c01bc901>]    Tainted: P
EFLAGS: 00010202
eax: 00000001   ebx: c1553f14   ecx: c1553f14   edx: 00000001
esi: 00000000   edi: dc143120   ebp: 00000004   esp: c013dd60
ds: 0018   es: 0018   ss: 0018
Process swaper (pid:0, stackpage=c013d000)
Stack: 01010101 c5655200 c5655200 0011cd9d 000431d1 0011cd9b 4012b894 00003bee
       00000001 dc1430c0 dc143120 00000004 c026c2ac c90c7d80 dc1430c0 dc1430f0
       c026c2e3 dc1430c0 dba36cc0 c8d70a3c c026c402 dc1430c0 c8d70a3c c8d70980
Call Trace:    [<c026c2ac>] [<c026c2e3>] [<c026c402>] [<c028e3a9>] [<c029377f>]
  [<c028e9dc>] [<c0291d47>] [<c0299538>] [<c0299aec>] [<c027ed0c>] [<c027f04c>]
  [<f8db0878>] [<c02702e3>] [<c027040d>] [<c0270524>] [<c01a4b9d>] [<c018fbbf>]
  [<c018c100>] [<c0192353>] [<c018c100>] [<c018c100>] [<c018c123>] [<c018c174>]
 
Code: 0f 0b 74 00 2d 20 2c c0 e9 47 fd ff ff 89 f6 55 57 56 53 83
 <0>Kernel panic: Aiee, killing interrupt handler!
In interrupt handler - not syncing


Expected Results:  
certainly not to crash.

The distcc client machine mounts /usr/portage as a NFS filesystem from the
distccd server machine. However, the freeze occur only at the compile stage; the
network transfer using NFS doesn't affect it, at least until now.

Portage 2.0.49-r21 (default-x86-1.4, gcc-3.2.3, glibc-2.3.2-r9, 2.4.22-gentoo-r5)
=================================================================
System uname: 2.4.22-gentoo-r5 i686 AMD Athlon(tm) XP 2400+
Gentoo Base System version 1.4.3.10
distcc 2.11.1 i686-pc-linux-gnu (protocols 1 and 2) (default port 3632) [disabled]
ACCEPT_KEYWORDS="x86"
AUTOCLEAN="yes"
CFLAGS="-march=athlon-xp -mcpu=athlon-xp -O3 -pipe -fomit-frame-pointer"
CHOST="i686-pc-linux-gnu"
COMPILER="gcc3"
CONFIG_PROTECT="/etc /usr/X11R6/lib/X11/xkb /usr/kde/2/share/config
/usr/kde/3.1/share/config /usr/kde/3/share/config /usr/share/config
/usr/share/texmf/tex/generic/config/ /usr/share/texmf/tex/platex/config/
/var/qmail/control"
CONFIG_PROTECT_MASK="/etc/gconf /etc/env.d"
CXXFLAGS="-march=athlon-xp -mcpu=athlon-xp -O3 -pipe -fomit-frame-pointer"
DISTDIR="/usr/portage/distfiles"
FEATURES="autoaddcvs ccache fixpackages sandbox"
GENTOO_MIRRORS="http://gentoo.oregonstate.edu
http://distro.ibiblio.org/pub/Linux/distributions/gentoo"
MAKEOPTS="-j2"
PKGDIR="/usr/portage/packages"
PORTAGE_TMPDIR="/var/tmp"
PORTDIR="/usr/portage"
PORTDIR_OVERLAY=""
SYNC="rsync://rsync.gentoo.org/gentoo-portage"
USE="X aalib alsa apache2 apm arts avi berkdb bonobo cdr crypt cups encode esd
foomaticdb gdbm gif gnome gpm gtk gtk2 gtkhtml guile imlib java joystick jpeg
kde libg++ libwww lirc mad mikmod mmx motif mozilla mpeg mysql ncurses nls
oggvorbis opengl oss pam pdflib perl png python qt quicktime readline sdl slang
spell ssl svga tcltk tcpd tetex truetype x86 xml2 xmms xv zlib"
Comment 1 Canek Peláez Valdés 2004-01-24 11:37:29 UTC
Created attachment 24347 [details]
Configuration file for the kernel in question
Comment 2 Canek Peláez Valdés 2004-01-24 11:38:45 UTC
Created attachment 24348 [details]
Output from ksymoops
Comment 3 Canek Peláez Valdés 2004-01-24 11:41:25 UTC
I nearly forgot; I've been using this machine as distccd server since octuber (kernel gentoo-sources-2.4.20-rsomething), and it never has problems (nearle same config in kernel, NFS, distcc, etc). I think this happened to me with 2.4.22-r4 too, but I use that kernel just a few days.
Comment 4 Lisa Seelye (RETIRED) gentoo-dev 2004-01-24 12:10:00 UTC
Can you provide the log from the distcc daemon ?
Comment 5 Canek Peláez Valdés 2004-01-24 12:25:15 UTC
Created attachment 24351 [details]
Log of distcc

I get it with:

    cat /var/log/messages| grep distccd | grep "Jan 24 12"

because I've been making lot of tests, and distccd generates lots of output.
However, the last part of the log  contains the last freeze I have.
Comment 6 Brian Jackson (RETIRED) gentoo-dev 2004-02-16 15:25:50 UTC
What nic are you using? Can you show us the output of lsmod? Does this happen with vanilla-sources?
Comment 7 Canek Peláez Valdés 2004-03-01 10:17:36 UTC
I'm using a Realtek:

lspci | grep Ethernet
00:0c.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL-8139/8139C/8139C+ (rev 10)

I've moved to gentoo-dev-sources since the time 2.6.1 was marked as "~x86", and I don't have any more problems. And this is my "work" machine, so I can't just go back and install again gentoo-sources or vanilla-sources :(

Sorry I can't help more. On the bright side, 2.6.3 seems to be rock solid; and I capture video with the new BTTV drivers, use my remote control with lirc-0.7.0_pre2, and generally put the system in heavy stress conditions (is the master distcc server for two laptops and two older machines).
Comment 8 Lisa Seelye (RETIRED) gentoo-dev 2004-03-01 10:58:32 UTC
I have used 2.4.22-r7 with a Tulip driver and it has had zero problems with distcc.

Were you sharing compiling data (/var/tmp/portage) or anything like that?
Comment 9 Canek Peláez Valdés 2004-03-01 11:49:58 UTC
/usr/portage is a NFS share in the machine that freezes; but I've been using it that way since almost six months.
Comment 10 Jason Cox (RETIRED) gentoo-dev 2004-04-08 22:32:33 UTC
The issue was fixed with a kernel upgrade. 
Comment 11 Jason Cox (RETIRED) gentoo-dev 2004-04-08 22:32:53 UTC
Closing. Problem is no more.