Bug 71170 - Gentoo-dev-sources 2.6.9 r4 gives an oops at bootup
Summary: Gentoo-dev-sources 2.6.9 r4 gives an oops at bootup
Status: RESOLVED TEST-REQUEST
Alias: None
Product: Gentoo Linux
Classification: Unclassified
Component: [OLD] Core system
Hardware: AMD64 Linux
Importance: High normal
Assignee: Gentoo Kernel Bug Wranglers and Kernel Maintainers
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2004-11-14 07:51 UTC by Elie De Brauwer
Modified: 2004-11-30 09:11 UTC

See Also:
Package list:
Runtime testing required: ---


Attachments
The .config (config, 25.28 KB, text/plain)
2004-11-14 07:52 UTC, Elie De Brauwer

Description Elie De Brauwer 2004-11-14 07:51:48 UTC
Snip from my dmesg:

ACPI: PCI interrupt 0000:00:06.0[A] -> GSI 11 (level, low) -> IRQ 11
PCI: Setting latency timer of device 0000:00:06.0 to 64
ieee1394: Host added: ID:BUS[0-00:1023]  GUID[00c09f000018affe]
usb 2-3: new full speed USB device using address 2
scsi0 : SCSI emulation for USB Mass Storage devices
  Vendor: Generic   Model: Flash R/W         Rev: 2002
  Type:   Direct-Access                      ANSI SCSI revision: 02
Attached scsi removable disk sda at scsi0, channel 0, id 0, lun 0
Attached scsi generic sg0 at scsi0, channel 0, id 0, lun 0,  type 0
USB Mass Storage device found at 2
intel8x0_measure_ac97_clock: measured 49491 usecs
intel8x0: clocking to 47353
ALSA device list:
  #0: NVidia nForce3 at 0xd0002000, irq 11
NET: Registered protocol family 2
IP: routing cache hash table of 4096 buckets, 32Kbytes
TCP: Hash tables configured (established 32768 bind 32768)
NET: Registered protocol family 1
NET: Registered protocol family 17
powernow-k8: Found 1 AMD Athlon 64 / Opteron processors (version 1.00.09b)
powernow-k8:    0 : fid 0xa (1800 MHz), vid 0x2 (1500 mV)
powernow-k8:    1 : fid 0x8 (1600 MHz), vid 0x6 (1400 mV)
powernow-k8:    2 : fid 0x0 (800 MHz), vid 0x12 (1100 mV)
powernow-k8: cpu_init done, current fid 0xa, vid 0x2
ACPI: (supports S0 S3 S4 S5)
ACPI wakeup devices: 
PCI0 USB0 USB1 USB2 MCI0 GIGA  LID 
RAMDISK: Compressed image found at block 0
VFS: Mounted root (ext2 filesystem).
ReiserFS: hda2: found reiserfs format "3.6" with standard journal
ReiserFS: hda2: using ordered data mode
ReiserFS: hda2: journal params: device hda2, size 8192, journal first block 18, max trans len 1024, max batch 900, max commit age 30, max trans age 30
ReiserFS: hda2: checking transaction log (hda2)
ReiserFS: hda2: Using r5 hash to sort names
VFS: Mounted root (reiserfs filesystem) readonly.
Trying to move old root to /initrd ... failed
Unmounting old root
Trying to free ramdisk memory ... okay
Freeing unused kernel memory: 172k freed
Unable to handle kernel paging request at 000001042231b78f RIP: 
<ffffffff802812ba>{memmove+58}
PML4 8063 PGD 0 
Oops: 0000 [1] PREEMPT 
CPU 0 
Modules linked in:
Pid: 3, comm: events/0 Not tainted 2.6.9-gentoo-r4
RIP: 0010:[<ffffffff802812ba>] <ffffffff802812ba>{memmove+58}
RSP: 0018:000001000193de20  EFLAGS: 00010013
RAX: 0000000000000000 RBX: 0000010001a327b0 RCX: 000001042231b78f
RDX: 0000000402abeffe RSI: 000001001f85c790 RDI: 000001042231b78f
RBP: 0000010001a329c0 R08: 000001001f85c790 R09: 0000000000000001
R10: 0000000000000001 R11: 0000000000000000 R12: 0000010001a32a70
R13: 0000000000000001 R14: 0000010001a327b0 R15: ffffffff805d6250
FS:  0000000000000000(0000) GS:ffffffff80605940(0000) knlGS:0000000000000000
CS:  0010 DS: 0018 ES: 0018 CR0: 000000008005003b
CR2: 000001042231b78f CR3: 0000000000101000 CR4: 00000000000006e0
Process events/0 (pid: 3, threadinfo 000001000193c000, task 000001000191a7e0)
Stack: ffffffff8015f30e ffffffff802b0e00 ffffffff805d62a8 ffffffff805d62a0 
       00000100018b8300 0000000000000000 ffffffff8015f250 00000100018b8310 
       ffffffff8014910e ffffffff805cb080 
Call Trace:<ffffffff8015f30e>{cache_reap+190} <ffffffff802b0e00>{fb_flashcursor+0} 
       <ffffffff8015f250>{cache_reap+0} <ffffffff8014910e>{worker_thread+510} 
       <ffffffff80133e70>{default_wake_function+0} <ffffffff80442a60>{thread_return+41} 
       <ffffffff80133e70>{default_wake_function+0} <ffffffff80148f10>{worker_thread+0} 
       <ffffffff8014d938>{kthread+136} <ffffffff801123e3>{child_rip+8} 
       <ffffffff80148f10>{worker_thread+0} <ffffffff8014d8b0>{kthread+0} 
       <ffffffff801123db>{child_rip+0} 

Code: 0f b6 01 48 83 fa ff 88 07 75 ec 4c 89 c0 c3 90 90 90 90 90 
RIP <ffffffff802812ba>{memmove+58} RSP <000001000193de20>
CR2: 000001042231b78f
 <6>note: events/0[3] exited with preempt_count 1
nvidia: version magic '2.6.9-gentoo-r4 gcc-3.3' should be '2.6.9-gentoo-r4 preempt gcc-3.3'
Adding 1048568k swap on /mnt/swap.  Priority:-1 extents:8085
spurious 8259A interrupt: IRQ7.


After this oops the machine hangs at the console. For example, it now stops after "Setting up gdm ..."; earlier it hung at "Setting user font ..." (but that was with a different build). So after each compile the kernel prints this oops and, after a certain interval, the machine hangs.

I've tried recompiling without preempt support (which didn't solve anything) and without ReiserFS proc statistics, because my first impression was that something failed right after the ReiserFS messages.

The kernel I used before was 2.6.8-gentoo-r9 but for now I'll try a rollback to 2.6.9-r1 or even to the latest 2.6.8.

Reproducible: Always
Steps to Reproduce:




Emerge info:

psychotic ~ # emerge info                               
Portage 2.0.51-r3 (default-amd64-2004.2, gcc-3.3.4, glibc-2.3.4.20041102-r0,
2.6.9-gentoo-r4 x86_64)
=================================================================
System uname: 2.6.9-gentoo-r4 x86_64 AMD Athlon(tm) 64 Processor 3000+
Gentoo Base System version 1.6.6
Autoconf: sys-devel/autoconf-2.59-r5
Automake: sys-devel/automake-1.8.5-r1
Binutils: sys-devel/binutils-2.15.92.0.2-r1
Headers:  sys-kernel/linux26-headers-2.6.8.1-r1
Libtools: sys-devel/libtool-1.5.2-r7
ACCEPT_KEYWORDS="amd64 ~amd64"
AUTOCLEAN="yes"
CFLAGS="-O2 -pipe -fPIC"
CHOST="x86_64-pc-linux-gnu"
COMPILER=""
CONFIG_PROTECT="/etc /usr/X11R6/lib/X11/xkb /usr/kde/2/share/config
/usr/kde/3.2/share/config /usr/kde/3.3/env /usr/kde/3.3/share/config
/usr/kde/3.3/shutdown /usr/kde/3/share/config /usr/lib/mozilla/defaults/pref
/usr/share/config /usr/share/texmf/dvipdfm/config/
/usr/share/texmf/dvips/config/ /usr/share/texmf/tex/generic/config/
/usr/share/texmf/tex/platex/config/ /usr/share/texmf/xdvi/ /var/qmail/control"
CONFIG_PROTECT_MASK="/etc/gconf /etc/terminfo /etc/env.d"
CXXFLAGS="-O2 -pipe -fPIC"
DISTDIR="/usr/portage/distfiles"
FEATURES="autoaddcvs ccache distlocks"
GENTOO_MIRRORS="http://ftp.gentoo.skynet.be/pub/gentoo/"
MAKEOPTS="-j2"
PKGDIR="/usr/portage/packages"
PORTAGE_TMPDIR="/var/tmp"
PORTDIR="/usr/portage"
PORTDIR_OVERLAY=""
SYNC="rsync://rsync.gentoo.org/gentoo-portage"
USE="X amd64 apm arts avi berkdb bitmap-fonts cdr crypt encode esd f77
foomaticdb fortran gdbm gif gnome gpm gtk gtk2 imlib java jpeg kde libg++ libwww
mikmod motif mozilla mpeg ncurses nls nogcj oggvorbis opengl oss pam pdflib perl
png postgres python qt quicktime readline sdl slang spell ssl tcpd tetex
truetype xml2 xmms xv zlib"


I'll append my .config
Comment 1 Elie De Brauwer 2004-11-14 07:52:52 UTC
Created attachment 43932
The .config
Comment 2 Elie De Brauwer 2004-11-14 08:22:41 UTC
I just tried 2.6.9-gentoo-r1 with the same config. The problem also exists there, and the system hangs at "Setting user font". That is before sshd is started, so I couldn't ssh into the machine to get the oops from dmesg.

But I think it's ReiserFS related, since this is what I can read on the console:

Checking internal tree.....Unable to handle kernel paging request at ... RIP

This happens with both 2.6.9-r1 and -r4.

So for now I'll have to roll back to 2.6.8.
Comment 3 Daniel Drake (RETIRED) gentoo-dev 2004-11-14 08:42:05 UTC
Are you using any form of framebuffer?
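
The framebuffer question likely follows from `fb_flashcursor` showing up in the call trace of the oops above. As a minimal sketch (not part of the original report), the symbols in a 2.6-era x86_64 trace can be listed like this; the function name `trace_symbols` and the regular expression are assumptions made for illustration, and the sample is copied from the first two trace lines of the description.

```python
import re

# Matches entries such as <ffffffff8015f30e>{cache_reap+190}
SYMBOL = re.compile(r"<[0-9a-f]{16}>\{(\w+)\+\d+\}")

def trace_symbols(trace_text):
    """Return the distinct symbol names in a call trace, in first-seen order."""
    seen = []
    for name in SYMBOL.findall(trace_text):
        if name not in seen:
            seen.append(name)
    return seen

# First two lines of the call trace from the description.
sample = (
    "Call Trace:<ffffffff8015f30e>{cache_reap+190} <ffffffff802b0e00>{fb_flashcursor+0} \n"
    "       <ffffffff8015f250>{cache_reap+0} <ffffffff8014910e>{worker_thread+510} \n"
)

print(trace_symbols(sample))
```

Note that an oops trace of this kind lists kernel text addresses found on the stack, so a symbol appearing in it is a hint rather than proof that the function was executing.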
Comment 4 Elie De Brauwer 2004-11-14 09:36:15 UTC
Yes, a regular vesa vga framebuffer:

Card:
0000:01:00.0 VGA compatible controller: nVidia Corporation NV36 [GeForce FX Go5700] (rev a1)


psychotic void # fbset --info

mode "1024x768-76"
    # D: 78.653 MHz, H: 59.949 kHz, V: 75.694 Hz
    geometry 1024 768 1024 768 16
    timings 12714 128 32 16 4 128 4
    rgba 5/11,6/5,5/0,0/0
endmode

Frame buffer device information:
    Name        : VESA VGA
    Address     : 0xe0000000
    Size        : 3145728
    Type        : PACKED PIXELS
    Visual      : TRUECOLOR
    XPanStep    : 0
    YPanStep    : 0
    YWrapStep   : 0
    LineLength  : 2048
    Accelerator : No

This works perfectly under 2.6.8. I've rolled back to 2.6.8-r10, which still
works perfectly.
Comment 5 tklauser 2004-11-14 09:39:38 UTC
It seems like you're using the proprietary nvidia module. Try booting without loading this module and check whether the problem still exists. As Daniel mentioned, using framebuffer in conjunction with the nvidia module can cause problems (at least it did for me), so compiling the kernel without framebuffer support would also be a good idea.
Comment 6 Elie De Brauwer 2004-11-14 09:47:14 UTC
But at the point where the error occurs, nothing related to the module has happened yet. So imo it is illogical that the framebuffer interferes with the nvidia module, because at the time of the error no module has been loaded yet.
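
The oops in the description is consistent with this: its "Modules linked in:" line is empty and the "Pid:" line says "Not tainted". A minimal sketch for checking those two fields in a saved oops follows; the helper name `oops_module_state` is hypothetical, and the parsing assumes the 2.6-era layout shown in the description.

```python
def oops_module_state(oops_text):
    """Return (tainted, modules) parsed from a 2.6-era oops dump.

    tainted is True, False, or None if no "Pid:" line was found.
    modules is the list of names after "Modules linked in:".
    """
    tainted = None
    modules = []
    prefix = "Modules linked in:"
    for line in oops_text.splitlines():
        line = line.strip()
        if line.startswith(prefix):
            modules = line[len(prefix):].split()
        elif line.startswith("Pid:"):
            if "Not tainted" in line:
                tainted = False
            elif "Tainted" in line:
                tainted = True
    return tainted, modules

# Lines copied from the oops in the description.
sample = """\
Oops: 0000 [1] PREEMPT
CPU 0
Modules linked in:
Pid: 3, comm: events/0 Not tainted 2.6.9-gentoo-r4
"""

print(oops_module_state(sample))
```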
Comment 7 Daniel Drake (RETIRED) gentoo-dev 2004-11-17 15:59:26 UTC
Could you please test 2.6.10-rc2?
Comment 8 Elie De Brauwer 2004-11-17 21:22:11 UTC
Ok, I'll try it within a couple of hours (first I have some classes to attend :-( )
Comment 9 Daniel Drake (RETIRED) gentoo-dev 2004-11-30 09:11:51 UTC
Please reopen when you have had a chance to test.