Gentoo Websites Logo
Go to: Gentoo Home Documentation Forums Lists Bugs Planet Store Wiki Get Gentoo!
Bug 65455 - nvidia-kernel-1.0.6111-r1 and/or gcc-3.4.2-r2 cause xorg-x11-6.8.0-r1 to hang and a kernel oops
Summary: nvidia-kernel-1.0.6111-r1 and/or gcc-3.4.2-r2 cause xorg-x11-6.8.0-r1 to hang...
Status: RESOLVED FIXED
Alias: None
Product: Gentoo Linux
Classification: Unclassified
Component: Current packages (show other bugs)
Hardware: x86 Linux
: High normal (vote)
Assignee: Gentoo X packagers
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2004-09-26 14:31 UTC by Steph Gosling
Modified: 2004-10-10 22:39 UTC (History)
0 users

See Also:
Package list:
Runtime testing required: ---


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description Steph Gosling 2004-09-26 14:31:36 UTC
nvidia-kernel-1.0.6111-r1 with 2.6.6 and 2.6.9_rc2 (as compiled with with gcc 3.4.2),  will cause X to hang on start and cause the following Oops:

kernel: nvidia: module license 'NVIDIA' taints kernel.
kernel: NVRM: loading NVIDIA Linux x86 NVIDIA Kernel Module  1.0-6111  Tue Jul 27 07:55:38 PDT 2004
kernel:  printing eip:
kernel: f91a75ba
kernel: Oops: 0000 [#1]
kernel: PREEMPT SMP
kernel: CPU:    0
kernel: EIP:    0060:[<f91a75ba>]    Tainted: P
kernel: EFLAGS: 00013292   (2.6.6)
kernel: EIP is at os_string_compare+0x13/0x2d [nvidia]
kernel: eax: 00000046   ebx: f7098abc   ecx: 00000000   edx: f7098a9c
kernel: esi: f7098a9d   edi: 00000000   ebp: f6ffbe2c   esp: f6ffbe08
kernel: ds: 007b   es: 007b   ss: 0068
kernel: Process X (pid: 7700, threadinfo=f6ffa000 task=f7c6ee50)
kernel: Stack: f7098abc f7a10c00 f8fe625a f7098a9c 00000000 c0196ed7 f7332990 f00000fa
kernel:        f7a95005 f6ffbe5c f8fe81db f7a10c00 f7098a9c 00000000 0010a000 00000000
kernel:        f7a10c00 00000000 c0157bfa c1923428 0010a000 f6ffbe9c f8febb74 f7a10c00
kernel: Call Trace:
kernel:  [<f8fe625a>] _nv001530rm+0x16/0x1c [nvidia]
kernel:  [<c0196ed7>] proc_get_inode+0x127/0x170
kernel:  [<f8fe81db>] _nv001618rm+0x33/0x7c [nvidia]
kernel:  [<c0157bfa>] map_area_pmd+0x7a/0xb0
kernel:  [<f8febb74>] _nv001244rm+0x208/0x2e8 [nvidia]
kernel:  [<f8fecc35>] rm_access_registry+0x75/0x9c [nvidia]
kernel:  [<f8fed134>] _nv001139rm+0x3a4/0x4b8 [nvidia]
kernel:  [<c0116eb0>] do_page_fault+0x0/0x55e
kernel:  [<c0106835>] error_code+0x2d/0x38
kernel:  [<f8fec147>] rm_ioctl+0x23/0x38 [nvidia]
kernel:  [<c02c464d>] md_update_sb+0x6d/0x120
kernel:  [<f91a48bd>] nv_kern_ioctl+0x388/0x3d4 [nvidia]
kernel:  [<c02c464d>] md_update_sb+0x6d/0x120
kernel:  [<c01738b7>] sys_ioctl+0x1c7/0x2e0
kernel:  [<c02c464d>] md_update_sb+0x6d/0x120
kernel:  [<c0105dcb>] syscall_call+0x7/0xb
kernel:  [<c02c464d>] md_update_sb+0x6d/0x120

I'm sorry I don't have X logs for you, but I've been fiddling with this so long that the failed ones have long gone. From what I remember they didn't print anything out of the ordinary, merely stopped sometime after loading the "vgahw" module.

Reverting to nvidia-kernel-1.0.5336-r4 brings X back again without issue.

Hardware is:

0000:01:00.0 VGA compatible controller: nVidia Corporation NV15 [GeForce2 GTS/Pro] (rev a3) (prog-if 00 [VGA])
        Subsystem: LeadTek Research Inc.: Unknown device 2843
        Flags: bus master, 66Mhz, medium devsel, latency 248, IRQ 11
        Memory at de000000 (32-bit, non-prefetchable) [size=dfef0000]
        Memory at d0000000 (32-bit, prefetchable) [size=128M]
        Expansion ROM at 00010000 [disabled]
        Capabilities: [60] Power Management version 1
        Capabilities: [44] AGP version 2.0

This is a dual-proc system:

heroditus ~ # emerge info
Portage 2.0.51_rc4 (default-x86-2004.0, gcc-3.4.2, glibc-2.3.4.20040808-r0, 2.6.6 i686)
=================================================================
System uname: 2.6.6 i686 Pentium III (Coppermine)
Gentoo Base System version 1.5.3
Autoconf: sys-devel/autoconf-2.59-r4
Automake: sys-devel/automake-1.8.5-r1
Binutils: sys-devel/binutils-2.15.90.0.1.1-r3
Headers:  sys-kernel/linux-headers-2.4.22
Libtools: sys-devel/libtool-1.5.2-r5
ACCEPT_KEYWORDS="x86 ~x86"
AUTOCLEAN="yes"
CFLAGS="-march=pentium3 -O3 -pipe -fomit-frame-pointer"
CHOST="i686-pc-linux-gnu"
COMPILER=""
CONFIG_PROTECT="/etc /usr/X11R6/lib/X11/xkb /usr/kde/2/share/config /usr/kde/3.2/share/config /usr/kde/3.3/share/config:/usr/kde/3.3/env:/usr/kde/3.3/shutdown /usr/kde/3/share/config /usr/lib/mozilla/defaults/pref /usr/share/config /var/qmail/control"
CONFIG_PROTECT_MASK="/etc/gconf /etc/terminfo /etc/env.d"
CXXFLAGS="-march=pentium3 -O3 -pipe -fomit-frame-pointer"
DISTDIR="/usr/portage/distfiles"
FEATURES="autoaddcvs ccache distlocks sandbox"
GENTOO_MIRRORS="http://www.mirrorservice.org/sites/www.ibiblio.org/gentoo http://gentoo.oregonstate.edu http://www.ibiblio.org/pub/Linux/distributions/gentoo"
MAKEOPTS="-j5"
PKGDIR="/usr/portage/packages"
PORTAGE_TMPDIR="/var/tmp"
PORTDIR="/usr/portage"
PORTDIR_OVERLAY=""
SYNC="rsync://rsync.europe.gentoo.org/gentoo-portage"
USE="X apm arts avi berkdb bitmap-fonts bonobo cdr crypt cups curl dba dga divx4linux dvd dvdr encode esd foomaticdb gd gdbm gif gnome gphoto2 gpm gstreamer gtk gtk2 gtkhtml imlib java jpeg ldap libg++ libwww mad mikmod mmx motif mozilla mpeg mysql ncurses nls oggvorbis opengl oss pam pdflib perl png python quicktime readline sdl slang spell sse ssl svga tcpd truetype x86 xinerama xml2 xmms xprint xv xvid zlib"






Reproducible: Always
Steps to Reproduce:
1.
2.
3.
Comment 1 Steph Gosling 2004-09-26 14:32:59 UTC
Whoops, forgot the last line of the trace:

kernel: Code: ae 75 08 84 c0 75 f8 31 c0 eb 04 19 c0 0c 01 8b 34 24 8b 7c
Comment 2 Dewet Diener 2004-09-28 11:27:48 UTC
Confirmed here as well; xorg-x11-6.8.0-r1 and nvidia-kernel-1.0.6111-r1 compiled using gcc-3.4.2-r2 causes the PC to do a hard lockup right after the nvidia logo showed after starting X.  I'm running 2.6.8-ck8 on a UP Duron 650.

0000:01:00.0 VGA compatible controller: nVidia Corporation NV11 [GeForce2 MX/MX 400] (rev b2) (prog-if 00 [VGA])
        Subsystem: Giga-byte Technology: Unknown device 1284
        Flags: bus master, 66Mhz, medium devsel, latency 248, IRQ 7
        Memory at d8000000 (32-bit, non-prefetchable)
        Memory at d0000000 (32-bit, prefetchable) [size=128M]
        Capabilities: [60] Power Management version 2
        Capabilities: [44] AGP version 2.0

Portage 2.0.51_rc6 (default-x86-1.4, gcc-3.4.2, glibc-2.3.4.20040808-r0, 2.6.8-ck3 i686)
=================================================================
System uname: 2.6.8-ck3 i686 AMD Duron(tm) Processor
Gentoo Base System version 1.5.3
distcc 2.16 i686-pc-linux-gnu (protocols 1 and 2) (default port 3632) [enabled]
Autoconf: sys-devel/autoconf-2.59-r4
Automake: sys-devel/automake-1.8.5-r1
Binutils: sys-devel/binutils-2.15.90.0.1.1-r3
Headers:  sys-kernel/linux-headers-2.4.22
Libtools: sys-devel/libtool-1.5.2-r5
ACCEPT_KEYWORDS="x86 ~x86"
AUTOCLEAN="yes"
CFLAGS="-march=athlon-tbird -O3 -pipe -fomit-frame-pointer"
CHOST="i686-pc-linux-gnu"
COMPILER=""
CONFIG_PROTECT="/etc /usr/X11R6/lib/X11/xkb /usr/kde/2/share/config /usr/kde/3.3/env /usr/kde/3.3/share/config /usr/kde/3.3/shutdown /usr/kde/3/share/config /usr/lib/mozilla/defaults/pref /usr/share/config /usr/share/texmf/dvipdfm/config/ /usr/share/texmf/dvips/config/ /usr/share/texmf/tex/generic/config/ /usr/share/texmf/tex/platex/config/ /usr/share/texmf/xdvi/ /var/qmail/control"
CONFIG_PROTECT_MASK="/etc/X11/Sessions /etc/X11/xdm /etc/X11/xinit /etc/X11/xkb /etc/X11/xserver /etc/X11/xsm /etc/env.d /etc/gconf /etc/init.d /etc/terminfo /etc/env.d"
CXXFLAGS="-march=athlon-tbird -O3 -pipe -fomit-frame-pointer"
DISTDIR="/mp3/distfiles"
FEATURES="autoaddcvs candy ccache distcc distlocks prelink sandbox"
GENTOO_MIRRORS="http://gentoo.inode.at/ http://www.mirror.ac.uk/sites/www.ibiblio.org/gentoo/"
MAKEOPTS="-j1"
PKGDIR="/usr/portage/packages"
PORTAGE_TMPDIR="/var/tmp"
PORTDIR="/usr/portage"
PORTDIR_OVERLAY="/usr/local/portage /usr/local/bmg-main"
SYNC="rsync://rsync.nl.gentoo.org/gentoo-portage"
USE="3dnow S3TC X aalib acl acpi alsa apache2 arts artswrappersuid avantgo avi berkdb bitmap-fonts bluetooth bonobo cdr crypt cups curl dga dvd encode esd evo exiscan-acl f77 fbcon flash foomaticdb gd gdbm ggi gif gnome gphoto2 gpm gps gstreamer gtk gtk2 gtkhtml guile imap imlib innodb irmc java javascript jpeg kde ldap libg++ libwww mad mbox mmx mozilla mpeg mysql nagios-dns nagios-ping nagios-ssh nas ncurses nls nvidia offensive oggvorbis opengl opie oss pam pda pdflib perl png postgres python qt quicktime readline samba scanner sdl slang smime snmp spell ssl svga tcltk tcpd tetex tiff truetype usb vhosts wifi wmf x86 xinerama xml xml2 xmms xprint xv zlib"

I can't get any syslog messages, since the system is still in graphical mode and locked up hard.
Comment 3 Steph Gosling 2004-09-29 08:55:42 UTC
Mine didn't hang the whole machine but did render the virtual consoles unuseable. I was still able to get in over the network -- this maybe the case for you too and obviously is preferable to a reset and the risk of data corruption.
Comment 4 Andrew Bevitt 2004-09-30 19:22:05 UTC
Im waiting for rc3-mm1 and then im going to commit as many patches as possible to get nvidia working again... Im waiting for a response from nvidia in regards to some stuff aswell... 
Comment 5 Andrew Bevitt 2004-10-04 04:12:00 UTC
OK test out nvidia-kernel-1.0.6111-r2 (i think it should fix this)
Comment 6 Steph Gosling 2004-10-04 16:13:46 UTC
I rather inconveniently lost my system disk the other day and am still in the process of re-installing so can't fully test this; however, r2 compiles and loads against 2.6.9_rc3. Not 100% it works yet (as the monitor the nvidia card is on remains blank) but X does start with the driver loaded (it's part of a xinerama setup).

As soon as I have a working config I'll post.
Comment 7 Dewet Diener 2004-10-05 00:52:06 UTC
Please disregard my input on this bug.  Somewhere between upgrading ck-sources-2.6.8.1-r3 to ck-sources-2.6.8.1-r8, devfs decided to stop symlinking /dev/misc/psaux to /dev/mouse, so X didn't start up at all (missing core pointer).  So the white screen was only a corrupted console, and I could reach the machine over the network and actually see what the problem was.  No oops or anything like that.

Sorry for the wasted electrons.

I'm now successfully running nvidia-kernel-1.0.6111-r2 on ck-sources-2.6.8.1-r9 compiled with gcc-3.4.2-r2.
Comment 8 Steph Gosling 2004-10-05 14:09:17 UTC
Yep the following now works for me:

Linux version 2.6.9-rc3 (root@heroditus) (gcc version 3.4.2 (Gentoo Linux 3.4.2-r2, ssp-3.4.1-1, pie-8.7.6.5)) #4 SMP Mon Oct 4 09:41:20 BST 2004

With nvidia-kernel: 1.0.6111-r2
and  nvidia-glx: 1.0.6111

Great stuff and many thanks.
Comment 9 Andrew Bevitt 2004-10-10 22:39:01 UTC
Closing.