Bug 150080 - kernel oops when starting X with nvidia-drivers-1.0.8762-r1
Summary: kernel oops when starting X with nvidia-drivers-1.0.8762-r1
Status: RESOLVED FIXED
Alias: None
Product: Gentoo Linux
Classification: Unclassified
Component: Current packages
Hardware: x86 Linux
Importance: High major
Assignee: X11 External Driver Maintainers
Reported: 2006-10-04 11:17 UTC by Jacek
Modified: 2006-12-11 06:59 UTC (History)
Description Jacek 2006-10-04 11:17:31 UTC
When I try to start xorg-x11-7.0-r1 with nvidia-drivers-1.0.8762-r1, I get a kernel oops. I'm not sure, but I think it could be related to the SMP processor (Intel P4 with HT).
On another machine (AthlonXP, same xorg and nvidia-drivers versions) everything works fine.

Oops message:
Oct  4 19:01:10 jacekk nvidia: module license 'NVIDIA' taints kernel.
Oct  4 19:01:10 jacekk ACPI: PCI Interrupt 0000:01:00.0[A] -> GSI 16 (level, low) -> IRQ 18
Oct  4 19:01:10 jacekk NVRM: loading NVIDIA Linux x86 Kernel Module  1.0-8762  Mon May 15 13:06:38 PDT 2006
Oct  4 19:03:12 jacekk BUG: unable to handle kernel NULL pointer dereference at virtual address 00000094
Oct  4 19:03:12 jacekk printing eip:
Oct  4 19:03:12 jacekk f96fb125
Oct  4 19:03:12 jacekk *pde = 00000000
Oct  4 19:03:12 jacekk Oops: 0000 [#1]
Oct  4 19:03:12 jacekk SMP
Oct  4 19:03:12 jacekk Modules linked in: nvidia
Oct  4 19:03:12 jacekk CPU:    0
Oct  4 19:03:12 jacekk EIP:    0060:[<f96fb125>]    Tainted: P      VLI
Oct  4 19:03:12 jacekk EFLAGS: 00013283   (2.6.17-gentoo-r8 #3)
Oct  4 19:03:12 jacekk EIP is at _nv002631rm+0x89/0x3bc [nvidia]
Oct  4 19:03:12 jacekk eax: 00000000   ebx: 00000002   ecx: 00000000   edx: f7be0c00
Oct  4 19:03:12 jacekk esi: 00000002   edi: 00000002   ebp: f7b51bdc   esp: f7b51b94
Oct  4 19:03:12 jacekk ds: 007b   es: 007b   ss: 0068
Oct  4 19:03:12 jacekk Process X (pid: 4532, threadinfo=f7b50000 task=f7e74520)
Oct  4 19:03:12 jacekk Stack: 00000000 00000041 f7b51bdc f96fb11c f7be0c00 00000007 00000000 f96fc17c
Oct  4 19:03:12 jacekk 00000024 f7be0c00 f7b51c9c f96f822d f78e1000 0000001b 00000001 f7be0d40
Oct  4 19:03:12 jacekk f7d4c000 00000000 f7b51c4c f96dcb37 f78e1000 f7be0c00 00000002 00000002
Oct  4 19:03:12 jacekk Call Trace:
Oct  4 19:03:12 jacekk <f96fb11c> _nv002631rm+0x80/0x3bc [nvidia]  <f96fc17c> _nv000853rm+0x38/0x44 [nvidia]
Oct  4 19:03:12 jacekk <f96f822d> _nv006929rm+0xe1/0x17c [nvidia]  <f96dcb37> _nv000654rm+0x23f/0x5a8 [nvidia]
Oct  4 19:03:12 jacekk <f96dd160> _nv000545rm+0x124/0x150 [nvidia]  <f96ddae1> _nv000558rm+0xd1/0x190 [nvidia]
Oct  4 19:03:12 jacekk <f96ddacd> _nv000558rm+0xbd/0x190 [nvidia]  <f95ff0aa> _nv002668rm+0x26/0x2c [nvidia]
Oct  4 19:03:12 jacekk <f970660c> _nv000419rm+0x24/0x28 [nvidia]  <f96d9ff7> _nv004159rm+0xe3/0xec [nvidia]
Oct  4 19:03:12 jacekk <f96d8b4a> _nv000634rm+0x4a/0xac [nvidia]  <f96d8b5a> _nv000634rm+0x5a/0xac [nvidia]
Oct  4 19:03:12 jacekk <f9606e1b> _nv002823rm+0x1b/0x50 [nvidia]  <f96113d4> _nv001816rm+0x210/0x260 [nvidia]
Oct  4 19:03:12 jacekk <f96113e4> _nv001816rm+0x220/0x260 [nvidia]  <f9611e5e> _nv001823rm+0x12a/0x3f0 [nvidia]
Oct  4 19:03:12 jacekk <f9611ec1> _nv001823rm+0x18d/0x3f0 [nvidia]  <f9611e66> _nv001823rm+0x132/0x3f0 [nvidia]
Oct  4 19:03:12 jacekk <f9867fb3> os_acquire_sema+0x52/0x5e [nvidia]  <f960e666> _nv002584rm+0x12/0x18 [nvidia]
Oct  4 19:03:12 jacekk <f96165cd> rm_init_adapter+0x59/0x80 [nvidia]  <f9865319> nv_kern_isr+0x0/0x62 [nvidia]
Oct  4 19:03:12 jacekk <f9865b47> nv_kern_open+0x210/0x2b4 [nvidia]  <c014d2b8> chrdev_open+0x124/0x161
Oct  4 19:03:12 jacekk <c014d194> chrdev_open+0x0/0x161  <c0144c5f> __dentry_open+0xc7/0x1ab
Oct  4 19:03:12 jacekk <c0144da7> nameidata_to_filp+0x19/0x28  <c0144de1> do_filp_open+0x2b/0x31
Oct  4 19:03:12 jacekk <c0144e1b> do_sys_open+0x34/0x65  <c0144e79> sys_open+0x16/0x18
Oct  4 19:03:12 jacekk <c010267b> sysenter_past_esp+0x54/0x75
Oct  4 19:03:12 jacekk Code: 00 00 8b 45 0c 39 b0 50 01 00 00 74 06 8b b0 50 01 00 00 83 c4 f4 8b 55 0c 52 8b 82 94 00 00 00 ff d0 89 45 fc 83 c4 f4 50 89 c1 <8b> 81 94 00 00 00 ff d0 83 c4 20 3b 45 24 74 03 89 45 24 8b 45
Oct  4 19:03:12 jacekk EIP: [<f96fb125>] _nv002631rm+0x89/0x3bc [nvidia] SS:ESP 0068:f7b51b94
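
The reported fault address can be cross-checked against the Code: line, where the byte in angle brackets marks the instruction at EIP. Below is a minimal sketch in Python, not a general disassembler: it only handles the one instruction form seen in this oops (opcode 8b with a 32-bit displacement and no SIB byte). The helper names are made up for illustration, and the register values are copied by hand from the register dump above.

```python
# Sketch: decode the faulting instruction from the "Code:" line of the oops.
# Only the form seen here is handled (opcode 8b, ModRM mod=10, no SIB byte).

REGS32 = ["eax", "ecx", "edx", "ebx", "esp", "ebp", "esi", "edi"]

def faulting_bytes(code_line):
    """Return the bytes from the <xx> marker onward as a list of ints."""
    tokens = code_line.split()
    start = next(i for i, t in enumerate(tokens) if t.startswith("<"))
    return [int(t.strip("<>"), 16) for t in tokens[start:]]

def decode_mov_disp32(raw):
    """Decode 'mov r32, [r32 + disp32]' (8b /r with mod=10, rm != 100)."""
    opcode, modrm = raw[0], raw[1]
    mod, reg, rm = modrm >> 6, (modrm >> 3) & 7, modrm & 7
    if opcode != 0x8B or mod != 0b10 or rm == 0b100:
        raise ValueError("instruction form not handled by this sketch")
    disp = int.from_bytes(bytes(raw[2:6]), "little")
    return REGS32[reg], REGS32[rm], disp

# Tail of the Code: line from the oops, starting a few bytes before EIP.
code_tail = "50 89 c1 <8b> 81 94 00 00 00 ff d0 83 c4 20"
dest, base, disp = decode_mov_disp32(faulting_bytes(code_tail))

# Register values taken from the oops register dump (eax and ecx are both 0).
registers = {"eax": 0x00000000, "ecx": 0x00000000}
fault_addr = registers[base] + disp

print(f"mov {dest}, [{base}+{disp:#x}] -> faults at {fault_addr:#010x}")
```

With ecx equal to zero, the load from [ecx+0x94] lands on address 00000094, which matches the "NULL pointer dereference at virtual address 00000094" line. The following bytes ff d0 are a call through eax, so the code appears to be fetching a function pointer from a structure whose base pointer is NULL.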


And emerge --info:
Portage 2.1.1 (default-linux/x86/2006.1, gcc-4.1.1, glibc-2.4-r3, 2.6.17-gentoo-r8 i686)
=================================================================
System uname: 2.6.17-gentoo-r8 i686 Intel(R) Pentium(R) 4 CPU 2.80GHz
Gentoo Base System version 1.12.1
Last Sync: Tue, 03 Oct 2006 10:00:01 +0000
app-admin/eselect-compiler: [Not Present]
dev-java/java-config: [Not Present]
dev-lang/python:     2.4.3-r1
dev-python/pycrypto: 2.0.1-r5
dev-util/ccache:     [Not Present]
dev-util/confcache:  [Not Present]
sys-apps/sandbox:    1.2.17
sys-devel/autoconf:  2.13, 2.59-r7
sys-devel/automake:  1.4_p6, 1.5, 1.6.3, 1.7.9-r1, 1.8.5-r3, 1.9.6-r2
sys-devel/binutils:  2.16.1-r3
sys-devel/gcc-config: 1.3.13-r3
sys-devel/libtool:   1.5.22
virtual/os-headers:  2.6.11-r2
ACCEPT_KEYWORDS="x86"
AUTOCLEAN="yes"
CBUILD="i686-pc-linux-gnu"
CFLAGS="-O2 -march=i686 -pipe"
CHOST="i686-pc-linux-gnu"
CONFIG_PROTECT="/etc /usr/kde/3.5/env /usr/kde/3.5/share/config /usr/kde/3.5/shutdown /usr/share/X11/xkb /usr/share/config"
CONFIG_PROTECT_MASK="/etc/env.d /etc/gconf /etc/terminfo"
CXXFLAGS="-O2 -march=i686 -pipe"
DISTDIR="/usr/portage/distfiles"
FEATURES="autoconfig distlocks metadata-transfer sandbox sfperms strict"
GENTOO_MIRRORS="http://gentoo.zie.pg.gda.pl ftp://gentoo.po.opole.pl "
LC_ALL="pl_PL.UTF-8"
LINGUAS="pl en"
MAKEOPTS="-j3"
PKGDIR="/usr/portage/packages"
PORTAGE_RSYNC_OPTS="--recursive --links --safe-links --perms --times --compress --force --whole-file --delete --delete-after --stats --timeout=180 --exclude='/distfiles' --exclude='/local' --exclude='/packages'"
PORTAGE_TMPDIR="/var/tmp"
PORTDIR="/usr/portage"
SYNC="rsync://rsync.gentoo.org/gentoo-portage"
USE="x86 X alsa arts berkdb bitmap-fonts cli crypt dlloader dri elibc_glibc fortran gdbm gpm hardened input_devices_evdev input_devices_keyboard input_devices_mouse ipv6 isdnlog kde kernel_linux libg++ linguas_en linguas_pl ncurses nls nptl nptlonly opengl pam pcre perl ppds pppd python qt readline reflection samba session spl ssl tcpd truetype-fonts type1-fonts udev unicode userland_GNU video_cards_apm video_cards_ark video_cards_ati video_cards_chips video_cards_cirrus video_cards_cyrix video_cards_dummy video_cards_fbdev video_cards_glint video_cards_i128 video_cards_i740 video_cards_i810 video_cards_imstt video_cards_mga video_cards_neomagic video_cards_nsc video_cards_nv video_cards_rendition video_cards_s3 video_cards_s3virge video_cards_savage video_cards_siliconmotion video_cards_sis video_cards_sisusb video_cards_tdfx video_cards_tga video_cards_trident video_cards_tseng video_cards_v4l video_cards_vesa video_cards_vga video_cards_via video_cards_vmware video_cards_voodoo winbind xorg zlib"
Unset:  CTARGET, EMERGE_DEFAULT_OPTS, INSTALL_MASK, LANG, LDFLAGS, PORTAGE_RSYNC_EXTRA_OPTS, PORTDIR_OVERLAY
Comment 1 Jacek 2006-10-05 12:02:48 UTC
After emerging nvidia-legacy-drivers-1.0.7184 (instead of nvidia-drivers), X starts without an oops.

But back to the oops: I ran strace on the X process, and it hangs while opening /dev/nvidia0:
write(0, "(**) NVIDIA(0): Depth 24, ", 26) = 26
write(0, "(--) framebuffer bpp 32\n", 24) = 24
write(0, "(==) NVIDIA(0): RGB weight 888\n", 31) = 31
write(0, "(==) NVIDIA(0): Default visual i"..., 44) = 44
write(0, "(==) NVIDIA(0): Using gamma corr"..., 55) = 55
write(0, "(**) NVIDIA(0): Enabling RENDER "..., 45) = 45
open("/proc/modules", O_RDONLY)         = 8
fstat64(8, {st_mode=S_IFREG|0444, st_size=0, ...}) = 0
mmap2(NULL, 4096, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_ANONYMOUS, -1, 0) = 0xb7ba7000
read(8, "intel_agp 18204 1 - Live 0xf89d2"..., 1024) = 121
close(8)                                = 0
munmap(0xb7ba7000, 4096)                = 0
geteuid32()                             = 0
open("/proc/driver/nvidia/registry", O_RDONLY) = 8
fstat64(8, {st_mode=S_IFREG|0444, st_size=0, ...}) = 0
mmap2(NULL, 4096, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_ANONYMOUS, -1, 0) = 0xb7ba7000
read(8, "VideoMemoryTypeOverride: 1\nEnabl"..., 1024) = 481
read(8, "", 1024)                       = 0
read(8, "", 1024)                       = 0
close(8)                                = 0
munmap(0xb7ba7000, 4096)                = 0
stat64("/dev/nvidiactl", 0xbfeb1070)    = -1 ENOENT (No such file or directory)
stat64("/dev/nvidiactl", 0xbfeb1070)    = -1 ENOENT (No such file or directory)
mknod("/dev/nvidiactl", S_IFCHR|0666, makedev(195, 255)) = 0
chmod("/dev/nvidiactl", 0666)           = 0
chown32("/dev/nvidiactl", 0, 0)         = 0
open("/dev/nvidiactl", O_RDWR)          = 8
getuid32()                              = 0
geteuid32()                             = 0
ioctl(8, 0xc0e046c8, 0xb73d4800)        = 0
ioctl(8, 0xc00c4622, 0xbfeb1208)        = 0
ioctl(8, 0xc040464d, 0xbfeb1164)        = 0
ioctl(8, 0xc040464d, 0xbfeb1164)        = 0
ioctl(8, 0xc040464d, 0xbfeb1164)        = 0
open("/proc/driver/nvidia/registry", O_RDONLY) = 9
fstat64(9, {st_mode=S_IFREG|0444, st_size=0, ...}) = 0
mmap2(NULL, 4096, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_ANONYMOUS, -1, 0) = 0xb7ba7000
read(9, "VideoMemoryTypeOverride: 1\nEnabl"..., 1024) = 481
read(9, "", 1024)                       = 0
read(9, "", 1024)                       = 0
close(9)                                = 0
munmap(0xb7ba7000, 4096)                = 0
stat64("/dev/nvidia0", 0xbfeb0fe0)      = -1 ENOENT (No such file or directory)
stat64("/dev/nvidia0", 0xbfeb0fe0)      = -1 ENOENT (No such file or directory)
mknod("/dev/nvidia0", S_IFCHR|0666, makedev(195, 0)) = 0
chmod("/dev/nvidia0", 0666)             = 0
chown32("/dev/nvidia0", 0, 0)           = 0
open("/dev/nvidia0", O_RDWR 
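
In strace output, a call that returned is printed with a trailing " = result", so the call the process is stuck in is the one without it. A minimal sketch of picking such lines out of a saved trace, assuming the one-call-per-line format shown above (the function name and regular expression are illustrative, not part of strace):

```python
import re

# A completed strace line ends with ") = <result>", possibly followed by an
# errno name and description. A call that never returned lacks that suffix.
COMPLETED = re.compile(r"\)\s*=\s*-?\w+.*$")

def unfinished_calls(lines):
    """Return the non-empty trace lines that have no return value."""
    return [ln.strip() for ln in lines if ln.strip() and not COMPLETED.search(ln)]

# Last lines of the trace from this comment.
sample = [
    'stat64("/dev/nvidia0", 0xbfeb0fe0)      = -1 ENOENT (No such file or directory)',
    'mknod("/dev/nvidia0", S_IFCHR|0666, makedev(195, 0)) = 0',
    'chmod("/dev/nvidia0", 0666)             = 0',
    'chown32("/dev/nvidia0", 0, 0)           = 0',
    'open("/dev/nvidia0", O_RDWR ',
]

print(unfinished_calls(sample))
```

For this trace the only unfinished call is the open() of /dev/nvidia0, which is consistent with the nv_kern_open and rm_init_adapter frames in the oops call trace.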
Comment 2 Chris Gianelloni (RETIRED) gentoo-dev 2006-10-17 06:15:43 UTC
What about nvidia-drivers 8774? Or 9625/9626?
Comment 3 Jacek 2006-10-17 11:12:50 UTC
On nvidia-drivers 1.0.8774:
Oct 17 19:46:13 jacekk ACPI: PCI Interrupt 0000:01:00.0[A] -> GSI 16 (level, low) -> IRQ 18
Oct 17 19:46:13 jacekk NVRM: loading NVIDIA Linux x86 Kernel Module  1.0-8774  Tue Aug  1 20:54:08 PDT 2006
Oct 17 19:46:13 jacekk BUG: unable to handle kernel NULL pointer dereference at virtual address 00000094
Oct 17 19:46:13 jacekk printing eip:
Oct 17 19:46:13 jacekk f977d195
Oct 17 19:46:13 jacekk *pde = 00000000
Oct 17 19:46:13 jacekk Oops: 0000 [#1]
Oct 17 19:46:13 jacekk SMP
Oct 17 19:46:13 jacekk Modules linked in: nvidia w83627hf hwmon_vid hwmon eeprom i2c_isa i2c_i801
Oct 17 19:46:13 jacekk CPU:    0
Oct 17 19:46:13 jacekk EIP:    0060:[<f977d195>]    Tainted: P      VLI
Oct 17 19:46:13 jacekk EFLAGS: 00013283   (2.6.17-gentoo-r8 #10)
Oct 17 19:46:13 jacekk EIP is at _nv002631rm+0x89/0x3bc [nvidia]
Oct 17 19:46:13 jacekk eax: 00000000   ebx: 00000002   ecx: 00000000   edx: f7947000
Oct 17 19:46:13 jacekk esi: 00000002   edi: 00000002   ebp: d0c67bdc   esp: d0c67b94
Oct 17 19:46:13 jacekk ds: 007b   es: 007b   ss: 0068
Oct 17 19:46:13 jacekk Process X (pid: 10274, threadinfo=d0c66000 task=d135c070)
Oct 17 19:46:13 jacekk Stack: 00000000 00000041 d0c67bdc f977d18c f7947000 00000007 00000000 f977e1ec
Oct 17 19:46:13 jacekk 00000024 f7947000 d0c67c9c f977a29d d9326000 0000001b 00000001 f7947140
Oct 17 19:46:13 jacekk f7a9a400 00000000 d0c67c4c f975eba7 d9326000 f7947000 00000002 00000002
Oct 17 19:46:13 jacekk Call Trace:
Oct 17 19:46:13 jacekk <f977d18c> _nv002631rm+0x80/0x3bc [nvidia]  <f977e1ec> _nv000853rm+0x38/0x44 [nvidia]
Oct 17 19:46:13 jacekk <f977a29d> _nv006929rm+0xe1/0x17c [nvidia]  <f975eba7> _nv000654rm+0x23f/0x5a8 [nvidia]
Oct 17 19:46:13 jacekk <f975f1d0> _nv000545rm+0x124/0x150 [nvidia]  <f975fb51> _nv000558rm+0xd1/0x190 [nvidia]
Oct 17 19:46:13 jacekk <f975fb3d> _nv000558rm+0xbd/0x190 [nvidia]  <f968110a> _nv002668rm+0x26/0x2c [nvidia]
Oct 17 19:46:13 jacekk <f978867c> _nv000419rm+0x24/0x28 [nvidia]  <f975c067> _nv004159rm+0xe3/0xec [nvidia]
Oct 17 19:46:13 jacekk <f975abba> _nv000634rm+0x4a/0xac [nvidia]  <f975abca> _nv000634rm+0x5a/0xac [nvidia]
Oct 17 19:46:13 jacekk <f9688e7b> _nv002823rm+0x1b/0x50 [nvidia]  <f9693444> _nv001816rm+0x210/0x260 [nvidia]
Oct 17 19:46:13 jacekk <f9693454> _nv001816rm+0x220/0x260 [nvidia]  <f9693ece> _nv001823rm+0x12a/0x3f0 [nvidia]
Oct 17 19:46:13 jacekk <f9693f31> _nv001823rm+0x18d/0x3f0 [nvidia]  <f9693ed6> _nv001823rm+0x132/0x3f0 [nvidia]
Oct 17 19:46:13 jacekk <f98ea173> os_acquire_sema+0x52/0x5e [nvidia]  <f96906d6> _nv002584rm+0x12/0x18 [nvidia]
Oct 17 19:46:13 jacekk <f969863d> rm_init_adapter+0x59/0x80 [nvidia]  <f98e74a9> nv_kern_isr+0x0/0x62 [nvidia]
Oct 17 19:46:13 jacekk <f98e7cd7> nv_kern_open+0x210/0x2b4 [nvidia]  <c014fe6b> chrdev_open+0x124/0x161
Oct 17 19:46:13 jacekk <c014fd47> chrdev_open+0x0/0x161  <c01477e3> __dentry_open+0xc7/0x1ab
Oct 17 19:46:13 jacekk <c014792b> nameidata_to_filp+0x19/0x28  <c0147965> do_filp_open+0x2b/0x31
Oct 17 19:46:13 jacekk <c014799f> do_sys_open+0x34/0x65  <c01479fd> sys_open+0x16/0x18
Oct 17 19:46:13 jacekk <c010267b> sysenter_past_esp+0x54/0x75
Oct 17 19:46:13 jacekk Code: 00 00 8b 45 0c 39 b0 50 01 00 00 74 06 8b b0 50 01 00 00 83 c4 f4 8b 55 0c 52 8b 82 94 00 00 00 ff d0 89 45 fc 83 c4 f4 50 89 c1 <8b> 81 94 00 00 00 ff d0 83 c4 20 3b 45 24 74 03 89 45 24 8b 45
Oct 17 19:46:13 jacekk EIP: [<f977d195>] _nv002631rm+0x89/0x3bc [nvidia] SS:ESP 0068:d0c67b94
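
The 8762 and 8774 oopses report the same symbol and offset in the "EIP is at" line, which suggests both versions fail in the same place. A small sketch of comparing two oops logs on that basis; the regular expression and function name are assumptions for illustration, and since the _nvNNNNNNrm names are obfuscated, matching names across driver builds is a hint rather than proof.

```python
import re

# Matches e.g. "EIP is at _nv002631rm+0x89/0x3bc [nvidia]"
EIP_AT = re.compile(r"EIP is at (\S+)\+(0x[0-9a-f]+)/(0x[0-9a-f]+) \[(\w+)\]")

def crash_signature(log_text):
    """Return (module, symbol, offset, size) from an oops log, or None."""
    m = EIP_AT.search(log_text)
    if m is None:
        return None
    symbol, offset, size, module = m.groups()
    return module, symbol, int(offset, 16), int(size, 16)

# "EIP is at" lines from the two oopses in this bug.
oops_8762 = "Oct  4 19:03:12 jacekk EIP is at _nv002631rm+0x89/0x3bc [nvidia]"
oops_8774 = "Oct 17 19:46:13 jacekk EIP is at _nv002631rm+0x89/0x3bc [nvidia]"

same = crash_signature(oops_8762) == crash_signature(oops_8774)
print(crash_signature(oops_8762), same)
```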

1.0-9626 did not crash the X server, and it looks OK so far.
I can only check whether everything really works tomorrow, when I'm at work (right now I'm connected via ssh).
In kern.log:
Oct 17 20:01:30 jacekk ACPI: PCI Interrupt 0000:01:00.0[A] -> GSI 16 (level, low) -> IRQ 18
Oct 17 20:01:30 jacekk NVRM: loading NVIDIA Linux x86 Kernel Module  1.0-9626  Wed Sep 20 16:39:14 PDT 2006

Tomorrow I'll report whether both CRT displays work correctly.

BTW, with nvidia-legacy-drivers-1.0.7184, when I leave an OpenGL screensaver running, the system hangs (typically after 12-30 hours). This has happened 3 times, so I'll check that as well.
Comment 4 Jacek 2006-10-18 03:06:07 UTC
OK, dual monitors are working.
But there is another problem: the OpenGL screensaver stopped working.

glxinfo shows:
direct rendering: Yes

With the previous drivers the screensaver worked correctly.
Comment 5 Chris Gianelloni (RETIRED) gentoo-dev 2006-12-05 09:28:16 UTC
Please try 1.0.9631, which was just added to portage.
Comment 6 Jacek 2006-12-11 06:59:37 UTC
(In reply to comment #5)
> Please try 1.0.9631, which was just added to portage.

Yes, the new version fixed this problem.