Gentoo Websites Logo
Go to: Gentoo Home Documentation Forums Lists Bugs Planet Store Wiki Get Gentoo!

Bug 40147

Summary: nvidia-kernel-1.0.4496-r3 resulting in "badness in pci_find_subsys" causing a freeze
Product: Gentoo Linux Reporter: red0x <red0x>
Component: [OLD] Core systemAssignee: Gentoo X packagers <x11>
Status: RESOLVED WORKSFORME    
Severity: critical    
Priority: Highest    
Version: 1.4   
Hardware: x86   
OS: All   
Whiteboard:
Package list:
Runtime testing required: ---

Description red0x 2004-02-01 21:25:00 UTC
kernels affected: 2.6.1-mm5, 2.6.1-gentoo, 2.6.2-rc2-mm1
kernels not affected: 2.6.1-mm1 (AFAIK)

Dmesg output when system freezes:
Feb  1 20:51:38 xpider Badness in pci_find_subsys at drivers/pci/search.c:132
Feb  1 20:51:38 xpider Call Trace:
Feb  1 20:51:38 xpider [<c025ab81>] pci_find_subsys+0xc1/0xd0
Feb  1 20:51:38 xpider [<c025abbf>] pci_find_device+0x2f/0x40
Feb  1 20:51:38 xpider [<c025aa98>] pci_find_slot+0x28/0x50
Feb  1 20:51:38 xpider [<e0d84d60>] os_pci_init_handle+0x3a/0x67 [nvidia]
Feb  1 20:51:38 xpider [<c025abbf>] pci_find_device+0x2f/0x40
Feb  1 20:51:38 xpider [<e0d9761f>] __nvsym00057+0x1f/0x24 [nvidia]
Feb  1 20:51:38 xpider [<e0e2bc52>] __nvsym03763+0x72/0xe0 [nvidia]
Feb  1 20:51:38 xpider [<e0e70761>] __nvsym04466+0x15/0x78 [nvidia]
Feb  1 20:51:38 xpider [<e0ea7ba7>] __nvsym04875+0x127/0x170 [nvidia]
Feb  1 20:51:38 xpider [<e0e2a65d>] __nvsym03749+0x41/0xbc [nvidia]
Feb  1 20:51:38 xpider [<e0ea794a>] __nvsym00780+0x21a/0x224 [nvidia]
Feb  1 20:51:38 xpider [<e0e2b004>] __nvsym03741+0x74/0x88 [nvidia]
Feb  1 20:51:38 xpider [<e0e29d5f>] __nvsym03751+0x337/0x8a4 [nvidia]
Feb  1 20:51:38 xpider [<e0e6e5a3>] __nvsym00688+0x1e3/0x338 [nvidia]
Feb  1 20:51:38 xpider [<e0d99d29>] __nvsym00827+0xd/0x1c [nvidia]
Feb  1 20:51:38 xpider [<e0d9b3c4>] rm_isr_bh+0xc/0x10 [nvidia]
Feb  1 20:51:38 xpider [<c0123046>] tasklet_action+0x46/0x70
Feb  1 20:51:38 xpider [<c0122e69>] do_softirq+0x99/0xa0
Feb  1 20:51:38 xpider [<c010cdf7>] do_IRQ+0xd7/0x100
Feb  1 20:51:38 xpider [<c010b34c>] common_interrupt+0x18/0x20
Feb  1 20:51:38 xpider

The message repeats 4 times in 0 seconds (wow!), waits 8 seconds, and repeates again twice.  Then, system becomes unstable and it is necessary to use the magic sys-rq key to recover.  Recently, this happened, i rebooted, and it happened right off the bat again! (which leads me to believe it *may* be a cooling problem, but it probably isn't).  

Nvidia-kernel version: [ebuild   R   ] media-video/nvidia-kernel-1.0.4496-r3
Comment 1 red0x 2004-02-01 21:30:46 UTC
xpider red0x # cd /proc/driver/nvidia/agp/
xpider agp # ls
card  host-bridge  status
xpider agp # cat status
Status:          Enabled
Driver:          AGPGART
AGP Rate:        8x
Fast Writes:     Enabled
SBA:             Disabled
xpider agp #
Comment 2 red0x 2004-02-01 21:48:10 UTC
trying 

   Option "NvAGP"  "1"

 in my XF86Config file.  I'll keep you posted on how it works (hopefully, i'll get my screensavers back)
Comment 3 Donnie Berkholz (RETIRED) gentoo-dev 2004-02-01 22:44:13 UTC
Could you please test newer drivers (1.0.5336-r1 in particular, since that's latest)?
Comment 4 red0x 2004-02-21 14:34:46 UTC
the NvAGP option has not helped, I just emerged the most recent drivers today, so we will see how that goes.  Also, I have upgraded to 2.6.3-gentoo-r1.  Wish me luck!
Comment 5 red0x 2004-02-28 05:27:34 UTC
Unfortunately, I'm still getting such errors with the new drivers..  :(
Comment 6 red0x 2004-03-08 00:03:00 UTC
upgraded to 2.6.4-rc1-mm2
I think this may have been a part of the mremap bug...  not sure.  I figured it was worth a shot.

Other things changed: More cooling, removed extra hard drive, tried a PCI promise IDE controller (did not help), new thermal goo and heatsink on processor.
Comment 7 red0x 2004-03-16 08:58:14 UTC
It seems that my upgrade to 2.6.4-rc1 has fixed all my problems with the video card.  If I have anymore troubles, I'll reopen this bad boy.