Bug 203391 - Kernel Bug in 2.6.23 gentoo-sources in skbuff.c (802.11 handling)
Summary: Kernel Bug in 2.6.23 gentoo-sources in skbuff.c (802.11 handling)
Status: RESOLVED NEEDINFO
Alias: None
Product: Gentoo Linux
Classification: Unclassified
Component: [OLD] Core system
Hardware: AMD64 Linux
Importance: High major
Assignee: Gentoo Kernel Bug Wranglers and Kernel Maintainers
Reported: 2007-12-26 18:01 UTC by Tristan Scott
Modified: 2008-01-23 01:10 UTC
Description Tristan Scott 2007-12-26 18:01:13 UTC
This bug occurs using ndiswrapper, but anecdotal evidence on the 'net suggests it is not ndiswrapper-specific.
After reading about the skbuff changes in 2.6.22, I decided to try 2.6.21, which works. Same drivers in ndiswrapper; the module was recompiled, but it is the same version of ndiswrapper.
This is the Gentoo version of the kernel, but the bug seems to occur across kernel sub-versions. Should this be reported as a kernel bug?

Also, I am getting lots of reports (which do not seem to correspond to any particular hardware activity) of:
APIC error on CPU0: 00(40)
APIC error on CPU0: 40(40)
APIC error on CPU0: 40(40)
APIC error on CPU0: 40(40)
APIC error on CPU0: 40(40)
APIC error on CPU0: 40(40)
APIC error on CPU0: 40(40)
These appear in dmesg/the kernel log fairly slowly (every few minutes) while using the laptop normally in X. This did not change between kernel 2.6.23 and 2.6.21, so it seems to be unrelated, but it's the only other thing I can see out of the ordinary.
I can record full kernel logs of 2.6.23 or 2.6.21 if useful.
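For anyone triaging the APIC messages, here is a minimal sketch that decodes the two hex values in each line as APIC Error Status Register bit masks. The bit names follow the list in the comment of the kernel's APIC error interrupt handler as I recall it; treat them as an assumption and check them against the source of the kernel actually in use. The helper names `decode_esr` and `decode_line` are made up for this illustration.

```python
import re

# Assumed ESR bit meanings (from the comment in the kernel's APIC error
# interrupt handler; verify against the kernel version in use).
ESR_BITS = {
    0: "Send CS error",
    1: "Receive CS error",
    2: "Send accept error",
    3: "Receive accept error",
    4: "Reserved",
    5: "Send illegal vector",
    6: "Received illegal vector",
    7: "Illegal register address",
}

LINE_RE = re.compile(r"APIC error on CPU(\d+): ([0-9a-f]{2})\(([0-9a-f]{2})\)")


def decode_esr(value):
    """Return the names of all bits set in an ESR value."""
    return [name for bit, name in ESR_BITS.items() if value & (1 << bit)]


def decode_line(line):
    """Parse one 'APIC error on CPUn: xx(yy)' line, or return None."""
    match = LINE_RE.search(line)
    if match is None:
        return None
    cpu = int(match.group(1))
    first = int(match.group(2), 16)
    second = int(match.group(3), 16)
    return cpu, decode_esr(first), decode_esr(second)


if __name__ == "__main__":
    for text in ("APIC error on CPU0: 00(40)", "APIC error on CPU0: 40(40)"):
        print(text, "->", decode_line(text))
```

Under that assumed bit table, the value 40 seen in every line above has only bit 6 set.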

--- ndiswrapper version
localhost src # ndiswrapper -v
utils version: '1.9', utils version needed by module: '1.9'
module details:
filename:       /lib/modules/2.6.21-gentoo-r4-ElNino/misc/ndiswrapper.ko
version:        1.50
vermagic:       2.6.21-gentoo-r4-ElNino SMP mod_unload

---Output in kernel log

Dec 26 13:09:30 [kernel] ndiswrapper version 1.50 loaded (smp=yes, preempt=no)
Dec 26 13:09:30 [kernel] ndiswrapper (link_pe_images:576): fixing KI_USER_SHARED_DATA address in the driver
Dec 26 13:09:30 [kernel] ndiswrapper: driver bcmwl5 (Broadcom,10/12/2006, 4.100.15.5) loaded
Dec 26 13:09:30 [kernel] ACPI: PCI Interrupt 0000:30:00.0[A] -> GSI 18 (level, low) -> IRQ 18
Dec 26 13:09:30 [kernel] PCI: Setting latency timer of device 0000:30:00.0 to 64
Dec 26 13:09:30 [kernel] ndiswrapper: using IRQ 18
Dec 26 13:09:30 [kernel] wlan0: ethernet device 00:1a:73:a8:bc:d6 using NDIS driver: bcmwl5, version: 0x4640f05, NDIS version: 0x501, vendor: 'NDIS Network Adapter', 14E4:4312.5.conf
Dec 26 13:09:30 [kernel] wlan0: encryption modes supported: WEP; TKIP with WPA, WPA2, WPA2PSK; AES/CCMP with WPA, WPA2, WPA2PSK
Dec 26 13:09:30 [kernel] NTFS volume version 3.1.
                - Last output repeated 2 times -
Dec 26 13:09:30 [kernel] ADDRCONF(NETDEV_UP): wlan0: link is not ready
Dec 26 13:09:30 [kernel] pcmcia: Detected deprecated PCMCIA ioctl usage from process: cardmgr.
Dec 26 13:09:30 [kernel] pcmcia: This interface will soon be removed from the kernel; please expect breakage unless you upgrade to new tools.
Dec 26 13:09:30 [kernel] pcmcia: see http://www.kernel.org/pub/linux/utils/kernel/pcmcia/pcmcia.html for details.
Dec 26 13:09:31 [kernel] ADDRCONF(NETDEV_UP): eth0: link is not ready
Dec 26 13:09:32 [kernel] ADDRCONF(NETDEV_CHANGE): wlan0: link becomes ready
Dec 26 13:09:41 [kernel] kernel BUG at net/core/skbuff.c:95!
Dec 26 13:09:41 [kernel] CPU 1
Dec 26 13:09:41 [kernel] Modules linked in: ndiswrapper fglrx(P)
Dec 26 13:09:41 [kernel] Pid: 4240, comm: ntos_wq Tainted: P        2.6.23-gentoo-r3-ElNino #7
Dec 26 13:09:41 [kernel] RIP: 0010:[<ffffffff80491840>]  [<ffffffff80491840>] skb_over_panic+0x57/0x5b
Dec 26 13:09:41 [kernel] RSP: 0018:ffff810072547988  EFLAGS: 00010292
Dec 26 13:09:41 [kernel] RAX: 0000000000000080 RBX: ffff810074cbddc0 RCX: 0000000100000000
Dec 26 13:09:41 [kernel] RDX: ffffffff806489a8 RSI: 0000000000000096 RDI: ffffffff806489a0
Dec 26 13:09:41 [kernel] RBP: ffff8100754b3740 R08: ffffffff806489a8 R09: 0000000000000001
Dec 26 13:09:41 [kernel] R10: 0000000000000c31 R11: ffffffff8021cd0a R12: 000000000000002a
Dec 26 13:09:41 [kernel] R13: 0000000000000010 R14: ffff8100001290d6 R15: 0000000000000040
Dec 26 13:09:41 [kernel] FS:  00002ae065c1af40(0000) GS:ffff81000128bbc0(0000) knlGS:0000000000000000
Dec 26 13:09:41 [kernel] CS:  0010 DS: 0018 ES: 0018 CR0: 000000008005003b
Dec 26 13:09:41 [kernel] CR2: 00002ae065a43f30 CR3: 0000000000201000 CR4: 00000000000006e0
Dec 26 13:09:41 [kernel] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Dec 26 13:09:41 [kernel] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Dec 26 13:09:41 [kernel] Process ntos_wq (pid: 4240, threadinfo ffff810072546000, task ffff81007218a040)
Dec 26 13:09:41 [kernel] Stack:  0000000000000000 0000000000000040 ffffffff805dc291 ffffffff880d074e
Dec 26 13:09:41 [kernel]  00000001001290d6 ffff810072547a60 ffff810074dee200 00000000749db900
Dec 26 13:09:41 [kernel]  ffff810074dee277 ffffc2000036d000 ffff8100001290d6 ffff8100751fa618
Dec 26 13:09:41 [kernel] Call Trace:
Dec 26 13:09:41 [kernel]  [<ffffffff880d074e>] :ndiswrapper:NdisMIndicateReceivePacket+0x120/0x23f
Dec 26 13:09:41 [kernel]  [<ffffffff880df10b>] :ndiswrapper:get_iw_stats+0xc22/0xd37
Dec 26 13:09:41 [kernel]  [<ffffffff8023b8fb>] lock_timer_base+0x26/0x4b
Dec 26 13:09:41 [kernel]  [<ffffffff880d0f26>] :ndiswrapper:NdisAllocateBuffer+0x11a/0x138
Dec 26 13:09:41 [kernel]  [<ffffffff8051ede7>] _spin_lock_bh+0x9/0x19
                - Last output repeated twice -
Dec 26 13:09:41 [kernel]  [<ffffffff880df0f7>] :ndiswrapper:get_iw_stats+0xc0e/0xd37
Dec 26 13:09:41 [kernel]  [<ffffffff880d5858>] :ndiswrapper:KeSynchronizeExecution+0x25/0x3f
Dec 26 13:09:41 [kernel]  [<ffffffff880d3bb0>] :ndiswrapper:KeRaiseIrqlToDpcLevel+0x7b/0x1ae
Dec 26 13:09:41 [kernel]  [<ffffffff880cf727>] :ndiswrapper:deserialized_irq_handler+0x17/0x34
Dec 26 13:09:41 [kernel]  [<ffffffff8051d26e>] thread_return+0x0/0xd8
Dec 26 13:09:41 [kernel]  [<ffffffff880d3c83>] :ndiswrapper:KeRaiseIrqlToDpcLevel+0x14e/0x1ae
Dec 26 13:09:41 [kernel]  [<ffffffff8051ddce>] mutex_lock+0xd/0x1e
Dec 26 13:09:41 [kernel]  [<ffffffff880d3c6a>] :ndiswrapper:KeRaiseIrqlToDpcLevel+0x135/0x1ae
Dec 26 13:09:41 [kernel]  [<ffffffff80241971>] run_workqueue+0x83/0x10c
Dec 26 13:09:41 [kernel]  [<ffffffff8024226a>] worker_thread+0x0/0xe4
Dec 26 13:09:41 [kernel]  [<ffffffff80242344>] worker_thread+0xda/0xe4
Dec 26 13:09:41 [kernel]  [<ffffffff8024504e>] autoremove_wake_function+0x0/0x2e
Dec 26 13:09:41 [kernel]  [<ffffffff80244f30>] kthread+0x47/0x73
Dec 26 13:09:41 [kernel]  [<ffffffff8020c188>] child_rip+0xa/0x12
Dec 26 13:09:41 [kernel]  [<ffffffff80244ee9>] kthread+0x0/0x73
Dec 26 13:09:41 [kernel]  [<ffffffff8020c17e>] child_rip+0x0/0x12
Dec 26 13:09:41 [kernel] Code: 0f 0b eb fe 48 8b 46 20 49 89 f8 49 89 f1 48 c7 47 10 00 00
Dec 26 13:09:41 [kernel] RIP  [<ffffffff80491840>] skb_over_panic+0x57/0x5b
Dec 26 13:09:41 [kernel]  RSP <ffff810072547988>
Dec 26 13:09:42 [kernel] wlan0: no IPv6 routers present
Dec 26 13:09:56 [kernel] ACPI: PCI Interrupt 0000:01:05.0[B] -> GSI 19 (level, low) -> IRQ 19
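
To make the trace above easier to compare between kernel versions, here is a minimal sketch that pulls the BUG location and the (module, symbol) pairs out of a log in this format. It assumes the syslog layout shown in this report (a `kernel BUG at file:line!` line, a `Call Trace:` marker, frames of the form `[<addr>] :module:symbol+0xoff/0xlen`); the function name `summarize_oops` and the `SAMPLE` excerpt are only for illustration, and other kernels format traces differently.

```python
import re

BUG_RE = re.compile(r"kernel BUG at (?P<file>[^:\s]+):(?P<line>\d+)!")
FRAME_RE = re.compile(
    r"\[<[0-9a-f]+>\]\s+(?::(?P<module>\w+):)?(?P<symbol>[\w.]+)"
    r"\+0x[0-9a-f]+/0x[0-9a-f]+"
)

# Short excerpt copied from the log in this report.
SAMPLE = """\
Dec 26 13:09:41 [kernel] kernel BUG at net/core/skbuff.c:95!
Dec 26 13:09:41 [kernel] Call Trace:
Dec 26 13:09:41 [kernel]  [<ffffffff880d074e>] :ndiswrapper:NdisMIndicateReceivePacket+0x120/0x23f
Dec 26 13:09:41 [kernel]  [<ffffffff8023b8fb>] lock_timer_base+0x26/0x4b
Dec 26 13:09:41 [kernel] Code: 0f 0b eb fe
"""


def summarize_oops(log_text):
    """Return ((file, line) or None, [(module, symbol), ...]) for one oops."""
    bug = None
    frames = []
    in_trace = False
    for line in log_text.splitlines():
        match = BUG_RE.search(line)
        if match:
            bug = (match.group("file"), int(match.group("line")))
            continue
        if "Call Trace:" in line:
            in_trace = True
            continue
        if not in_trace:
            continue
        if "Last output repeated" in line:
            continue  # syslog de-duplication note, not a frame
        match = FRAME_RE.search(line)
        if match:
            # Frames without a ":module:" prefix belong to the core kernel.
            frames.append((match.group("module") or "kernel", match.group("symbol")))
        else:
            in_trace = False  # first non-frame line ends the trace
    return bug, frames


if __name__ == "__main__":
    location, trace = summarize_oops(SAMPLE)
    print(location)
    for module, symbol in trace:
        print(module, symbol)
```

Running it over the full log should list which frames come from ndiswrapper and which from the core kernel, which may help decide where to report the bug.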


Reproducible: Always

Steps to Reproduce:
1. Use kernel 2.6.23 on an AMD64 laptop with a Broadcom 4312 wireless controller.
2. Use ndiswrapper with any bcmwl5 (Windows XP, for the 4312) drivers
3. modprobe ndiswrapper

Actual Results:  
A kernel BUG appears in dmesg and the entire wireless subsystem stops responding.
No other (PCMCIA) wireless cards will register, though their drivers load. iwconfig hangs forever.
All wireless operation is down.

Expected Results:  
As experienced on 2.6.21: the driver loads normally, ndiswrapper reports the capabilities of the card, wpa_supplicant takes over, and the network interface comes up as normal.

Comment 1 Jakub Moc (RETIRED) gentoo-dev 2007-12-26 22:54:19 UTC
Reopen with real errors....
Comment 2 Jakub Moc (RETIRED) gentoo-dev 2007-12-27 10:23:27 UTC
Sigh, stupid bugzilla jumping on next bug in the list.
Comment 3 Mike Pagano gentoo-dev 2008-01-01 21:40:21 UTC
Let's start by testing the latest gentoo-sources, which is 2.6.23-r5 as of this writing. If that fails, please test with the latest development vanilla kernel, which is vanilla-kernel-2.6.24_rc6 as of this writing.

Please also post emerge --info from the failing system.
Comment 4 Mike Pagano gentoo-dev 2008-01-23 01:10:50 UTC
Feel free to reopen when you have had a chance to perform the tests from comment #3.