After updating to 2.6.31 I'm getting a number of wifi-related problems, the worst being a complete system crash after a random length of time. My dmesg log gets flooded with a kernel error repeatedly (see below) and eventually the whole system locks up with no warning. Reproducible: Always Steps to Reproduce: 1. emerge sys-kernel/gentoo-sources net-wireless/hostapd 2. Try to connect/disconnect from the access point a few times 3. Actual Results: hostapd eventually stops responding, the machine itself is dead and needs a power-cycle. Expected Results: It works fine on 2.6.30. I can get this to crash with vanilla-sources 2.6.31, but I'm not sure if the problem's with hostapd or the kernel itself. This is the dmesg stuff. Often when it crashes the console output ends right after "<EOI>". I've tried booting with "idle=halt" and "noapic", but it doesn't help: Sep 21 20:36:38 [kernel] [74880.178203] ------------[ cut here ]------------ Sep 21 20:36:38 [kernel] [74880.178203] WARNING: at net/mac80211/tx.c:1325 ieee80211_tx+0x204/0x240() Sep 21 20:36:38 [kernel] [74880.178203] Hardware name: Sep 21 20:36:38 [kernel] [74880.178203] tx refused but queue active Sep 21 20:36:38 [kernel] [74880.178203] Modules linked in: ipt_REJECT xt_tcpudp iptable_filter ip_tables ip6table_filter ip6_tables x_tables sr_mod cdrom p54usb p54common [last unloaded: microcode] Sep 21 20:36:38 [kernel] [74880.178203] Pid: 0, comm: swapper Tainted: G W 2.6.31-gentoo #2 Sep 21 20:36:38 [kernel] [74880.178203] Call Trace: Sep 21 20:36:38 [kernel] [74880.178203] <IRQ> [<ffffffff81039663>] warn_slowpath_common+0x88/0xb6 Sep 21 20:36:38 [kernel] [74880.178203] [<ffffffff81039714>] warn_slowpath_fmt+0x4b/0x61 Sep 21 20:36:38 [kernel] [74880.178203] [<ffffffff8131d17a>] ieee80211_tx+0x204/0x240 Sep 21 20:36:38 [kernel] [74880.178203] [<ffffffff8131d35a>] ieee80211_tx_pending+0x1a4/0x28f Sep 21 20:36:38 [kernel] [74880.178203] [<ffffffff81053887>] ? ktime_get_ts+0x5c/0x79 Sep 21 20:36:38 [kernel] [74880.178203] [<ffffffff8103e482>] tasklet_action+0x7e/0xe0 Sep 21 20:36:38 [kernel] [74880.178203] [<ffffffff8103f0b0>] __do_softirq+0xa5/0x151 Sep 21 20:36:38 [kernel] [74880.178203] [<ffffffff8101f282>] ? apic_write+0x24/0x3a Sep 21 20:36:38 [kernel] [74880.178203] [<ffffffff81020c65>] ? ack_apic_level+0x4f/0xfc Sep 21 20:36:38 [kernel] [74880.178203] [<ffffffff8100cd7c>] call_softirq+0x1c/0x28 Sep 21 20:36:38 [kernel] [74880.178203] [<ffffffff8100edef>] do_softirq+0x43/0x98 Sep 21 20:36:38 [kernel] [74880.178203] [<ffffffff8103ed2f>] irq_exit+0x52/0xa3 Sep 21 20:36:38 [kernel] [74880.178203] [<ffffffff8100e4aa>] do_IRQ+0xb3/0xe0 Sep 21 20:36:38 [kernel] [74880.178203] [<ffffffff8100c613>] ret_from_intr+0x0/0xa Sep 21 20:36:38 [kernel] [74880.178203] <EOI> [<ffffffff81013a0e>] ? mwait_idle+0x81/0x9a Sep 21 20:36:38 [kernel] [74880.178203] [<ffffffff8100af55>] ? enter_idle+0x33/0x49 Sep 21 20:36:38 [kernel] [74880.178203] [<ffffffff8100b015>] ? cpu_idle+0xaa/0xed Sep 21 20:36:38 [kernel] [74880.178203] [<ffffffff81322bf0>] ? rest_init+0x74/0x8a Sep 21 20:36:38 [kernel] [74880.178203] [<ffffffff814b9297>] ? start_kernel+0x308/0x329 Sep 21 20:36:38 [kernel] [74880.178203] [<ffffffff814b86e1>] ? x86_64_start_reservations+0x91/0xac Sep 21 20:36:38 [kernel] [74880.178203] [<ffffffff814b87ed>] ? x86_64_start_kernel+0xf1/0x10c Sep 21 20:36:38 [kernel] [74880.178203] ---[ end trace 0ec0ca83d4612857 ]--- emerge --info: Portage 2.1.6.13 (default/linux/amd64/2008.0/no-multilib, gcc-4.4.1, glibc-2.10.1-r0, 2.6.31-gentoo x86_64) ================================================================= System uname: Linux-2.6.31-gentoo-x86_64-Intel-R-_Atom-TM-_CPU_230_@_1.60GHz-with-gentoo-2.0.1 Timestamp of tree: Thu, 17 Sep 2009 17:45:01 +0000 distcc 3.1 x86_64-pc-linux-gnu [enabled] app-shells/bash: 4.0_p33 dev-lang/python: 2.6.2-r1, 3.1.1 sys-apps/baselayout: 2.0.1 sys-apps/openrc: 0.4.3-r3 sys-apps/sandbox: 2.1 sys-devel/autoconf: 2.63-r1 sys-devel/automake: 1.10.2, 1.11 sys-devel/binutils: 2.19.1-r1 sys-devel/gcc-config: 1.4.1 sys-devel/libtool: 2.2.6a virtual/os-headers: 2.6.30-r1 ACCEPT_KEYWORDS="amd64 ~amd64" CBUILD="x86_64-pc-linux-gnu" CFLAGS="-O2 -march=core2 -mtune=generic -pipe" CHOST="x86_64-pc-linux-gnu" CONFIG_PROTECT="/etc /var/bind" CONFIG_PROTECT_MASK="/etc/ca-certificates.conf /etc/env.d /etc/eselect/postgresql /etc/gconf /etc/gentoo-release /etc/revdep-rebuild /etc/sandbox.d /etc/terminfo /etc/udev/rules.d" CXXFLAGS="-O2 -march=core2 -mtune=generic -pipe" DISTDIR="/var/portage/distfiles/" FEATURES="distcc distlocks fixpackages parallel-fetch protect-owned sandbox sfperms strict unmerge-orphans userfetch" GENTOO_MIRRORS="http://www.mirrorservice.org/sites/www.ibiblio.org/gentoo/ http://ftp.snt.utwente.nl/pub/os/linux/gentoo http://mirror.qubenet.net/mirror/gentoo/ http://mirror.ovh.net/gentoo-distfiles/" LDFLAGS="-Wl,-O1,--sort-common,--hash-style=gnu,--as-needed" MAKEOPTS="-j3" PKGDIR="/usr/portage/packages" PORTAGE_CONFIGROOT="/" PORTAGE_RSYNC_OPTS="--recursive --links --safe-links --perms --times --compress --force --whole-file --delete --stats --timeout=180 --exclude=/distfiles --exclude=/local --exclude=/packages" PORTAGE_TMPDIR="/var/tmp" PORTDIR="/var/portage/repos/gentoo/" PORTDIR_OVERLAY="/home/ant/public/git/ebuilds" SYNC="rsync://rsync.europe.gentoo.org/gentoo-portage" USE="aac acl alsa amd64 avahi bash-completion bzip2 cdparanoia cli cracklib crypt cups dbus dri faad fam fastcgi flac fortran iconv ipv6 isdnlog lame libnl lzma mmap mmx mp3 mudflap ncurses nls nptl nptlonly ogg openmp pch pcre perl php pic pppd python readline reflection replaygain session speex spl sqlite sse sse2 ssl ssse3 sysfs tcpd threads unicode vim-syntax vorbis xorg zlib" ALSA_CARDS="ali5451 als4000 atiixp atiixp-modem bt87x ca0106 cmipci emu10k1x ens1370 ens1371 es1938 es1968 fm801 hda-intel intel8x0 intel8x0m maestro3 trident usb-audio via82xx via82xx-modem ymfpci" ALSA_PCM_PLUGINS="adpcm alaw asym copy dmix dshare dsnoop empty extplug file hooks iec958 ioplug ladspa lfloat linear meter mmap_emul mulaw multi null plug rate route share shm softvol" APACHE2_MODULES="actions alias auth_basic authn_alias authn_anon authn_dbm authn_default authn_file authz_dbm authz_default authz_groupfile authz_host authz_owner authz_user autoindex cache dav dav_fs dav_lock deflate dir disk_cache env expires ext_filter file_cache filter headers include info log_config logio mem_cache mime mime_magic negotiation rewrite setenvif speling status unique_id userdir usertrack vhost_alias" ELIBC="glibc" INPUT_DEVICES="keyboard mouse evdev" KERNEL="linux" LCD_DEVICES="bayrad cfontz cfontz633 glk hd44780 lb216 lcdm001 mtxorb ncurses text" USERLAND="GNU" VIDEO_CARDS="fbdev glint intel mach64 mga neomagic nv r128 radeon savage sis tdfx trident vesa vga via vmware voodoo" Unset: CPPFLAGS, CTARGET, EMERGE_DEFAULT_OPTS, FFLAGS, INSTALL_MASK, LANG, LC_ALL, LINGUAS, PORTAGE_COMPRESS, PORTAGE_COMPRESS_FLAGS, PORTAGE_RSYNC_EXTRA_OPTS
You'll probably be glad to know that I think I have the same problem. I am also running hostapd 0.6.9 with p54usb. All is well on 2.6.30-gentoo-r6. I upgraded to 2.6.31 and encountered several seemingly random freezes before switching back. At the time, I put it down to fglrx but now I think about it, fglrx usually only freezes when you do something like start or stop X. I was experimenting with KMS yesterday, first with 2.6.31 and then with 2.6.32-rc3. After a few freezes, I disabled p54usb but KMS was also causing freezes so it was hard to tell whether this was still affecting 2.6.32 or not. I'm going to try KMS again when rc4 comes out so I'll also try and see whether this p54usb issue is fixed or not. Have you reported this anywhere else? It would probably be best to report this at bugzilla.kernel.org.
Since it works with 2.6.30 and not 2.6.31, I think it is safe to say, that this is a kernel driver bug, and not a bug in hostapd.
Can you try this patch: http://patchwork.kernel.org/patch/34618/ against a 2.6.31 kernel and see if the problem persists?
I've just tried gentoo-sources-2.6.31-r2 which apparently already has that patch applied; still crashes.
Let's follow this upstream bug report, for now, which seems to addressing the same issue as yours, Anthony: http://bugzilla.kernel.org/show_bug.cgi?id=13725
A user in http://bugzilla.kernel.org/show_bug.cgi?id=13725 reports the following: " I think this might be fixed now. I was experiencing it too but I've just been running yesterday's linux-next for a couple of hours, with a Wii and Vista both connecting, and there were no panics or freezes. " Care to test it?
Running 2.6.32-rc4-next-20091012 for 20 hours now. It hasn't crashed in that time and dmesg seems to be quiet again.
(In reply to comment #7) > Running 2.6.32-rc4-next-20091012 for 20 hours now. It hasn't crashed in that > time and dmesg seems to be quiet again. > Good to know. I'll be watching the upstream report for the specific commit that causes the issue and backport it for the gentoo-sources.
We'll watch the upstream bug and work to backport the fix if they identify it.
(In reply to comment #9) > We'll watch the upstream bug and work to backport the fix if they identify it. According to http://forums.gentoo.org/viewtopic-t-795441-highlight-.html the bug is fixed in linux-next, so a backport is appreciated because several users are hit by this problem.
If someone can identify the patch that acutally fixes the issue, we'll be happy to look into back-porting it.
(In reply to comment #11) > If someone can identify the patch that acutally fixes the issue, we'll be happy > to look into back-porting it. http://git.kernel.org/?p=linux/kernel/git/next/linux-next.git;a=commit;h=47ab3840a389ff1b9959734995123e5bc94c3443 This is the fix, please backport.
Did you apply this against 2.6.31-r6 and test?
This is probably not the patch, since this patch has been in the kernel since 2.6.31-rc3. http://git.kernel.org/?p=linux/kernel/git/stable/linux-2.6.31.y.git;a=commitdiff;h=47ab3840a389ff1b9959734995123e5bc94c3443
(In reply to comment #14) > This is probably not the patch, since this patch has been in the kernel since > 2.6.31-rc3. > > http://git.kernel.org/?p=linux/kernel/git/stable/linux-2.6.31.y.git;a=commitdiff;h=47ab3840a389ff1b9959734995123e5bc94c3443 Hm, it is said to fix the issue from upstream bug report. Let's hope for the best.
I don't think this is it either seeing as it happened to me in a later rc. The upstream bug actually says... "I am using 2.6.31-rc6-git7 as I saw patch from http://patchwork.kernel.org/patch/34618/ is already applied but the problem still occurs."