Summary: | >=x11-drivers/nvidia-drivers-270.41.03 - resizing window ejects nvidia module: kernel BUG at mm/slub.c:2747! invalid opcode: 0000 [#1] PREEMPT SMP | ||
---|---|---|---|
Product: | Gentoo Linux | Reporter: | Dmitry Suloev <SuloevDmitry> |
Component: | New packages | Assignee: | Doug Goldstein (RETIRED) <cardoe> |
Status: | RESOLVED FIXED | ||
Severity: | critical | CC: | dyle, heltem+gentoo, jer, kredba, Martin.vGagern, mitaspiotr, polidevk.polidevk, polynomial-c, spock, zzam |
Priority: | Normal | ||
Version: | unspecified | ||
Hardware: | AMD64 | ||
OS: | Linux | ||
URL: | http://www.nvnews.net/vbulletin/showthread.php?t=160115 | ||
Whiteboard: | |||
Package list: | Runtime testing required: | --- | |
Attachments: | Nvidia bug report |
Description
Dmitry Suloev
2011-04-18 11:33:30 UTC
------------[ cut here ]------------ kernel BUG at mm/slub.c:2747! invalid opcode: 0000 [#1] PREEMPT SMP last sysfs file: /sys/devices/virtual/bdi/0:17/uevent CPU 3 Modules linked in: fuse snd_seq_dummy snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss nvidia(P) ir_lirc_codec lirc_dev usb_storage ir_sony_decoder rc_pixelview_new tuner snd_hda_codec_realtek usb_libusual tuner_xc2028 ir_jvc_decoder ir_rc6_decoder snd_hda_intel snd_hda_codec 8250_pnp cx88_alsa snd_hwdep ir_rc5_decoder ohci_hcd snd_pcm cx8800 snd_timer ir_nec_decoder cx88xx asus_atk0110 snd rc_core k10temp ehci_hcd usbcore 8250 serial_core evdev tveeprom v4l2_common videobuf_dma_sg videodev videobuf_core v4l2_compat_ioctl32 hwmon btcx_risc snd_page_alloc nls_base unix Pid: 7482, comm: X Tainted: P 2.6.38-gentoo-r2 #1 System manufacturer System Product Name/M4A79T Deluxe RIP: 0010:[<ffffffff810e67bd>] [<ffffffff810e67bd>] kfree+0xad/0xb0 RSP: 0018:ffff880125489c88 EFLAGS: 00010246 RAX: ffffea0000000000 RBX: 0000000000000000 RCX: ffff8800cf68ad08 RDX: 0000000000000000 RSI: 000000000000001e RDI: ffff880000000000 RBP: ffff880000000000 R08: ffff8800cf68af08 R09: ffff8801276be540 R10: 0000000000000000 R11: 0000000000003246 R12: ffff880104250400 R13: 0000000000000004 R14: ffff880125438000 R15: 0000000000000004 FS: 00007fa644a24880(0000) GS:ffff8800cfcc0000(0000) knlGS:0000000000000000 CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b CR2: 00007fa63de0bbf8 CR3: 000000012a692000 CR4: 00000000000006e0 DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400 Process X (pid: 7482, threadinfo ffff880125488000, task ffff8801041a19c0) Stack: 0000000000000292 0000000000000000 ffff8800cf68ac98 ffffffffa074b278 ffff8801276c0010 ffffffffa05540dc ffff88012b005000 0000000000000001 ffff8800cf68af08 ffffffffa054c0fe ffff880125438000 ffff88012b005000 Call Trace: [<ffffffffa074b278>] ? _nv022923rm+0xe/0x13 [nvidia] [<ffffffffa05540dc>] ? _nv012667rm+0x47/0x7e [nvidia] [<ffffffffa054c0fe>] ? _nv012700rm+0x35/0xde [nvidia] [<ffffffffa054f5f3>] ? _nv013081rm+0xf4/0x388 [nvidia] [<ffffffffa0530884>] ? _nv013082rm+0xd44/0x10cd [nvidia] [<ffffffffa0530c98>] ? _nv013083rm+0x8b/0xb3 [nvidia] [<ffffffffa02706a8>] ? _nv002390rm+0x221/0x237 [nvidia] [<ffffffffa02665ca>] ? _nv002011rm+0x5b/0x94 [nvidia] [<ffffffffa0759745>] ? _nv002409rm+0x3e5/0x6a0 [nvidia] [<ffffffffa0755ed5>] ? rm_ioctl+0x2f/0x67 [nvidia] [<ffffffffa07716e2>] ? nv_kern_ioctl+0x162/0x460 [nvidia] [<ffffffffa0771a2c>] ? nv_kern_unlocked_ioctl+0x1c/0x30 [nvidia] [<ffffffff81101791>] ? do_vfs_ioctl+0x91/0x510 [<ffffffff81102353>] ? poll_select_copy_remaining+0x103/0x150 [<ffffffff81101ca8>] ? sys_ioctl+0x98/0xa0 [<ffffffff81002d7b>] ? system_call_fastpath+0x16/0x1b Code: c3 48 89 ea 48 89 c6 e8 12 fc ff ff eb ea 48 f7 00 00 c0 00 00 74 14 48 83 c4 08 48 89 c7 5b 5d e9 d9 d4 fc ff 48 8b 40 10 eb 8f <0f> 0b 90 53 48 8d 5f 90 48 8b 7f e8 e8 42 ff ff ff 48 89 df 5b RIP [<ffffffff810e67bd>] kfree+0xad/0xb0 RSP <ffff880125489c88> ---[ end trace a3720255180ace0a ]--- I get this message in /var/log/Xorg.0.log: [ 789.872] (EE) NVIDIA(0): Failed to allocate 2D engine [ 789.872] (EE) NVIDIA(0): *** Aborting *** [ 789.872] (EE) NVIDIA(0): Failed to allocate 2D objects [ 789.872] (EE) NVIDIA(0): *** Aborting *** [ 789.872] (EE) NVIDIA(0): Error recovery failed. [ 789.872] (EE) NVIDIA(0): *** Aborting *** Please post your `emerge --info' output too. dos ~ # emerge --info Portage 2.1.9.46 (default/linux/amd64/10.0/no-multilib, gcc-4.5.2, glibc-2.13-r2, 2.6.38-gentoo-r2 x86_64) ================================================================= System uname: Linux-2.6.38-gentoo-r2-x86_64-AMD_Phenom-tm-_II_X4_965_Processor-with-gentoo-2.0.2 Timestamp of tree: Mon, 18 Apr 2011 09:30:01 +0000 app-shells/bash: 4.2_p8 dev-lang/python: 2.7.1-r1 dev-util/cmake: 2.8.4 sys-apps/baselayout: 2.0.2 sys-apps/openrc: 0.8.2 sys-apps/sandbox: 2.5 sys-devel/autoconf: 2.13, 2.68 sys-devel/automake: 1.9.6-r3, 1.10.3, 1.11.1-r1 sys-devel/binutils: 2.21 sys-devel/gcc: 4.5.2 sys-devel/gcc-config: 1.4.1 sys-devel/libtool: 2.4-r1 sys-devel/make: 3.82 sys-kernel/linux-headers: 2.6.38 virtual/os-headers: 0 ACCEPT_KEYWORDS="amd64 ~amd64" ACCEPT_LICENSE="* -@EULA AdobeFlash-10.1 PUEL skype-eula" CBUILD="x86_64-pc-linux-gnu" CFLAGS="-O2 -pipe -march=native" CHOST="x86_64-pc-linux-gnu" CONFIG_PROTECT="/etc" CONFIG_PROTECT_MASK="/etc/ca-certificates.conf /etc/env.d /etc/fonts/fonts.conf /etc/gconf /etc/gentoo-release /etc/revdep-rebuild /etc/sandbox.d /etc/terminfo" CXXFLAGS="-O2 -pipe -march=native" DISTDIR="/usr/portage/distfiles" FEATURES="assume-digests binpkg-logs distlocks fixlafiles fixpackages news parallel-fetch protect-owned sandbox sfperms strict unknown-features-warn unmerge-logs unmerge-orphans userfetch" FFLAGS="" GENTOO_MIRRORS="http://distfiles.gentoo.org" LANG="ru_RU.UTF-8" LC_ALL="" LDFLAGS="-Wl,-O1 -Wl,--as-needed" LINGUAS="ru" MAKEOPTS="-j5" PKGDIR="/usr/portage/packages" PORTAGE_CONFIGROOT="/" PORTAGE_RSYNC_OPTS="--recursive --links --safe-links --perms --times --compress --force --whole-file --delete --stats --timeout=180 --exclude=/distfiles --exclude=/local --exclude=/packages" PORTAGE_TMPDIR="/var/tmp" PORTDIR="/usr/portage" PORTDIR_OVERLAY="/var/lib/layman/x11 /var/lib/layman/sunrise" SYNC="rsync://rsync.gentoo.org/gentoo-portage" USE="3dnow 3dnowext X acl acpi alsa amd64 apng berkdb bzip2 cairo cli consolekit cracklib cxx dbus device-mapper dga djvu dri enca encode exif extras fortran fuse gdbm git graphite gtk hotkeys iconv icu idle ip1500 jpeg jpeg2k lastfm libnotify libsamplerate mad mercurial mmx mmxext modules mp3 mudflap ncurses nls nptl nptlonly nsplugin ogg opencl opengl openmp pam pcre png policykit pppd radio readline secure-delete session smp sqlite srv sse sse2 sse3 ssl ssse3 startup-notification subversion svg sysfs tcpd threads tiff truetype udev unicode usb v4l v4l2 v8 vdpau vorbis xcb xorg xvmc zlib" ALSA_CARDS="ali5451 als4000 atiixp atiixp-modem bt87x ca0106 cmipci emu10k1x ens1370 ens1371 es1938 es1968 fm801 hda-intel intel8x0 intel8x0m maestro3 trident usb-audio via82xx via82xx-modem ymfpci" ALSA_PCM_PLUGINS="adpcm alaw asym copy dmix dshare dsnoop empty extplug file hooks iec958 ioplug ladspa lfloat linear meter mmap_emul mulaw multi null plug rate route share shm softvol" APACHE2_MODULES="actions alias auth_basic authn_alias authn_anon authn_dbm authn_default authn_file authz_dbm authz_default authz_groupfile authz_host authz_owner authz_user autoindex cache cgi cgid dav dav_fs dav_lock deflate dir disk_cache env expires ext_filter file_cache filter headers include info log_config logio mem_cache mime mime_magic negotiation rewrite setenvif speling status unique_id userdir usertrack vhost_alias" CAMERAS="ptp2" COLLECTD_PLUGINS="df interface irq load memory rrdtool swap syslog" ELIBC="glibc" GPSD_PROTOCOLS="ashtech aivdm earthmate evermore fv18 garmin garmintxt gpsclock itrax mtk3301 nmea ntrip navcom oceanserver oldstyle oncore rtcm104v2 rtcm104v3 sirf superstar2 timing tsip tripmate tnt ubx" INPUT_DEVICES="keyboard mouse evdev" KERNEL="linux" LCD_DEVICES="bayrad cfontz cfontz633 glk hd44780 lb216 lcdm001 mtxorb ncurses text" LINGUAS="ru" LIRC_DEVICES="devinput" PHP_TARGETS="php5-3" RUBY_TARGETS="ruby18" SANE_BACKENDS="genesys" USERLAND="GNU" VIDEO_CARDS="nvidia" XTABLES_ADDONS="quota2 psd pknock lscan length2 ipv4options ipset ipp2p iface geoip fuzzy condition tee tarpit sysrq steal rawnat logmark ipmark dhcpmac delude chaos account" Unset: CPPFLAGS, CTARGET, EMERGE_DEFAULT_OPTS, INSTALL_MASK, PORTAGE_BUNZIP2_COMMAND, PORTAGE_COMPRESS, PORTAGE_COMPRESS_FLAGS, PORTAGE_RSYNC_EXTRA_OPTS New details in /var/log/Xorg.0.log: [ 184.265] (WW) NVIDIA(0): WAIT (2, 6, 0x8000, 0x00006b64, 0x00007a0c) [ 185.100] (WW) NVIDIA(0): WAIT (0, 6, 0x8000, 0x00007a0c, 0x00007a0c) [ 791.565] (EE) NVIDIA(0): Failed to allocate 2D engine [ 791.565] (EE) NVIDIA(0): *** Aborting *** [ 791.565] (EE) NVIDIA(0): Failed to allocate 2D objects [ 791.565] (EE) NVIDIA(0): *** Aborting *** [ 791.565] (EE) NVIDIA(0): Error recovery failed. [ 791.565] (EE) NVIDIA(0): *** Aborting *** [ 795.540] [mi] EQ overflowing. The server is probably stuck in an infinite loop. [ 795.540] Backtrace: [ 795.557] 0: /usr/bin/X (xorg_backtrace+0x28) [0x4a12a8] [ 795.557] 1: /usr/bin/X (mieqEnqueue+0x1c3) [0x4a0823] [ 795.557] 2: /usr/bin/X (xf86PostMotionEventM+0x96) [0x47d9b6] [ 795.557] 3: /usr/bin/X (xf86PostMotionEvent+0xd6) [0x47dbd6] [ 795.557] 4: /usr/lib64/xorg/modules/input/mouse_drv.so (0x7f9049fba000+0x711f) [0x7f9049fc111f] [ 795.557] 5: /usr/lib64/xorg/modules/input/mouse_drv.so (0x7f9049fba000+0x7886) [0x7f9049fc1886] [ 795.557] 6: /usr/lib64/xorg/modules/input/mouse_drv.so (0x7f9049fba000+0x552e) [0x7f9049fbf52e] [ 795.557] 7: /usr/bin/X (0x400000+0x6b678) [0x46b678] [ 795.557] 8: /usr/bin/X (0x400000+0x118f06) [0x518f06] [ 795.557] 9: /lib64/libpthread.so.0 (0x7f904f483000+0xf4f0) [0x7f904f4924f0] [ 795.557] 10: /usr/lib64/xorg/modules/drivers/nvidia_drv.so (0x7f904a3cc000+0x768e0) [0x7f904a4428e0] [ 795.557] 11: /usr/lib64/xorg/modules/drivers/nvidia_drv.so (0x7f904a3cc000+0x78520) [0x7f904a444520] [ 795.557] 12: /usr/lib64/xorg/modules/drivers/nvidia_drv.so (0x7f904a3cc000+0xde549) [0x7f904a4aa549] [ 795.557] 13: /usr/lib64/xorg/modules/drivers/nvidia_drv.so (0x7f904a3cc000+0x45cf92) [0x7f904a828f92] [ 795.557] 14: /usr/lib64/xorg/modules/drivers/nvidia_drv.so (0x7f904a3cc000+0x45d8d5) [0x7f904a8298d5] [ 795.557] 15: /usr/lib64/xorg/modules/drivers/nvidia_drv.so (0x7f904a3cc000+0x45da7d) [0x7f904a829a7d] [ 795.557] 16: /usr/bin/X (0x400000+0xdb12e) [0x4db12e] [ 795.557] 17: /usr/bin/X (0x400000+0xb5d28) [0x4b5d28] [ 795.557] 18: /usr/bin/X (0x400000+0xb6ef5) [0x4b6ef5] [ 795.557] 19: /usr/bin/X (0x400000+0x303a9) [0x4303a9] [ 795.557] 20: /usr/bin/X (0x400000+0x2488d) [0x42488d] [ 795.557] 21: /lib64/libc.so.6 (__libc_start_main+0xfd) [0x7f904e3cfebd] [ 795.557] 22: /usr/bin/X (0x400000+0x243e9) [0x4243e9] Same problem with x11-drivers/nvidia-drivers-270.41.06 Created attachment 270759 [details]
Nvidia bug report
Problem resolved by downgrade X-server 1.10.1 to 1.9.5 Resizing kde-base/konsole-4.6.2 sometimes makes my X unresponsive. Had a look at Xorg.0.log today, which looks a lot like comment #5, although some addresses are different. Maybe because I use evdev not mouse driver. [ 45628.797] (EE) NVIDIA(0): Failed to allocate 2D engine [ 45628.846] (EE) NVIDIA(0): *** Aborting *** [ 45628.847] (EE) NVIDIA(0): Failed to allocate 2D objects [ 45628.847] (EE) NVIDIA(0): *** Aborting *** [ 45628.847] (EE) NVIDIA(0): Error recovery failed. [ 45628.847] (EE) NVIDIA(0): *** Aborting *** [ 45631.850] (WW) NVIDIA(0): WAIT (2, 6, 0x8000, 0x00000000, 0x00000920) [ 45638.850] (WW) NVIDIA(0): WAIT (1, 6, 0x8000, 0x00000000, 0x00000920) [ 45641.851] (WW) NVIDIA(0): WAIT (2, 6, 0x8000, 0x00000000, 0x00001228) [ 45648.851] (WW) NVIDIA(0): WAIT (1, 6, 0x8000, 0x00000000, 0x00001228) [ 45730.612] [mi] EQ overflowing. The server is probably stuck in an infinite loop. [ 45730.630] Backtrace: [ 45730.782] 0: /usr/bin/X (xorg_backtrace+0x28) [0x4a3448] [ 45730.782] 1: /usr/bin/X (mieqEnqueue+0x1c3) [0x4a29c3] [ 45730.782] 2: /usr/bin/X (xf86PostMotionEventM+0x96) [0x47fb56] [ 45730.782] 3: /usr/bin/X (xf86PostMotionEventP+0x31) [0x47fc51] [ 45730.782] 4: /usr/lib64/xorg/modules/input/evdev_drv.so (0x7fe18e252000+0x4866) [0x7fe18e256866] [ 45730.783] 5: /usr/bin/X (0x400000+0x6d818) [0x46d818] [ 45730.783] 6: /usr/bin/X (0x400000+0x11b0a6) [0x51b0a6] [ 45730.783] 7: /lib64/libpthread.so.0 (0x3889800000+0xf4f0) [0x388980f4f0] [ 45730.783] 8: /usr/lib64/xorg/modules/drivers/nvidia_drv.so (0x7fe18ed09000+0x768e0) [0x7fe18ed7f8e0] [ 45730.783] 9: /usr/lib64/xorg/modules/drivers/nvidia_drv.so (0x7fe18ed09000+0x78520) [0x7fe18ed81520] [ 45730.783] 10: /usr/lib64/xorg/modules/drivers/nvidia_drv.so (0x7fe18ed09000+0xdb305) [0x7fe18ede4305] [ 45730.783] 11: /usr/lib64/xorg/modules/drivers/nvidia_drv.so (0x7fe18ed09000+0xdefb3) [0x7fe18ede7fb3] [ 45730.783] 12: /usr/lib64/xorg/modules/drivers/nvidia_drv.so (0x7fe18ed09000+0x60f9d) [0x7fe18ed69f9d] [ 45730.783] 13: /usr/bin/X (0x400000+0x13de7d) [0x53de7d] [ 45730.783] 14: /usr/bin/X (0x400000+0x13c8e0) [0x53c8e0] [ 45730.783] 15: /usr/bin/X (miPointerUpdateSprite+0x21a) [0x45c8da] [ 45730.783] 16: /usr/bin/X (0x400000+0x5cb0a) [0x45cb0a] [ 45730.783] 17: /usr/bin/X (0x400000+0xa9483) [0x4a9483] [ 45730.783] 18: /usr/bin/X (0x400000+0x165f73) [0x565f73] [ 45730.783] 19: /usr/bin/X (0x400000+0x38e1a) [0x438e1a] [ 45730.783] 20: /usr/bin/X (0x400000+0x3bee8) [0x43bee8] [ 45730.783] 21: /usr/bin/X (WindowsRestructured+0x41) [0x43c101] [ 45730.783] 22: /usr/bin/X (MapWindow+0x145) [0x456e85] [ 45730.783] 23: /usr/bin/X (ReparentWindow+0x275) [0x458655] [ 45730.783] 24: /usr/bin/X (HandleSaveSet+0x81) [0x458741] [ 45730.783] 25: /usr/bin/X (FreeClientResources+0x1f) [0x44e6df] [ 45730.783] 26: /usr/bin/X (CloseDownClient+0x5a) [0x430b7a] [ 45730.783] 27: /usr/bin/X (0x400000+0x31657) [0x431657] [ 45730.783] 28: /usr/bin/X (0x400000+0x25bcd) [0x425bcd] [ 45730.783] 29: /lib64/libc.so.6 (__libc_start_main+0xfd) [0x3888c1ecdd] [ 45730.783] 30: /usr/bin/X (0x400000+0x25729) [0x425729] How did you obtain the trace from comment #1? This is from dmesg: [45556.472339] NVRM: Xid (0000:01:00): 13, 0001 00000000 00005097 000015e0 00000000 00000100 [45556.708901] NVRM: Xid (0000:01:00): 13, 0001 00000000 00005097 000015e0 00000000 00000100 I guess I'll downgrade as suggested in comment #8, see if that helps me as well. Just an idea: perhaps https://bugs.kde.org/show_bug.cgi?id=227246 is related in some strange way: that too is about konsole resizing and nvidia hardware. That only makes things slow, though, whereas this here crashes X. But the message does talk about a possibly infinite loop, and the slowness might be due to a long loop, so there might be some relation after all, at least for konsole. (In reply to comment #9) This bug is not specific for konsole. The same thing happens when you resize the window lxterminal. (In reply to comment #10) > This bug is not specific for konsole. The same thing happens when you resize > the window lxterminal. I realized that, but perhaps something similar might be going on behind the scenes. Could this issue perhaps be specific to transparent windows? Or is your lxterminal opaque? (In reply to comment #9) > How did you obtain the trace from comment #1? I'm still curious. >Could this issue perhaps be specific to transparent windows?
This crash happens even when compositing is disabled.
Have you submitted these crashes to linux-bugs@nvidia.com ? I get locks up as well with this driver and 270.41.06 on xorg-server 1.10 on kernel 2.6.38. I can't report what the error even is as the system requires a hard shutdown. No ssh either. Frustrating. Disabling composititng and desktop effects doesn't help. It seems to happen on resizing of konsole but it did it with firefox too. Mouse input in general seems to initiate the problem. The nv driver seems to be working fine the last few days. Likewise going to downgrade to xorg-server 1.9.5 After switching to gentoo-sources-2.6.39 problem still occurs. (In reply to comment #9) > [ 45730.612] [mi] EQ overflowing. The server is probably stuck in an infinite > loop. Had this thingy again today. Pressed ESC to exit full-screen flash (32 bit flash 10.3.181.14-r1 running using nspluginwrapper), then used mouse to re-enable full-screen. The result was a white screen with a responsive arrow mouse cursor, but little else, in particular no keyboard access. > I guess I'll downgrade as suggested in comment #8, see if that helps me as > well. This IS xorg-server-1.9.5, so that didn't solve all the problems related to this "[mi] EQ overflowing." message. Does that make it a separate bug? (In reply to comment #13) > Have you submitted these crashes to linux-bugs@nvidia.com ? There are some threads there discussing similar issues: http://www.nvnews.net/vbulletin/showthread.php?t=157499 (the [mi] error line) http://www.nvnews.net/vbulletin/showthread.php?t=161664 (flash & nvidia) So nvidia guys should know about this issue, I think. I saved the nvidia-bug-report output nevertheless, but haven't published it anywhere yet. There is also at least one bug report about this for xorg: https://bugs.freedesktop.org/show_bug.cgi?id=32242 I'm considering using nuveau instead of closed source nvidia-drivers. Particularly after encountering 367795 as well. *** Bug 368711 has been marked as a duplicate of this bug. *** This bug was fixed in nvidia-drivers-275.09.04. (In reply to comment #18) > This bug was fixed in nvidia-drivers-275.09.04. But it isn't in the tree yet. If nvidia-drivers-275.09.07, which are in the tree now, resolve this issue, we should close this bug. Thanks. |