Summary: | Please mark =sys-kernel/gentoo-sources-2.6.30-r4 and =sys-kernel/vanilla-sources-2.6.30.3 stable | ||
---|---|---|---|
Product: | Gentoo Linux | Reporter: | Mike Pagano <mpagano> |
Component: | New packages | Assignee: | Gentoo Kernel Bug Wranglers and Kernel Maintainers <kernel> |
Status: | RESOLVED FIXED | ||
Severity: | normal | CC: | axiator, please.no.spam.here, webmaster |
Priority: | High | Keywords: | STABLEREQ |
Version: | unspecified | ||
Hardware: | All | ||
OS: | Linux | ||
Whiteboard: | |||
Package list: | Runtime testing required: | --- | |
Bug Depends on: | 272676, 280244 | ||
Bug Blocks: | |||
Attachments: | Likely possible culprit of kernel 2.6.30 hard freeze |
Description
Mike Pagano
2009-07-30 13:53:54 UTC
===AMD64 AT REPORT=== sys-kernel/gentoo-sources-2.6.30-r4 works ok here with, lvm2, raid and nvidia-drivers. (I've running this kernel for more than 24 hours). emerge --info: Portage 2.1.6.13 (default/linux/amd64/2008.0, gcc-4.3.2, glibc-2.9_p20081201-r2, 2.6.30-gentoo-r4 x86_64) ================================================================= System uname: Linux-2.6.30-gentoo-r4-x86_64-AMD_Athlon-tm-_64_Processor_3200+-with-gentoo-2.0.1 Timestamp of tree: Fri, 31 Jul 2009 16:00:01 +0000 app-shells/bash: 3.2_p39 dev-java/java-config: 2.1.8-r1 dev-lang/python: 2.5.4-r3, 2.6.2-r1 dev-util/cmake: 2.6.4 sys-apps/baselayout: 2.0.1 sys-apps/openrc: 0.4.3-r3 sys-apps/sandbox: 1.6-r2 sys-devel/autoconf: 2.63-r1 sys-devel/automake: 1.7.9-r1, 1.9.6-r2, 1.10.2 sys-devel/binutils: 2.18-r3 sys-devel/gcc-config: 1.4.1 sys-devel/libtool: 2.2.6a virtual/os-headers: 2.6.27-r2 ACCEPT_KEYWORDS="amd64" CBUILD="x86_64-pc-linux-gnu" CFLAGS="-march=native -O2 -pipe" CHOST="x86_64-pc-linux-gnu" CONFIG_PROTECT="/etc" CONFIG_PROTECT_MASK="/etc/ca-certificates.conf /etc/env.d /etc/env.d/java/ /etc/fonts/fonts.conf /etc/gconf /etc/gentoo-release /etc/revdep-rebuild /etc/sandbox.d /etc/terminfo /etc/udev/rules.d" CXXFLAGS="-march=native -O2 -pipe" DISTDIR="/usr/portage/distfiles" FEATURES="distlocks fixpackages parallel-fetch protect-owned sandbox sfperms strict unmerge-orphans userfetch" GENTOO_MIRRORS=" ftp://ftp.free.fr/mirrors/ftp.gentoo.org/ ftp://ftp.tu-clausthal.de/pub/linux/gentoo/ ftp://sunsite.informatik.rwth-aachen.de/pub/Linux/gentoo " LDFLAGS="-Wl,-O1" LINGUAS="en" MAKEOPTS="-j2" PKGDIR="/usr/portage/packages" PORTAGE_CONFIGROOT="/" PORTAGE_RSYNC_OPTS="--recursive --links --safe-links --perms --times --compress --force --whole-file --delete --stats --timeout=180 --exclude=/distfiles --exclude=/local --exclude=/packages" PORTAGE_TMPDIR="/mnt/shmfs" PORTDIR="/usr/portage" PORTDIR_OVERLAY="/usr/portage/local/layman/desktop-effects /usr/portage/local/layman/sunrise /usr/local/portage /var/vquicksilver-overlay" SYNC="rsync://rsync.gentoo.org/gentoo-portage" USE="3dnow 3dnowext X a52 aac acl acpi alsa amd64 bzip2 cairo caps cleartype cli consolekit crypt dbus dga dri dv dvb exif fam ffmpeg flac gdbm glitz gmp gnutls gpm gstreamer gtk hal hddtemp iconv ieee1394 ipv6 isdnlog ithreads joystick jpeg lcms libnotify lm_sensors mad midi mmx mmxext mng mp3 mpeg mudflap multilib ncurses network nls nptl nptlonly ogg opengl openmp pam pcre perl png pppd pulseaudio python readline reflection sdl session spell spl sse sse2 ssl startup-notification svg sysfs tcpd theora threads tiff unicode vorbis wifi xcb xorg zlib" ALSA_CARDS="emu10k1" ALSA_PCM_PLUGINS="adpcm alaw asym copy dmix dshare dsnoop empty extplug file hooks iec958 ioplug ladspa lfloat linear meter mmap_emul mulaw multi null plug rate route share shm softvol" APACHE2_MODULES="actions alias auth_basic authn_alias authn_anon authn_dbm authn_default authn_file authz_dbm authz_default authz_groupfile authz_host authz_owner authz_user autoindex cache dav dav_fs dav_lock deflate dir disk_cache env expires ext_filter file_cache filter headers include info log_config logio mem_cache mime mime_magic negotiation rewrite setenvif speling status unique_id userdir usertrack vhost_alias" ELIBC="glibc" INPUT_DEVICES="evdev keyboard mouse" KERNEL="linux" LCD_DEVICES="bayrad cfontz cfontz633 glk hd44780 lb216 lcdm001 mtxorb ncurses text" LINGUAS="en" USERLAND="GNU" VIDEO_CARDS="nvidia" Unset: CPPFLAGS, CTARGET, EMERGE_DEFAULT_OPTS, FFLAGS, INSTALL_MASK, LANG, LC_ALL, PORTAGE_COMPRESS, PORTAGE_COMPRESS_FLAGS, PORTAGE_RSYNC_EXTRA_OPTS arm/ia64/sh stable amd64 stable for both, thanks Victor (also proxy commit for gentoofan23) x86 stable Both stable for HPPA. ppc stable ppc64 done Could someone please test this on sparc? If people specify "data-ordered" within /etc/fstab /dev/sda2 / ext3 noatime,data=ordered 0 0 And their kernel config for 2.6.30 has: # CONFIG_EXT3_DEFAULTS_TO_ORDERED is not set The system will not boot if their root is on this ext3fs. (Also, I'll try later tonight without loading (the latest) nvidia-drivers-185.18.31 binary driver, but I'm getting system hard locks. Due to the lack of info in the logs, I can likely guess it's an nvidia issue. Will follow-up by tomorrow morning if I can.) I've got my box hooked via serial cables as 2.6.30 froze again last night w/o nvidia driver loaded. Right now, my bet is on a corrupted file or file system error, which in turn was caused by booting 2.6.30 with ext3 option "data=ordered" in fstab. To me, it's something that seems conflicting. I'll leave it on again tonight and hopefully I'll catch something via serial cable. And then reload the binary nvidia driver later. I'll post back w/i the next few days to confirm whether I still have uptime, or am still freezing, along with a more conclusive cause. I think I've got this hard freeze somewhat nailed down. The later releases of 2.6.29 kernel versions included some heavy patching to e100.c which initially broke wake on lan function. Then things got really bad with further heavier patching to e100.c (from memory, a lot of eprom stuff). Now, 2.6.30 e100.c totally borks the kernel from apparently this dmesg logging I got just prior to a hard kernel freeze with nvidia binary driver unloaded, using Xorg nv driver. Created attachment 201292 [details]
Likely possible culprit of kernel 2.6.30 hard freeze
Likely possible culprit of kernel 2.6.30 hard freeze.
The severity of this bug, or any of it's errors are seen in <=2.6.29.
I'm looking specifically at the unusual presence of the following:
e100: eth0 NIC Link is Down
Aug 14 16:52:35 localhost2 e100: eth0 NIC Link is Up 100 Mbps Half Duplex
Aug 14 16:53:03 localhost2 e100: eth0 NIC Link is Down
Aug 14 16:53:07 localhost2 e100: eth0 NIC Link is Up 100 Mbps Half Duplex
Aug 14 17:01:02 localhost2 e100: eth0 NIC Link is Down
If I find this is the culprit, I'm going to be ticked at the person submitting unstable code to the e100.c file! ... then again, not like I'm submitting any. :-/
btw, this is a hard freeze. Even console via serial is frozen. I had 2.6.30-r4 up for 16+ hours overnight and today, while compiling e100.c as module and then blacklisting it. Of course this omitted starting a lot of init level services, but due to all the unstable patches submitted to this driver since the beginning of 2.6.29, I'm pretty certain at this point the cause for all of my hard system freezes were due to e100.c. I could likely easily copy in 2.6.29 e100.c into 2.6.30, but I don't have anymore time until winter to further debug. Another issue I ran into with 2.6.30, is auto power-off. Or something init wise is hanging power off. All this, including the trivial awkwardness of 2.6.30 not being able to boot with data=ordered in fstab (with ordered ext3 feature turned-off in the kernel) makes it extremely unstable. But in another sense, when it was totally unstable on my i440bx platforms here, it appears stable on my i815 laptop platform. Which leaves me guessing if there aren't any chipset/pci bugs not found yet? Can't be done on sparc due to bug 272676 g-s 2.6.30-r4 was already stable on alpha and v-s 30.3 now is. sparc stable, closing |