For the past few days, whenever I emerge a big ebuild such as X11, I get messages like the following in /var/log/messages, followed by an automatic reboot.

This kernel worked perfectly for months, but I changed my system last week:
- Installed a new 250 GB SATA disk
- Moved my system to that new disk
- Started using LVM2 and ReiserFS to hold /usr, /var, /opt, /home and /portage (the portage tree) in separate logical volumes. Neither device-mapper nor ReiserFS was in use before I moved my system.
- "/" and /boot are still on EXT3

As you can see from the messages, "count" has a strange value, which in hex is 0xFF800000. That does not look like a random bit flip. I also ran memtest, which found no errors.

Let me know if you need any other information. For now, I will switch back to the system on the old disk to see whether I can trigger the problem without LVM2 and ReiserFS.

/var/log/messages:

Bad page state at prep_new_page (in process 'ld', page ffff8100019c5ae0)
flags:0x4000000000000004 mapping:0000000000000000 mapcount:0 count:-8388608
Backtrace:
Call Trace:
  <ffffffff80155511>{bad_page+113}
  <ffffffff80156021>{buffered_rmqueue+609}
  <ffffffff801562b3>{__alloc_pages+243}
  <ffffffff80162447>{do_no_page+279}
  <ffffffff801544a8>{__generic_file_aio_read+424}
  <ffffffff801629a9>{__handle_mm_fault+425}
  <ffffffff8015465b>{generic_file_read+187}
  <ffffffff803e2d56>{do_page_fault+998}
  <ffffffff80146000>{autoremove_wake_function+0}
  <ffffffff801735e4>{vfs_read+308}
  <ffffffff8010e349>{error_exit+0}
Trying to fix it up, but a reboot is needed

Bad page state at prep_new_page (in process 'ld.so', page ffff810001d24f30)
flags:0x4000000000000004 mapping:0000000000000000 mapcount:0 count:-8388608
Backtrace:
Call Trace:
  <ffffffff80155511>{bad_page+113}
  <ffffffff80156021>{buffered_rmqueue+609}
  <ffffffff801562b3>{__alloc_pages+243}
  <ffffffff80162447>{do_no_page+279}
  <ffffffff801629a9>{__handle_mm_fault+425}
  <ffffffff801600f6>{free_pgtables+102}
  <ffffffff803e2d56>{do_page_fault+998}
  <ffffffff80167b83>{do_mremap+1523}
  <ffffffff8010e349>{error_exit+0}
Trying to fix it up, but a reboot is needed

Bad page state at prep_new_page (in process 'pdflush', page ffff8100014c1970)
flags:0x4000000000000004 mapping:0000000000000000 mapcount:0 count:-8388608
Backtrace:
Call Trace:
  <ffffffff80155511>{bad_page+113}
  <ffffffff80156021>{buffered_rmqueue+609}
  <ffffffff801562b3>{__alloc_pages+243}
  <ffffffff80159a73>{cache_alloc_refill+675}
  <ffffffff801593db>{kmem_cache_alloc+43}
  <ffffffff80154b39>{mempool_alloc+57}
  <ffffffff80146000>{autoremove_wake_function+0}
  <ffffffff884ff8cc>{:dm_mod:dm_request+172}
  <ffffffff802c38eb>{generic_make_request+539}
  <ffffffff801785df>{bio_alloc_bioset+47}
  <ffffffff801787b0>{bio_alloc+16}
  <ffffffff8017631f>{submit_bh+159}
  <ffffffff8852a5c0>{:reiserfs:write_ordered_chunk+0}
  <ffffffff8852a617>{:reiserfs:write_ordered_chunk+87}
  <ffffffff8852a6c6>{:reiserfs:add_to_chunk+134}
  <ffffffff8852aaf1>{:reiserfs:write_ordered_buffers+321}
  <ffffffff8852b1ef>{:reiserfs:flush_commit_list+415}
  <ffffffff8852f127>{:reiserfs:do_journal_end+3063}
  <ffffffff801459f0>{keventd_create_kthread+0}
  <ffffffff8851e4b0>{:reiserfs:reiserfs_sync_fs+64}
  <ffffffff8017a3a4>{sync_supers+180}
  <ffffffff80157a7a>{wb_kupdate+42}
  <ffffffff801586af>{pdflush+383}
  <ffffffff80157a50>{wb_kupdate+0}
  <ffffffff801459f0>{keventd_create_kthread+0}
  <ffffffff80158530>{pdflush+0}
  <ffffffff801459ad>{kthread+205}
  <ffffffff8010e4fe>{child_rip+8}
  <ffffffff801459f0>{keventd_create_kthread+0}
  <ffffffff801458e0>{kthread+0}
  <ffffffff8010e4f6>{child_rip+0}
Trying to fix it up, but a reboot is needed

# uname -a
Linux belldandy 2.6.14.2 #1 PREEMPT Sun Nov 13 16:09:07 CET 2005 x86_64 AMD Athlon(tm) 64 Processor 3000+ GNU/Linux

# emerge --info
Portage 2.0.54 (default-linux/amd64/2005.0, gcc-3.4.5, glibc-2.3.5-r2, 2.6.14.2 x86_64)
=================================================================
System uname: 2.6.14.2 x86_64 AMD Athlon(tm) 64 Processor 3000+
Gentoo Base System version 1.6.14
dev-lang/python: 2.3.5-r2, 2.4.2
dev-python/pycrypto: [Not Present]
dev-util/ccache: [Not Present]
dev-util/confcache: [Not Present]
sys-apps/sandbox: 1.2.12
sys-devel/autoconf: 2.13, 2.59-r7
sys-devel/automake: 1.4_p6, 1.5, 1.6.3, 1.7.9-r1, 1.8.5-r3, 1.9.6-r1
sys-devel/binutils: 2.16.1
sys-devel/libtool: 1.5.22
virtual/os-headers: 2.6.11-r2
ACCEPT_KEYWORDS="amd64"
AUTOCLEAN="yes"
CBUILD="x86_64-pc-linux-gnu"
CFLAGS="-O3 -pipe -fomit-frame-pointer"
CHOST="x86_64-pc-linux-gnu"
CONFIG_PROTECT="/etc /usr/kde/2/share/config /usr/kde/3.3/env /usr/kde/3.3/share/config /usr/kde/3.3/shutdown /usr/kde/3.4/env /usr/kde/3.4/share/config /usr/kde/3.4/shutdown /usr/kde/3/share/config /usr/lib/X11/xkb /usr/share/config /usr/share/texmf/dvipdfm/config/ /usr/share/texmf/dvips/config/ /usr/share/texmf/tex/generic/config/ /usr/share/texmf/tex/platex/config/ /usr/share/texmf/xdvi/ /var/qmail/control"
CONFIG_PROTECT_MASK="/etc/eselect/compiler /etc/gconf /etc/terminfo /etc/env.d"
CXXFLAGS="-O3 -pipe -fomit-frame-pointer"
DISTDIR="/usr/portage/distfiles"
FEATURES="autoconfig distlocks sandbox sfperms strict"
GENTOO_MIRRORS="http://mirror.uni-c.dk/gentoo/"
LANG="fr_FR@euro"
MAKEOPTS="-j2"
PKGDIR="/usr/portage/packages"
PORTAGE_TMPDIR="/var/tmp"
PORTDIR="/portage"
SYNC="rsync://rsync.europe.gentoo.org/gentoo-portage"
USE="amd64 X alsa arts audiofile avi bash-completion berkdb bitmap-fonts bonobo bzip2 cdr cli crypt cups curl dri eds emboss encode esd exif expat fam ffmpeg flac foomaticdb fortran gd gdbm gif glut gnome gphoto2 gpm gstreamer gtk gtk2 gtkhtml idn imagemagick imlib ipv6 isdnlog java jpeg junit kde lcms ldap libwww lzw lzw-tiff mad matroska mhash mng mp3 mpeg mysql ncurses nls ogg opengl pam pcre pdflib perl php png postgres pppd python qt quicktime readline reflection samba sdl session spell spl ssl tcltk tcpd tetex tiff truetype truetype-fonts type1-fonts udev usb vorbis xine xml xml2 xmms xorg xpm xv xvid zlib userland_GNU kernel_linux elibc_glibc"
Unset: ASFLAGS, CTARGET, INSTALL_MASK, LC_ALL, LDFLAGS, LINGUAS, PORTAGE_RSYNC_EXTRA_OPTS, PORTAGE_RSYNC_OPTS, PORTDIR_OVERLAY
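
The relation between the logged count of -8388608 and the hex value 0xFF800000 can be checked with a short sketch. It assumes the count is a 32-bit signed integer (the width is an assumption here, not something the log states), and the helper names to_unsigned32 and set_bits are hypothetical, chosen only for this illustration.

```python
def to_unsigned32(value: int) -> int:
    """Reinterpret a signed integer as its 32-bit two's-complement bit pattern."""
    return value & 0xFFFFFFFF


def set_bits(pattern: int) -> int:
    """Count how many bits are set in a non-negative integer."""
    return bin(pattern).count("1")


count = -8388608  # value logged in the "Bad page state" messages

pattern = to_unsigned32(count)
print(hex(pattern))          # bit pattern of the count, assuming 32 bits
print(count == -(1 << 23))   # the count is exactly -2**23
print(set_bits(pattern))     # number of bits that differ from an all-zero count
```

Under that assumption the pattern has its top nine bits set and the rest clear, so reaching it from zero would take nine simultaneous bit changes rather than a single flipped bit.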
Reopen if you can reproduce the problem with the latest kernel version.
I have compiled and used kernel 2.6.16-gentoo-r7 and I still get the problem. The message is slightly different, but the count still has the value 0xFF800000. Note also that the flags are no longer symmetrical. What information could I provide to help?

Bad page state in process 'cc1'
page:ffff810001287aa0 flags:0x4000000000000000 mapping:0000000000000000 mapcount:0 count:-8388608
Trying to fix it up, but a reboot is needed
Backtrace:
Call Trace:
  <ffffffff801489f4>{bad_page+80}
  <ffffffff801495b7>{get_page_from_freelist+665}
  <ffffffff80149745>{__alloc_pages+81}
  <ffffffff8015248b>{__handle_mm_fault+449}
  <ffffffff80147cc2>{generic_file_read+171}
  <ffffffff80170202>{permission+126}
  <ffffffff80118bbe>{do_page_fault+915}
  <ffffffff801788bf>{dput+32}
  <ffffffff80164a43>{__fput+389}
  <ffffffff8010b151>{error_exit+0}
Is this reproducible on the latest development kernel (currently 2.6.17-rc4)?
After inspecting my system, I found that the power supply fan had stopped: there was too much dust on its axle. After replacing that part, I can no longer reproduce the bug with any kernel.