Bug 31650 - mm-sources behave strange: Bad page state at free_hot_cold_page
Summary: mm-sources behave strange: Bad page state at free_hot_cold_page
Status: RESOLVED CANTFIX
Alias: None
Product: Gentoo Linux
Classification: Unclassified
Component: [OLD] Core system
Hardware: x86 Linux
Importance: High critical
Assignee: x86-kernel@gentoo.org (DEPRECATED)
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2003-10-20 23:49 UTC by tomgo
Modified: 2003-10-21 03:05 UTC
Description tomgo 2003-10-20 23:49:01 UTC
I don't know whether this bug belongs here, on the kernel mailing list, or is
just due to some misconfiguration of my box, but I'll file it here first
because I think it is kernel-related. My problem:
Different programs segfault at random. Most of the time it seems to start with
portage, but I'm not sure that is always the case. Then other programs start
to segfault. A lot of lines like these appear in my /var/log/messages:
Oct 21 08:17:18 hal login(pam_unix)[3112]: session opened for user root by (uid=0)
Oct 21 08:20:48 hal Bad page state at free_hot_cold_page
Oct 21 08:20:48 hal flags:0x01000014 mapping:00000000 mapped:1 count:0
Oct 21 08:20:48 hal Backtrace:
Oct 21 08:20:48 hal Call Trace:
Oct 21 08:20:48 hal [<c0131bcb>] bad_page+0x41/0x64
Oct 21 08:20:48 hal [<c013219b>] free_hot_cold_page+0x48/0xd8
Oct 21 08:20:48 hal [<c013923b>] zap_pte_range+0x13c/0x174
Oct 21 08:20:48 hal [<c01392b7>] zap_pmd_range+0x44/0x50
Oct 21 08:20:48 hal [<c0139300>] unmap_page_range+0x3d/0x59
Oct 21 08:20:48 hal [<c01393f5>] unmap_vmas+0xd9/0x1c7
Oct 21 08:20:48 hal [<c013c7de>] exit_mmap+0x61/0x14c
Oct 21 08:20:48 hal [<c01194e7>] mmput+0x7b/0xb8
Oct 21 08:20:48 hal [<c011ca1e>] do_exit+0x152/0x33d
Oct 21 08:20:48 hal [<c011ccd7>] sys_exit_group+0x0/0x11
Oct 21 08:20:48 hal [<c0123ee7>] get_signal_to_deliver+0x2bd/0x2e2
Oct 21 08:20:48 hal [<c010a55e>] do_signal+0x4f/0xbc
Oct 21 08:20:48 hal [<c011d27a>] sys_wait4+0x205/0x20f
Oct 21 08:20:48 hal [<c0117fb4>] default_wake_function+0x0/0x18
Oct 21 08:20:48 hal [<c0117fb4>] default_wake_function+0x0/0x18
Oct 21 08:20:48 hal [<c011633b>] do_page_fault+0x0/0x451
Oct 21 08:20:48 hal [<c010a5f2>] do_notify_resume+0x27/0x35
Oct 21 08:20:48 hal [<c02bc83e>] work_notifysig+0x13/0x15
Oct 21 08:20:48 hal 
Oct 21 08:20:48 hal Trying to fix it up, but a reboot is needed
Oct 21 08:20:48 hal Bad page state at free_hot_cold_page
Oct 21 08:20:48 hal flags:0x01000014 mapping:00000000 mapped:1 count:0
Oct 21 08:20:48 hal Backtrace:
Oct 21 08:20:48 hal Call Trace:
Oct 21 08:20:48 hal [<c0131bcb>] bad_page+0x41/0x64
Oct 21 08:20:48 hal [<c013219b>] free_hot_cold_page+0x48/0xd8
Oct 21 08:20:48 hal [<c013923b>] zap_pte_range+0x13c/0x174
Oct 21 08:20:48 hal [<c01392b7>] zap_pmd_range+0x44/0x50
Oct 21 08:20:48 hal [<c0139300>] unmap_page_range+0x3d/0x59
Oct 21 08:20:48 hal [<c01393f5>] unmap_vmas+0xd9/0x1c7
Oct 21 08:20:48 hal [<c013c7de>] exit_mmap+0x61/0x14c
Oct 21 08:20:48 hal [<c01194e7>] mmput+0x7b/0xb8
Oct 21 08:20:48 hal [<c011ca1e>] do_exit+0x152/0x33d
Oct 21 08:20:48 hal [<c011ccd7>] sys_exit_group+0x0/0x11
Oct 21 08:20:48 hal [<c0123ee7>] get_signal_to_deliver+0x2bd/0x2e2
Oct 21 08:20:48 hal [<c010a55e>] do_signal+0x4f/0xbc
Oct 21 08:20:48 hal [<c011d27a>] sys_wait4+0x205/0x20f
Oct 21 08:20:48 hal [<c0117fb4>] default_wake_function+0x0/0x18
Oct 21 08:20:48 hal [<c0117fb4>] default_wake_function+0x0/0x18
Oct 21 08:20:48 hal [<c011633b>] do_page_fault+0x0/0x451
Oct 21 08:20:48 hal [<c010a5f2>] do_notify_resume+0x27/0x35
Oct 21 08:20:48 hal [<c02bc83e>] work_notifysig+0x13/0x15
Oct 21 08:20:48 hal 
Oct 21 08:20:48 hal Trying to fix it up, but a reboot is needed
Oct 21 08:20:48 hal Bad page state at free_hot_cold_page
Oct 21 08:20:48 hal flags:0x01000014 mapping:00000000 mapped:1 count:0
Oct 21 08:20:48 hal Backtrace:
Oct 21 08:20:48 hal Call Trace:
Oct 21 08:20:48 hal [<c0131bcb>] bad_page+0x41/0x64
Oct 21 08:20:48 hal [<c013219b>] free_hot_cold_page+0x48/0xd8
Oct 21 08:20:48 hal [<c013923b>] zap_pte_range+0x13c/0x174
Oct 21 08:20:48 hal [<c01392b7>] zap_pmd_range+0x44/0x50
Oct 21 08:20:48 hal [<c0139300>] unmap_page_range+0x3d/0x59
Oct 21 08:20:48 hal [<c01393f5>] unmap_vmas+0xd9/0x1c7
Oct 21 08:20:48 hal [<c013c7de>] exit_mmap+0x61/0x14c
Oct 21 08:20:48 hal [<c01194e7>] mmput+0x7b/0xb8
Oct 21 08:20:48 hal [<c011ca1e>] do_exit+0x152/0x33d
Oct 21 08:20:48 hal [<c011ccd7>] sys_exit_group+0x0/0x11
Oct 21 08:20:48 hal [<c0123ee7>] get_signal_to_deliver+0x2bd/0x2e2
Oct 21 08:20:48 hal [<c010a55e>] do_signal+0x4f/0xbc
Oct 21 08:20:48 hal [<c011d27a>] sys_wait4+0x205/0x20f
Oct 21 08:20:48 hal [<c0117fb4>] default_wake_function+0x0/0x18
Oct 21 08:20:48 hal [<c0117fb4>] default_wake_function+0x0/0x18
Oct 21 08:20:48 hal [<c011633b>] do_page_fault+0x0/0x451
Oct 21 08:20:48 hal [<c010a5f2>] do_notify_resume+0x27/0x35
Oct 21 08:20:48 hal [<c02bc83e>] work_notifysig+0x13/0x15
Oct 21 08:20:48 hal 
Oct 21 08:20:48 hal Trying to fix it up, but a reboot is needed
Oct 21 08:20:48 hal Bad page state at free_hot_cold_page
Oct 21 08:20:48 hal flags:0x0102000c mapping:d3b5d9f4 mapped:0 count:0
Oct 21 08:20:48 hal Backtrace:
Oct 21 08:20:48 hal Call Trace:
Oct 21 08:20:48 hal [<c0131bcb>] bad_page+0x41/0x64
Oct 21 08:20:48 hal [<c013219b>] free_hot_cold_page+0x48/0xd8
Oct 21 08:20:48 hal [<c013923b>] zap_pte_range+0x13c/0x174
Oct 21 08:20:48 hal [<c01392b7>] zap_pmd_range+0x44/0x50
Oct 21 08:20:48 hal [<c0139300>] unmap_page_range+0x3d/0x59
Oct 21 08:20:48 hal [<c01393f5>] unmap_vmas+0xd9/0x1c7
Oct 21 08:20:48 hal [<c013c7de>] exit_mmap+0x61/0x14c
Oct 21 08:20:48 hal [<c01194e7>] mmput+0x7b/0xb8
Oct 21 08:20:48 hal [<c011ca1e>] do_exit+0x152/0x33d
Oct 21 08:20:48 hal [<c011ccd7>] sys_exit_group+0x0/0x11
Oct 21 08:20:48 hal [<c0123ee7>] get_signal_to_deliver+0x2bd/0x2e2
Oct 21 08:20:48 hal [<c010a55e>] do_signal+0x4f/0xbc
Oct 21 08:20:48 hal [<c011d27a>] sys_wait4+0x205/0x20f
Oct 21 08:20:48 hal [<c0117fb4>] default_wake_function+0x0/0x18
Oct 21 08:20:48 hal [<c0117fb4>] default_wake_function+0x0/0x18
Oct 21 08:20:48 hal [<c011633b>] do_page_fault+0x0/0x451
Oct 21 08:20:48 hal [<c010a5f2>] do_notify_resume+0x27/0x35
Oct 21 08:20:48 hal [<c02bc83e>] work_notifysig+0x13/0x15
Oct 21 08:20:48 hal 
Oct 21 08:20:48 hal Trying to fix it up, but a reboot is needed
Oct 21 08:20:48 hal swap_free: Bad swap file entry 40ead110
Oct 21 08:20:48 hal swap_free: Bad swap file entry 80eac120
Oct 21 08:20:48 hal Bad page state at free_hot_cold_page
Oct 21 08:20:48 hal flags:0x0102001c mapping:c9ce20f4 mapped:0 count:0
Oct 21 08:20:48 hal Backtrace:
Oct 21 08:20:48 hal Call Trace:
Oct 21 08:20:48 hal [<c0131bcb>] bad_page+0x41/0x64
Oct 21 08:20:48 hal [<c013219b>] free_hot_cold_page+0x48/0xd8
Oct 21 08:20:48 hal [<c013923b>] zap_pte_range+0x13c/0x174
Oct 21 08:20:48 hal [<c01392b7>] zap_pmd_range+0x44/0x50
Oct 21 08:20:48 hal [<c0139300>] unmap_page_range+0x3d/0x59
Oct 21 08:20:48 hal [<c01393f5>] unmap_vmas+0xd9/0x1c7
Oct 21 08:20:48 hal [<c013c7de>] exit_mmap+0x61/0x14c
Oct 21 08:20:48 hal [<c01194e7>] mmput+0x7b/0xb8
Oct 21 08:20:48 hal [<c011ca1e>] do_exit+0x152/0x33d
Oct 21 08:20:48 hal [<c011ccd7>] sys_exit_group+0x0/0x11
Oct 21 08:20:48 hal [<c0123ee7>] get_signal_to_deliver+0x2bd/0x2e2
Oct 21 08:20:48 hal [<c010a55e>] do_signal+0x4f/0xbc
Oct 21 08:20:48 hal [<c011d27a>] sys_wait4+0x205/0x20f
Oct 21 08:20:48 hal [<c0117fb4>] default_wake_function+0x0/0x18
Oct 21 08:20:48 hal [<c0117fb4>] default_wake_function+0x0/0x18
Oct 21 08:20:48 hal [<c011633b>] do_page_fault+0x0/0x451
Oct 21 08:20:48 hal [<c010a5f2>] do_notify_resume+0x27/0x35
Oct 21 08:20:48 hal [<c02bc83e>] work_notifysig+0x13/0x15
Oct 21 08:20:48 hal 
Oct 21 08:20:48 hal Trying to fix it up, but a reboot is needed
Oct 21 08:20:48 hal mm/memory.c:100: bad pmd 09800000.
Oct 21 08:21:28 hal Bad page state at prep_new_page
Oct 21 08:21:28 hal flags:0x01000004 mapping:00000000 mapped:1 count:0
Oct 21 08:21:28 hal Backtrace:
Oct 21 08:21:28 hal Call Trace:
Oct 21 08:21:28 hal [<c0131bcb>] bad_page+0x41/0x64
Oct 21 08:21:28 hal [<c0131eaa>] prep_new_page+0x24/0x3c
Oct 21 08:21:28 hal [<c0132361>] buffered_rmqueue+0x125/0x12e
Oct 21 08:21:28 hal [<c01323fd>] __alloc_pages+0x93/0x2ad
Oct 21 08:21:28 hal [<c013a482>] do_anonymous_page+0xd4/0x1db
Oct 21 08:21:28 hal [<c013a9d0>] handle_mm_fault+0x6c/0x106
Oct 21 08:21:28 hal [<c0116461>] do_page_fault+0x126/0x451
Oct 21 08:21:28 hal [<c0121474>] update_process_times+0x29/0x2f
Oct 21 08:21:28 hal [<c012136c>] update_wall_time+0xb/0x33
Oct 21 08:21:28 hal [<c0121672>] do_timer+0x4c/0xc1
Oct 21 08:21:28 hal [<c010eeb7>] timer_interrupt+0x48/0x112
Oct 21 08:21:28 hal [<c010b657>] handle_IRQ_event+0x27/0x4a
Oct 21 08:21:28 hal [<c011decd>] do_softirq+0x45/0x87
Oct 21 08:21:28 hal [<c011633b>] do_page_fault+0x0/0x451
Oct 21 08:21:28 hal [<c02bd1ff>] error_code+0x2f/0x38
Oct 21 08:21:28 hal 
Oct 21 08:21:28 hal Trying to fix it up, but a reboot is needed
Oct 21 08:21:28 hal Bad page state at prep_new_page
Oct 21 08:21:28 hal flags:0x01000004 mapping:00000000 mapped:1 count:0
Oct 21 08:21:28 hal Backtrace:
Oct 21 08:21:28 hal Call Trace:
Oct 21 08:21:28 hal [<c0131bcb>] bad_page+0x41/0x64
Oct 21 08:21:28 hal [<c0131eaa>] prep_new_page+0x24/0x3c
Oct 21 08:21:28 hal [<c0132361>] buffered_rmqueue+0x125/0x12e
Oct 21 08:21:28 hal [<c01323fd>] __alloc_pages+0x93/0x2ad
Oct 21 08:21:28 hal [<c013a482>] do_anonymous_page+0xd4/0x1db
Oct 21 08:21:28 hal [<c013a9d0>] handle_mm_fault+0x6c/0x106
Oct 21 08:21:28 hal [<c0116461>] do_page_fault+0x126/0x451
Oct 21 08:21:28 hal [<c0121474>] update_process_times+0x29/0x2f
Oct 21 08:21:28 hal [<c012136c>] update_wall_time+0xb/0x33
Oct 21 08:21:28 hal [<c0121672>] do_timer+0x4c/0xc1
Oct 21 08:21:28 hal [<c0117b22>] schedule+0x66/0x4b7
Oct 21 08:21:28 hal [<c011decd>] do_softirq+0x45/0x87
Oct 21 08:21:28 hal [<c011633b>] do_page_fault+0x0/0x451
Oct 21 08:21:28 hal [<c02bd1ff>] error_code+0x2f/0x38
Oct 21 08:21:28 hal 
Oct 21 08:21:28 hal Trying to fix it up, but a reboot is needed
Oct 21 08:21:53 hal init: Switching to runlevel: 6


As you can see, I tried to reboot at the end, but the box stopped at a screen
that showed some information and told me to copy it and send it to the kernel
mailing list. Unfortunately the box was nearly shut down at that point, so I
could not capture that information.
This first happened to me on mm-sources during the test6 series. I then had to
reinstall, because the crashes caused such massive corruption on my hard disk
that fsck.reiserfs was unable to repair it. Now I am on the test8 series of
mm-sources and it is happening again. I will switch back to xfs-sources until
this problem is solved.
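
For anyone triaging a log like the one above, here is a rough sketch (not part of the original report) that counts the "Bad page state" reports per call site and flags value. The function name summarize_bad_pages and the two regular expressions are my own assumptions, written against the line format shown in this paste only.

```python
import re
from collections import Counter

# Patterns written against the syslog lines pasted in this report (assumption:
# other kernels may format these messages differently).
SITE_RE = re.compile(r"Bad page state at (\w+)")
FLAGS_RE = re.compile(r"flags:(0x[0-9a-f]+) mapping:[0-9a-f]+ mapped:\d+ count:\d+")


def summarize_bad_pages(lines):
    """Count reports, keyed by (call site, flags value)."""
    counts = Counter()
    site = None
    for line in lines:
        match = SITE_RE.search(line)
        if match:
            # Remember the call site until its flags line shows up.
            site = match.group(1)
            continue
        match = FLAGS_RE.search(line)
        if match and site is not None:
            counts[(site, match.group(1))] += 1
            site = None
    return counts


sample = [
    "Oct 21 08:20:48 hal Bad page state at free_hot_cold_page",
    "Oct 21 08:20:48 hal flags:0x01000014 mapping:00000000 mapped:1 count:0",
    "Oct 21 08:20:48 hal Backtrace:",
    "Oct 21 08:21:28 hal Bad page state at prep_new_page",
    "Oct 21 08:21:28 hal flags:0x01000004 mapping:00000000 mapped:1 count:0",
]

for (site, flags), n in sorted(summarize_bad_pages(sample).items()):
    print(f"{n} x {site} flags={flags}")
```

Feeding it the lines of /var/log/messages instead of the sample list gives a quick overview of how many distinct page states were reported, which is easier to quote in a report than the full traces.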

Reproducible: Always
Steps to Reproduce:
1. emerge mm-sources
2. Work with your box
3. See it happen: sometimes after five minutes, sometimes after five hours without problems

Actual Results:  
I have to shut down the box and then press reset, because it hangs at the end of the shutdown.


Portage 2.0.49-r13-2 (default-x86-1.4, gcc-3.3.1, glibc-2.3.2-r6, 2.4.20-xfs-r3)
=================================================================
System uname: 2.4.20-xfs-r3 i686 AMD Athlon(tm) XP 2000+
Gentoo Base System version 1.4.3.11
ccache version 2.3 [enabled]
ACCEPT_KEYWORDS="x86 ~x86"
AUTOCLEAN="yes"
CFLAGS="-march=athlon-xp -O3 -pipe -fstack-protector -fomit-frame-pointer
-funroll-loops"
CHOST="i686-pc-linux-gnu"
COMPILER="gcc3"
CONFIG_PROTECT="/etc /var/qmail/control /usr/kde/2/share/config
/usr/kde/3/share/config /usr/X11R6/lib/X11/xkb /usr/kde/3.1/share/config
/usr/share/config"
CONFIG_PROTECT_MASK="/etc/gconf /etc/env.d"
CXXFLAGS="-march=athlon-xp -O3 -pipe -fstack-protector -fomit-frame-pointer
-funroll-loops"
DISTDIR="/usr/portage/distfiles"
FEATURES="autoaddcvs sandbox buildpkg ccache"
GENTOO_MIRRORS="http://gentoo.inode.at/ ftp://ftp.easynet.nl/mirror/gentoo/
http://ftp.easynet.nl/mirror/gentoo/
http://mirrors.sec.informatik.tu-darmstadt.de/gentoo"
MAKEOPTS="-j2"
PKGDIR="/usr/portage/packages"
PORTAGE_TMPDIR="/var/tmp"
PORTDIR="/usr/portage"
PORTDIR_OVERLAY="/usr/local/portage"
SYNC="rsync://rsync.europe.gentoo.org/gentoo-portage"
USE="x86 oss avi crypt cups encode foomaticdb gif jpeg libwww mad mikmod mpeg
ncurses nls pdflib png quicktime spell truetype xml2 xmms xv zlib gdbm berkdb
slang readline X gpm tcpd pam ssl python imlib oggvorbis gtk opengl 3dnow cdr
dga -nptl dv dvd faad ffmpeg flac gtk2 java javascript lcms matroska md5sum mmx
mozilla offensive pic samba slp sse tcltk tiff transcode unicode wmf xfs xosd
xvid -apm -arts -kde -gnome -libg++ -motif -qt -sdl -svga -perl -esd"
Comment 1 Brian Jackson (RETIRED) gentoo-dev 2003-10-21 03:05:41 UTC
We don't yet support any of the 2.6-based kernels internally; we only provide
the ebuilds for those individuals who wish to test and report bugs upstream.
So my advice is to report this upstream. Make sure you include the logs,
your system configuration, your kernel config, the output of
/usr/src/linux/scripts/ver_linux, and as much other info as you can come up with.
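
A minimal sketch of how the log excerpts and the kernel config could be gathered into one text file for an upstream report. Everything here is an assumption for illustration: the helper names, the keyword list, and the default paths /var/log/messages and /usr/src/linux/.config (adjust them to your system). The ver_linux output is not collected by this sketch and would still have to be appended by hand.

```python
from pathlib import Path

# Message fragments seen in this bug's log; extend as needed (assumption).
KEYWORDS = ("Bad page state", "swap_free", "bad pmd")


def read_or_note(path):
    """Return the file's text, or a short note if it cannot be read."""
    try:
        return Path(path).read_text(errors="replace")
    except OSError as exc:
        return f"(could not read {path}: {exc.strerror})\n"


def build_report(log_path="/var/log/messages",
                 config_path="/usr/src/linux/.config"):
    """Collect matching log lines and the kernel config into one string."""
    log_lines = read_or_note(log_path).splitlines()
    hits = [line for line in log_lines if any(k in line for k in KEYWORDS)]
    parts = ["== matching log lines =="]
    parts.extend(hits or ["(no matching lines found)"])
    parts.append("== kernel config ==")
    parts.append(read_or_note(config_path).rstrip())
    return "\n".join(parts) + "\n"


if __name__ == "__main__":
    print(build_report())
```

Redirecting the output to a file gives something that can be attached to a mail; the full call traces should still be included alongside it, since the filter above only keeps the headline lines.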