I've been working on a system-intensive java application, and no matter which JVM I try, the application eventully crashes, and all of the crash output points to a problem with glibc. This only occurs on amd64 machines, and I've been able to reproduce it on all 5 identical machines that I'm using, so I strongly discount it being hardware related. I'm attaching the crash output from IBM JDK and blackdown-jdk, as they show the crash data clearest. I'm working on rebuilding glibc with debugging, to provide a proper stacktrace back to the source of the error.
Created attachment 80755 [details] Crash output from IBM JDK
Created attachment 80756 [details] Blackdown 1.4 Crash #1
Created attachment 80757 [details] Blackdown 1.4 Crash #2
This is the emerge --info output for machine 'w4'. w1-w5 are all very close to identical. They all have identical hardware (amd64 3000+, 2GB ECC RAM, MSI MS-6741, dual WD1600JD-55 drives on linux md raid1). The only difference is w5 is running 2.6.15.4, while the rest are running 2.6.12 still. Portage 2.1_pre4-r1 (default-linux/amd64/2005.0, gcc-3.4.4, glibc-2.3.5-r2, 2.6.12 x86_64) ================================================================= System uname: 2.6.12 x86_64 AMD Athlon(tm) 64 Processor 3000+ Gentoo Base System version 1.6.14 distcc 2.18.3 x86_64-pc-linux-gnu (protocols 1 and 2) (default port 3632) [disabled] dev-lang/python: 2.3.5, 2.4.2 sys-apps/sandbox: 1.2.12 sys-devel/autoconf: 2.13, 2.59-r6 sys-devel/automake: 1.4_p6, 1.5, 1.6.3, 1.7.9-r1, 1.8.5-r3, 1.9.6-r1 sys-devel/binutils: 2.16.1 sys-devel/libtool: 1.5.22 virtual/os-headers: 2.6.11-r2 ACCEPT_KEYWORDS="amd64" AUTOCLEAN="yes" CBUILD="x86_64-pc-linux-gnu" CFLAGS="-Os -march=athlon64 -pipe -ggdb3" CHOST="x86_64-pc-linux-gnu" CONFIG_PROTECT="/etc /usr/kde/2/share/config /usr/kde/3/share/config /usr/share/config /var/qmail/control" CONFIG_PROTECT_MASK="/etc/gconf /etc/terminfo /etc/env.d" CXXFLAGS="-Os -march=athlon64 -pipe -ggdb3" DISTDIR="/usr/portage/distfiles" FEATURES="autoconfig distlocks fixpackages keepwork noclean sandbox sfperms splitdebug strict" GENTOO_MIRRORS="rsync://db/gentoo-distfiles http://mirror.datapipe.net/gentoo http://chod.cwru.edu/gentoo http://gentoo.netnitco.net ftp://gentoo.netnitco.net/pub/mirrors/gentoo/source/ ftp://194.117.143.69/mirrors/gentoo" MAKEOPTS="-j5" PKGDIR="/usr/portage/packages" PORTAGE_TMPDIR="/var/tmp" PORTDIR="/usr/portage" PORTDIR_OVERLAY="/usr/local/portage" SYNC="rsync://db/gentoo-portage" USE="amd64 apache2 async authdaemond bash-completion berkdb bzlib cap caps chroot ck-server cracklib crypt curlwrappers dba dbm dbx dio emboss erandom exif fastcgi flatfile fortran gd-external hpn imap imlib innodb ipv6 jpeg lids memlimit mpi mysqli nagios-dns nagios-ntp nagios-ping nagios-ssh ncurses nethack nfs nls no-htdocs nptl pam pdflib perl pg-hier pg-intdatetime pg-vacuumdelay png posix python readline reiserfs rrdtool sasl session sftp sftplogging sharedext skey soap sockets spell ssl symlink sysvipc tcpd threads tiff urandom userlocales xattr xml xmlrpc zlib elibc_glibc kernel_linux userland_GNU" Unset: ASFLAGS, CTARGET, EMERGE_DEFAULT_OPTS, LANG, LC_ALL, LDFLAGS, LINGUAS
This might be related to bug #123329.
bug #123329 seems to about stuff that fails on 2.3.6-r3, and I'm way back on 2.3.5-r2 since that's marked as stable amd64.
> bug #123329 seems to about stuff that fails on 2.3.6-r3, and I'm way back on > 2.3.5-r2 since that's marked as stable amd64. correct ... 2.3.6-r3 is the only one with the stuff from Bug 123329
ok, not much I can do about debug symbols at the moment. I compiled glibc-2.3.5-r2 with FEATURES='keepwork splitdebug' CFLAGS='-O2 -march=athlon64 -ggdb3' and it hosed the box - I guess I should have used FEATURES='test' before putting that glibc into the box. I had to use a livecd and restore the previous glibc. I'm running the glibc tests this time around.
Ok, as more research into this, disabling NPTL via compiling or using LD_ASSUME_KERNEL=2.4.9 makes the problem disappear.
give USE="nptl nptlonly" a spin maybe ?
(In reply to comment #10) > give USE="nptl nptlonly" a spin maybe ? > I have a similar problem with applets in firefox : When I go into a "java included" website, nothing bad happens, but when the mouse comes into the applet, firefox *always* crash with that message : # # An unexpected error has been detected by HotSpot Virtual Machine: # # SIGSEGV (0xb) at pc=0x00002b062b79f7da, pid=26338, tid=1100601680 # # Java VM: Java HotSpot(TM) 64-Bit Server VM (Blackdown-1.4.2-03 mixed mode) # Problematic frame: # V [libjvm.so+0x3327da] # # An error report file with more information is saved as hs_err_pid26338.log # # If you would like to submit a bug report, please visit: # http://www.blackdown.org/cgi-bin/jdk # Plugin: unexpected work request from child INTERNAL ERROR on Browser End: Code = 142b0000 System error?:: Succ
(In reply to comment #10) > give USE="nptl nptlonly" a spin maybe ? > I have a similar problem with applets in firefox : When I go into a "java included" website, nothing bad happens, but when the mouse comes into the applet, firefox *always* crash with that message : # # An unexpected error has been detected by HotSpot Virtual Machine: # # SIGSEGV (0xb) at pc=0x00002b062b79f7da, pid=26338, tid=1100601680 # # Java VM: Java HotSpot(TM) 64-Bit Server VM (Blackdown-1.4.2-03 mixed mode) # Problematic frame: # V [libjvm.so+0x3327da] # # An error report file with more information is saved as hs_err_pid26338.log # # If you would like to submit a bug report, please visit: # http://www.blackdown.org/cgi-bin/jdk # Plugin: unexpected work request from child INTERNAL ERROR on Browser End: Code = 142b0000 System error?:: Succès firefox-bin exited with non-zero status (255) Portage 2.1.1 (default-linux/amd64/2006.1, gcc-4.1.1, glibc-2.4-r3, 2.6.18 x86_64) ================================================================= System uname: 2.6.18 x86_64 AMD Athlon(tm) 64 Processor 3000+ Gentoo Base System version 1.12.5 Last Sync: Thu, 21 Sep 2006 13:00:01 +0000 app-admin/eselect-compiler: [Not Present] dev-java/java-config: 1.2.11-r1 dev-lang/python: 2.3.5-r2, 2.4.3-r1 dev-python/pycrypto: 2.0.1-r5 dev-util/ccache: [Not Present] dev-util/confcache: [Not Present] sys-apps/sandbox: 1.2.17 sys-devel/autoconf: 2.13, 2.59-r7 sys-devel/automake: 1.4_p6, 1.5, 1.6.3, 1.7.9-r1, 1.8.5-r3, 1.9.6-r2 sys-devel/binutils: 2.16.1-r3 sys-devel/gcc-config: 1.3.13-r3 sys-devel/libtool: 1.5.22 virtual/os-headers: 2.6.11-r2 ACCEPT_KEYWORDS="amd64" AUTOCLEAN="yes" CBUILD="x86_64-pc-linux-gnu" CFLAGS="-march=athlon64 -O2 -pipe -fomit-frame-pointer" CHOST="x86_64-pc-linux-gnu" CONFIG_PROTECT="/etc /usr/kde/3.5/env /usr/kde/3.5/share/config /usr/kde/3.5/shutdown /usr/share/X11/xkb /usr/share/config /usr/share/texmf/dvipdfm/config/ /usr/share/texmf/dvips/config/ /usr/share/texmf/tex/generic/config/ /usr/share/texmf/tex/platex/config/ /usr/share/texmf/xdvi/" CONFIG_PROTECT_MASK="/etc/env.d /etc/gconf /etc/revdep-rebuild /etc/splash /etc/terminfo" CXXFLAGS="-march=athlon64 -O2 -pipe -fomit-frame-pointer" DISTDIR="/usr/portage/distfiles" FEATURES="autoconfig distlocks metadata-transfer sandbox sfperms strict" GENTOO_MIRRORS="http://mir.zyrianes.net/gentoo/ ftp://linux.rz.ruhr-uni-bochum.de/gentoo-mirror/" LANG="fr_FR@euro" LINGUAS="fr" MAKEOPTS="" PKGDIR="/usr/portage/packages" PORTAGE_RSYNC_OPTS="--recursive --links --safe-links --perms --times --compress --force --whole-file --delete --delete-after --stats --timeout=180 --exclude='/distfiles' --exclude='/local' --exclude='/packages'" PORTAGE_TMPDIR="/var/tmp" PORTDIR="/usr/portage" SYNC="rsync://rsync.gentoo.org/gentoo-portage" USE="amd64 X a52 aac aalib acpi alsa apm audiofile avi bzip2 cdparanoia cdr crypt cups curl dga directfb dts dv dvb dvd dvdr dvdread elibc_glibc encode esd exif extrafilters extras fame fbcon ffmpeg flac gd ggi gif gphoto2 gpm gtk gtk2 i8x0 icq imagemagick imap imlib input_devices_evdev input_devices_keyboard input_devices_mouse insecure-patches jabber jack java jpeg jpeg2k kernel_linux linguas_fr lirc lirc_devices_pctv lzo mad matroska mikmod mime mjpeg motif mozcalendar mp3 mpeg msn musepack nas ncurses new-login nls nptl nptlonly nsplugin offensive ogg openal opengl pam pcre pdf perl png python qt quicktime rar readline sdl sftplogging spell ssl svg tcltk tetex theora tiff truetype truetype-fonts type1-fonts unicode usb userland_GNU v4l v4l2 vcd video_cards_apm video_cards_ark video_cards_ati video_cards_chips video_cards_cirrus video_cards_cyrix video_cards_dummy video_cards_fbdev video_cards_glint video_cards_i128 video_cards_i810 video_cards_mga video_cards_neomagic video_cards_nv video_cards_rendition video_cards_s3 video_cards_s3virge video_cards_savage video_cards_siliconmotion video_cards_sis video_cards_sisusb video_cards_tdfx video_cards_tga video_cards_trident video_cards_tseng video_cards_v4l video_cards_vesa video_cards_vga video_cards_via video_cards_vmware video_cards_voodoo vorbis wifi wma wmf x264 xchatdccserver xine xinerama xml2 xpm xscreensaver xv xvid xvmc yahoo zlib" Unset: CTARGET, EMERGE_DEFAULT_OPTS, INSTALL_MASK, LC_ALL, LDFLAGS, PORTAGE_RSYNC_EXTRA_OPTS, PORTDIR_OVERLAY The only unmasked packages I use are : enlightenment e17 vanilla-sources
I re-installed my system for an other reason (changed HDD). I kept the exact same "USE", and now... it works fine... So, might it not be a matter of dependencies bad linked together, and not found by revdep-rebuild ?
followup on my end, this problem seem to solve itself when we got to glibc-2.4. I'm going to close as UPSTREAM since that glibc upgrade solved the issue. Probably one of the lurking bugs in NPTL with glibc-2.3.* that caused it.