Gentoo Websites Logo
Go to: Gentoo Home Documentation Forums Lists Bugs Planet Store Wiki Get Gentoo!
Bug 770976 - x11-wm/stumpwm-18.05 - emerge hangs since 4 hours (possible race condition)
Summary: x11-wm/stumpwm-18.05 - emerge hangs since 4 hours (possible race condition)
Status: RESOLVED FIXED
Alias: None
Product: Gentoo Linux
Classification: Unclassified
Component: Current packages (show other bugs)
Hardware: All Linux
: Normal normal (vote)
Assignee: No maintainer - Look at https://wiki.gentoo.org/wiki/Project:Proxy_Maintainers if you want to take care of it
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2021-02-16 16:25 UTC by Toralf Förster
Modified: 2022-11-25 12:37 UTC (History)
1 user (show)

See Also:
Package list:
Runtime testing required: ---


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description Toralf Förster gentoo-dev 2021-02-16 16:25:57 UTC
usually it takes just a minute or so to compile it here at the tinderbox.
Currently I do test "-j2" - not sure if it is related.
The process table gives:

init,1
  └─sudo,1435 /opt/tb/bin/bwrap.sh -m /home/tinderbox/img1/17.1_desktop_plasma_systemd-20210203-001541 -s /opt/tb/bin/job.sh
      └─bwrap.sh,1459 /opt/tb/bin/bwrap.sh -m /home/tinderbox/img1/17.1_desktop_plasma_systemd-20210203-001541 -s /opt/tb/bin/job.sh
          └─bwrap,1770 --bind /home/tinderbox/img1/17.1_desktop_plasma_systemd-20210203-001541 / --bind /home/tinderbox/tb/data /mnt/tb/data --bind /home/tinderbox/distfiles /var/cache/dist
files --ro-bind /home/tinderbox/tb/sdata /mnt/tb/sdata --ro-bind /var/db/repos /mnt/repos --tmpfs /var/tmp/portage --tmpfs /dev/shm --dev /dev --proc /proc --mqueue /dev/mqueue --unshare-cg
roup --unshare-ipc --unshare-pid --unshare-uts --hostname 17_1_desktop_plasma_systemd-20210203-001541 --chdir /var/tmp/tb --die-with-parent /bin/bash -l -c /entrypoint
              └─bwrap,1830 --bind /home/tinderbox/img1/17.1_desktop_plasma_systemd-20210203-001541 / --bind /home/tinderbox/tb/data /mnt/tb/data --bind /home/tinderbox/distfiles /var/cache/
distfiles --ro-bind /home/tinderbox/tb/sdata /mnt/tb/sdata --ro-bind /var/db/repos /mnt/repos --tmpfs /var/tmp/portage --tmpfs /dev/shm --dev /dev --proc /proc --mqueue /dev/mqueue --unshar
e-cgroup --unshare-ipc --unshare-pid --unshare-uts --hostname 17_1_desktop_plasma_systemd-20210203-001541 --chdir /var/tmp/tb --die-with-parent /bin/bash -l -c /entrypoint
                  └─entrypoint,2043 /entrypoint
                      └─entrypoint,7586 /entrypoint
                          └─emerge,7588 -b /usr/lib/python-exec/python3.8/emerge --update x11-wm/stumpwm-contrib
                              └─python3.8,17011 /usr/lib/portage/python3.8/pid-ns-init 6230
                                  └─python3.8,17030 /usr/lib/portage/python3.8/pid-ns-init 250 250 250 18 0,1,2 /usr/bin/sandbox [x11-wm/stumpwm-18.05] sandbox /usr/lib/portage/python3.8/eb
uild.sh compile
                                      └─sandbox,17197,portage /usr/lib/portage/python3.8/ebuild.sh compile
                                          └─ebuild.sh,17201 /usr/lib/portage/python3.8/ebuild.sh compile
                                              └─ebuild.sh,17262 /usr/lib/portage/python3.8/ebuild.sh compile
                                                  └─emake,17282 /usr/lib/portage/python3.8/ebuild-helpers/emake
                                                      └─make,17292 -j2
                                                          └─sbcl,17317 --load ./make-image.lisp
                                                              └─{sbcl},17323


will kill it now at image run/17.1_desktop_plasma_systemd-20210203-001541
Comment 1 Ionen Wolkens gentoo-dev 2021-02-16 16:49:39 UTC
The -j does look related, it either work or hang kinda randomly for me and looks like a race condition.

MAKEOPTS=-j1 always worked so far
Comment 2 Toralf Förster gentoo-dev 2021-02-16 16:50:37 UTC
Maybe it hangs here:

>>> Install x11-wm/stumpwm-18.05 into /var/tmp/portage/x11-wm/stumpwm-18.05/image/
!!! dobin: stumpwm does not exist
 * ERROR: x11-wm/stumpwm-18.05::gentoo failed (install phase):
 *   dobin failed
 *
Comment 3 Toralf Förster gentoo-dev 2021-03-23 17:38:23 UTC
masked this package at the tinderbox
Comment 4 Toralf Förster gentoo-dev 2021-03-23 17:39:07 UTC
hangs here since 28 hours:


init,1
  └─sudo,22179 /opt/tb/bin/bwrap.sh -m /home/tinderbox/img2/17.1_no-multilib-20210315-144814 -s /opt/tb/bin/job.sh
      └─bwrap.sh,22206 /opt/tb/bin/bwrap.sh -m /home/tinderbox/img2/17.1_no-multilib-20210315-144814 -s /opt/tb/bin/job.sh
          └─bwrap,22321 --bind /home/tinderbox/img2/17.1_no-multilib-20210315-144814 / --bind /home/tinderbox/tb/data /mnt/tb/data --bind /home/tinderbox/distfiles /var/cache/distfiles --ro
-bind /home/tinderbox/tb/sdata /mnt/tb/sdata --ro-bind /var/db/repos /mnt/repos --tmpfs /var/tmp/portage --tmpfs /dev/shm --dev /dev --proc /proc --mqueue /dev/mqueue --unshare-cgroup --uns
hare-ipc --unshare-pid --unshare-uts --hostname 17_1_no-multilib-20210315-144814 --chdir /var/tmp/tb --die-with-parent /bin/bash -l -c /entrypoint
              └─bwrap,22329 --bind /home/tinderbox/img2/17.1_no-multilib-20210315-144814 / --bind /home/tinderbox/tb/data /mnt/tb/data --bind /home/tinderbox/distfiles /var/cache/distfiles
--ro-bind /home/tinderbox/tb/sdata /mnt/tb/sdata --ro-bind /var/db/repos /mnt/repos --tmpfs /var/tmp/portage --tmpfs /dev/shm --dev /dev --proc /proc --mqueue /dev/mqueue --unshare-cgroup -
-unshare-ipc --unshare-pid --unshare-uts --hostname 17_1_no-multilib-20210315-144814 --chdir /var/tmp/tb --die-with-parent /bin/bash -l -c /entrypoint
                  └─entrypoint,22347 /entrypoint
                      └─entrypoint,15488 /entrypoint
                          └─emerge,15489 -b /usr/lib/python-exec/python3.8/emerge --update x11-wm/stumpwm
                              └─python3.8,27867 /usr/lib/portage/python3.8/pid-ns-init 4012
                                  └─python3.8,27875 /usr/lib/portage/python3.8/pid-ns-init 250 250 250 18 0,1,2 /usr/bin/sandbox [x11-wm/stumpwm-18.05] sandbox /usr/lib/portage/python3.8/eb
uild.sh compile
                                      └─sandbox,27907,portage /usr/lib/portage/python3.8/ebuild.sh compile
                                          └─ebuild.sh,27909 /usr/lib/portage/python3.8/ebuild.sh compile
                                              └─ebuild.sh,27957 /usr/lib/portage/python3.8/ebuild.sh compile
                                                  └─emake,27964 /usr/lib/portage/python3.8/ebuild-helpers/emake
                                                      └─make,27970 -j 1
                                                          └─sbcl,27983 --load ./make-image.lisp
                                                              └─{sbcl},28128
Comment 5 Toralf Förster gentoo-dev 2021-12-02 15:36:14 UTC
still happened today, a gdb bt does not enlight too much, or ?

#38 0x0000000050100117 in ?? ()
#39 0x0000000000000000 in ?? ()
(gdb) bt thread apply all
No symbol table is loaded.  Use the "file" command.
(gdb) thread apply all bt

Thread 2 (LWP 11423 "finalizer"):
#0  0x00007fc353807cc6 in ?? () from target:/lib64/libc.so.6
#1  0x00007fc35380a40f in pthread_cond_wait () from target:/lib64/libc.so.6
#2  0x00005648fe5c04df in finalizer_thread_wait ()
#3  0x0000000052b97290 in ?? ()
#4  0x00007fc352f96990 in ?? ()
#5  0x0000000052ba05b9 in ?? ()
#6  0x00007fc352f96904 in ?? ()
#7  0x00000000027eb0cb in ?? ()
#8  0x0000000000000000 in ?? ()

Thread 1 (LWP 11351 "sbcl"):
#0  0x00007fc35388219f in poll () from target:/lib64/libc.so.6
#1  0x0000000052bf3357 in ?? ()
#2  0x00007fc353617ff0 in ?? ()
Comment 6 Rahil Bhimjiani 2022-05-17 08:08:47 UTC
(In reply to Ionen Wolkens from comment #1)
> The -j does look related, it either work or hang kinda randomly for me and
> looks like a race condition.
> 
> MAKEOPTS=-j1 always worked so far

Yeah it was stuck at "makeinfo stumpwm.texi" but when I -j1 it worked and finished within seconds.
Comment 7 Toralf Förster gentoo-dev 2022-06-21 11:42:55 UTC
(In reply to Rahil Bhimjiani from comment #6)
> (In reply to Ionen Wolkens from comment #1)
> > The -j does look related, it either work or hang kinda randomly for me and
> > looks like a race condition.
> > 
> > MAKEOPTS=-j1 always worked so far
> 
> Yeah it was stuck at "makeinfo stumpwm.texi" but when I -j1 it worked and
> finished within seconds.

so this ebuild should pin -j1
Comment 8 Toralf Förster gentoo-dev 2022-06-21 11:55:37 UTC
OTOH it even hangs with -j1:

init,1
  └─sudo,7293 /opt/tb/bin/bwrap.sh -m 17.1_desktop_systemd-j4-20220620-111613 -e /opt/tb/bin/job.sh
      └─bwrap.sh,7338 /opt/tb/bin/bwrap.sh -m 17.1_desktop_systemd-j4-20220620-111613 -e /opt/tb/bin/job.sh
          └─bwrap,7431 --clearenv --setenv HOME /root --setenv MAILTO tinderbox --setenv SHELL /bin/bash --setenv TERM linux
 --hostname 17-1-desktop-systemd-j4-20220620-111613 --die-with-parent --chdir /var/tmp/tb --unshare-cgroup --unshare-ipc --u
nshare-pid --unshare-uts --bind /home/tinderbox/img/17.1_desktop_systemd-j4-20220620-111613 / --dev /dev --mqueue /dev/mqueu
e --perms 1777 --tmpfs /dev/shm --ro-bind /home/tinderbox/tb/sdata/ssmtp.conf /etc/ssmtp/ssmtp.conf --bind /home/tinderbox/t
b/data /mnt/tb/data --proc /proc --tmpfs /run --ro-bind /sys /sys --perms 1777 --tmpfs /tmp --bind /home/tinderbox/distfiles
 /var/cache/distfiles --perms 1777 --tmpfs /var/tmp/portage /bin/bash -l -c /entrypoint
              └─bwrap,7443 --clearenv --setenv HOME /root --setenv MAILTO tinderbox --setenv SHELL /bin/bash --setenv TERM l
inux --hostname 17-1-desktop-systemd-j4-20220620-111613 --die-with-parent --chdir /var/tmp/tb --unshare-cgroup --unshare-ipc
 --unshare-pid --unshare-uts --bind /home/tinderbox/img/17.1_desktop_systemd-j4-20220620-111613 / --dev /dev --mqueue /dev/m
queue --perms 1777 --tmpfs /dev/shm --ro-bind /home/tinderbox/tb/sdata/ssmtp.conf /etc/ssmtp/ssmtp.conf --bind /home/tinderb
ox/tb/data /mnt/tb/data --proc /proc --tmpfs /run --ro-bind /sys /sys --perms 1777 --tmpfs /tmp --bind /home/tinderbox/distf
iles /var/cache/distfiles --perms 1777 --tmpfs /var/tmp/portage /bin/bash -l -c /entrypoint
                  └─entrypoint,7459 /entrypoint
                      └─timeout,22249 --signal=15 --kill-after=5m 8h bash -c eval emerge --update x11-wm/stumpwm
                          └─bash,22255 -c eval emerge --update x11-wm/stumpwm
                              └─emerge,22262 -b /usr/lib/python-exec/python3.9/emerge --update x11-wm/stumpwm
                                  └─python3.9,12356 /usr/lib/portage/python3.9/pid-ns-init 1357
                                      └─python3.9,12421 /usr/lib/portage/python3.9/pid-ns-init 250 250 250 18 0,1,2 /usr/bin
/sandbox [x11-wm/stumpwm-18.05] sandbox /usr/lib/portage/python3.9/ebuild.sh compile
                                          └─sandbox,12852,portage /usr/lib/portage/python3.9/ebuild.sh compile
                                              └─ebuild.sh,12863 /usr/lib/portage/python3.9/ebuild.sh compile
                                                  └─ebuild.sh,13115 /usr/lib/portage/python3.9/ebuild.sh compile
                                                      └─emake,13159 /usr/lib/portage/python3.9/ebuild-helpers/emake
                                                          └─make,13192 -j1
                                                              └─sbcl,13276 --load ./make-image.lisp
Comment 9 Larry the Git Cow gentoo-dev 2022-11-25 12:37:57 UTC
The bug has been closed via the following commit(s):

https://gitweb.gentoo.org/repo/gentoo.git/commit/?id=5f035ea9e8f2ac363c6c3989c0b413514104de39

commit 5f035ea9e8f2ac363c6c3989c0b413514104de39
Author:     Joonas Niilola <juippis@gentoo.org>
AuthorDate: 2022-11-25 12:30:40 +0000
Commit:     Joonas Niilola <juippis@gentoo.org>
CommitDate: 2022-11-25 12:37:55 +0000

    x11-wm/stumpwm: add 22.05_p20220818, drop 22.05
    
     - add optfeature,
     - rename the ebuild to match the snapshot commit date,
     - restrict make with -j1 - had random build failures with `nproc`.
    
    Closes: https://bugs.gentoo.org/770976
    Signed-off-by: Joonas Niilola <juippis@gentoo.org>

 .../{stumpwm-22.05.ebuild => stumpwm-22.05_p20220818.ebuild}     | 9 ++++-----
 1 file changed, 4 insertions(+), 5 deletions(-)