emerge --regen --jobs fails very early because it doesn't keep track how many filehandles it opens: Processing app-crypt/jacksum Processing app-crypt/johntheripper /usr/lib64/portage/bin/ebuild.sh: cannot make pipe for command substitution: Too many open files Traceback (most recent call last): File "/usr/bin/emerge", line 51, in <module> File "/usr/lib64/portage/pym/_emerge/main.py", line 1044, in emerge_main File "/usr/lib64/portage/pym/_emerge/actions.py", line 3824, in run_action File "/usr/lib64/portage/pym/_emerge/actions.py", line 1993, in action_regen File "/usr/lib64/portage/pym/portage/util/_async/run_main_scheduler.py", line 26, in run_main_scheduler File "/usr/lib64/portage/pym/_emerge/AsynchronousTask.py", line 30, in start File "/usr/lib64/portage/pym/portage/util/_async/AsyncScheduler.py", line 76, in _start File "/usr/lib64/portage/pym/_emerge/PollScheduler.py", line 127, in _schedule File "/usr/lib64/portage/pym/portage/util/_async/AsyncScheduler.py", line 56, in _schedule_tasks File "/usr/lib64/portage/pym/_emerge/AsynchronousTask.py", line 30, in start File "/usr/lib64/portage/pym/_emerge/EbuildMetadataPhase.py", line 117, in _start File "/usr/lib64/portage/pym/portage/package/ebuild/doebuild.py", line 632, in doebuild File "/usr/lib64/portage/pym/portage/repository/config.py", line 242, in load_manifest File "/usr/lib64/portage/pym/portage/manifest.py", line 161, in __init__ File "/usr/lib64/portage/pym/portage/manifest.py", line 210, in _read File "/usr/lib64/portage/pym/portage/manifest.py", line 196, in _readManifest IOError: [Errno 24] Too many open files: '/usr/portage/app-crypt/johntheripper/Manifest' emergelog(): [Errno 24] Too many open files: '/var/log/emerge.log' Portage 2.1.11.55 (default/linux/amd64/13.0, gcc-4.6.3, glibc-2.15-r3, 3.8.3-gentoo x86_64) ================================================================= System uname: Linux-3.8.3-gentoo-x86_64-AMD_FX-tm-8350_Eight-Core_Processor-with-gentoo-2.1 KiB Mem: 32921688 total, 21527436 free KiB Swap: 8388604 total, 8388604 free Timestamp of tree: Unknown ld GNU ld (GNU Binutils) 2.22 app-shells/bash: 4.2_p37 dev-lang/python: 3.2.3 dev-util/pkgconfig: 0.27.1 sys-apps/baselayout: 2.1-r1 sys-apps/openrc: 0.11.8 sys-apps/sandbox: 2.5 sys-devel/autoconf: 2.69 sys-devel/automake: 1.11.6 sys-devel/binutils: 2.22-r1 sys-devel/gcc: 4.6.3 sys-devel/gcc-config: 1.7.3 sys-devel/libtool: 2.4-r1 sys-devel/make: 3.82-r4 sys-kernel/linux-headers: 3.6 (virtual/os-headers) sys-libs/glibc: 2.15-r3 Repositories: gentoo ACCEPT_KEYWORDS="amd64" ACCEPT_LICENSE="* -@EULA" CBUILD="x86_64-pc-linux-gnu" CFLAGS="-O2 -pipe" CHOST="x86_64-pc-linux-gnu" CONFIG_PROTECT="/etc /usr/share/config /usr/share/gnupg/qualified.txt /usr/share/polkit-1/actions" CONFIG_PROTECT_MASK="/etc/ca-certificates.conf /etc/env.d /etc/gconf /etc/gentoo-release /etc/sandbox.d /etc/terminfo" CXXFLAGS="-O2 -pipe" DISTDIR="/usr/portage/distfiles" FCFLAGS="-O2 -pipe" FEATURES="assume-digests binpkg-logs config-protect-if-modified distlocks ebuild-locks fixlafiles merge-sync news parallel-fetch protect-owned sandbox sfperms strict unknown-features-warn unmerge-logs unmerge-orphans userfetch" FFLAGS="-O2 -pipe" GENTOO_MIRRORS="http://distfiles.gentoo.org" LANG="en_US.utf8" LDFLAGS="-Wl,-O1 -Wl,--as-needed" PKGDIR="/usr/portage/packages" PORTAGE_CONFIGROOT="/" PORTAGE_RSYNC_OPTS="--recursive --links --safe-links --perms --times --compress --force --whole-file --delete --stats --human-readable --timeout=180 --exclude=/distfiles --exclude=/local --exclude=/packages" PORTAGE_TMPDIR="/var/tmp" PORTDIR="/usr/portage" PORTDIR_OVERLAY="" SYNC="rsync://rsync.gentoo.org/gentoo-portage" USE="acl amd64 berkdb bindist bzip2 cli cracklib crypt cxx dri fortran gdbm gpm iconv ipv6 mmx modules mudflap multilib ncurses nls nptl openmp pam pcre readline session sse sse2 ssl tcpd unicode zlib" ABI_X86="64" ALSA_CARDS="ali5451 als4000 atiixp atiixp-modem bt87x ca0106 cmipci emu10k1x ens1370 ens1371 es1938 es1968 fm801 hda-intel intel8x0 intel8x0m maestro3 trident usb-audio via82xx via82xx-modem ymfpci" ALSA_PCM_PLUGINS="adpcm alaw asym copy dmix dshare dsnoop empty extplug file hooks iec958 ioplug ladspa lfloat linear meter mmap_emul mulaw multi null plug rate route share shm softvol" APACHE2_MODULES="authn_core authz_core socache_shmcb unixd actions alias auth_basic authn_alias authn_anon authn_dbm authn_default authn_file authz_dbm authz_default authz_groupfile authz_host authz_owner authz_user autoindex cache cgi cgid dav dav_fs dav_lock deflate dir disk_cache env expires ext_filter file_cache filter headers include info log_config logio mem_cache mime mime_magic negotiation rewrite setenvif speling status unique_id userdir usertrack vhost_alias" CALLIGRA_FEATURES="kexi words flow plan sheets stage tables krita karbon braindump" CAMERAS="ptp2" COLLECTD_PLUGINS="df interface irq load memory rrdtool swap syslog" ELIBC="glibc" GPSD_PROTOCOLS="ashtech aivdm earthmate evermore fv18 garmin garmintxt gpsclock itrax mtk3301 nmea ntrip navcom oceanserver oldstyle oncore rtcm104v2 rtcm104v3 sirf superstar2 timing tsip tripmate tnt ubx" INPUT_DEVICES="keyboard mouse evdev" KERNEL="linux" LCD_DEVICES="bayrad cfontz cfontz633 glk hd44780 lb216 lcdm001 mtxorb ncurses text" LIBREOFFICE_EXTENSIONS="presenter-console presenter-minimizer" OFFICE_IMPLEMENTATION="libreoffice" PHP_TARGETS="php5-3" PYTHON_SINGLE_TARGET="python2_7" PYTHON_TARGETS="python2_7 python3_2" RUBY_TARGETS="ruby18 ruby19" USERLAND="GNU" VIDEO_CARDS="fbdev glint intel mach64 mga nouveau nv r128 radeon savage sis tdfx trident vesa via vmware dummy v4l" XTABLES_ADDONS="quota2 psd pknock lscan length2 ipv4options ipset ipp2p iface geoip fuzzy condition tee tarpit sysrq steal rawnat logmark ipmark dhcpmac delude chaos account" Unset: CPPFLAGS, CTARGET, EMERGE_DEFAULT_OPTS, INSTALL_MASK, LC_ALL, MAKEOPTS, PORTAGE_BUNZIP2_COMMAND, PORTAGE_COMPRESS, PORTAGE_COMPRESS_FLAGS, PORTAGE_RSYNC_EXTRA_OPTS, USE_PYTHON
I looks like python's "resource" module has a getrlimit function that can query the open file descriptor limit. I don't see a way to get the number of open file descriptors though. I guess we could use our portage.process.get_open_fds() function, which lists the contents of /proc/self/fd.
I ran into almost the same exception. Setting ulimin -n 40960 helped. It looks like the problem is that emerge --regen leaks file descriptors. /proc/sys/fs/file-nr grows as it works.
(In reply to Yuriy Taraday from comment #2) > I ran into almost the same exception. Setting ulimin -n 40960 helped. > It looks like the problem is that emerge --regen leaks file descriptors. > /proc/sys/fs/file-nr grows as it works. I don't think there's a leak, because if there was then we would notice the leak even for people who use sensible --jobs and --load-average settings. Are you doing like in comment #0 and using unlimited jobs with no --load-average cap?
(In reply to Zac Medico from comment #3) > I don't think there's a leak, because if there was then we would notice the > leak even for people who use sensible --jobs and --load-average settings. > Are you doing like in comment #0 and using unlimited jobs with no > --load-average cap? Yes, I did. --load-average fixed this, thanks. Looks like it's not a leakage but overuse. I don't see how 10 processes can eat about 600 descriptors.
(In reply to Yuriy Taraday from comment #4) > I don't see how 10 processes can eat about 600 descriptors. Well, the main emerge process should only create one file descriptor per process (2 if you count the write end of the pipe which is closed immediately after the fork, see /usr/lib/portage/pym/_emerge/EbuildMetadataPhase.py).
Plus one more, for a total of 3 per process, if you also count the Manifest file which is opened and closed immediately before the fork.
Actually, since the code that spawns the subprocesses is single-threaded, the number of simultaneously open file descriptors is really only one per process.
(In reply to Zac Medico from comment #7) > Actually, since the code that spawns the subprocesses is single-threaded, > the number of simultaneously open file descriptors is really only one per > process. Ok... So it might be ebuild.sh's fault.
A centralized POSIX jobserver (bug 537484) could help keep emerge --jobs under control. We can parse MAKEFLAGS and refuse to start a new job when a jobserver token allocation would block.