heartbeat requires the directory /var/run/heartbeat in order to create a file named register. Because this directory does not exist by default (and is removed again at every boot), it should, in my opinion, be created by /etc/init.d/heartbeat. If it is missing, heartbeat kills itself shortly after starting.

I suggest adding a statement like

checkpath -d -m 0755 -o root:root /var/run/heartbeat

to the start() function.

Reproducible: Always

Steps to Reproduce:
1. install heartbeat
2. configure it
3. start it

Actual Results:
heartbeat aborts

Expected Results:
heartbeat runs

/var/log/messages excerpt for the failing case:
----------------------- cut ------------------------------
2012-10-15T09:48:06.761660+02:00 slswn8 /etc/init.d/heartbeat[25259]: Use of the opts variable is deprecated and will be
2012-10-15T09:48:06.763134+02:00 slswn8 /etc/init.d/heartbeat[25260]: removed in the future.
2012-10-15T09:48:06.764672+02:00 slswn8 /etc/init.d/heartbeat[25261]: Please use extra_commands, extra_started_commands or extra_stopped_commands.
2012-10-15T09:48:06.822610+02:00 slswn8 heartbeat: [25306]: WARN: Core dumps could be lost if multiple dumps occur.
2012-10-15T09:48:06.822673+02:00 slswn8 heartbeat: [25306]: WARN: Consider setting non-default value in /proc/sys/kernel/core_pattern (or equivalent) for maximum supportability
2012-10-15T09:48:06.822713+02:00 slswn8 heartbeat: [25306]: WARN: Consider setting /proc/sys/kernel/core_uses_pid (or equivalent) to 1 for maximum supportability
2012-10-15T09:48:06.822749+02:00 slswn8 heartbeat: [25306]: info: Pacemaker support: false
2012-10-15T09:48:06.829318+02:00 slswn8 heartbeat: [25306]: WARN: Logging daemon is disabled --enabling logging daemon is recommended
2012-10-15T09:48:06.829331+02:00 slswn8 heartbeat: [25306]: info: **************************
2012-10-15T09:48:06.829346+02:00 slswn8 heartbeat: [25306]: info: Configuration validated. Starting heartbeat 3.0.5
2012-10-15T09:48:06.830189+02:00 slswn8 heartbeat: [25307]: info: heartbeat: version 3.0.5
2012-10-15T09:48:06.830334+02:00 slswn8 heartbeat: [25307]: WARN: No Previous generation - starting at 1350287287
2012-10-15T09:48:06.941949+02:00 slswn8 heartbeat: [25307]: info: Heartbeat generation: 1350287287
2012-10-15T09:48:06.941973+02:00 slswn8 heartbeat: [25307]: info: No uuid found for current node - generating a new uuid.
2012-10-15T09:48:06.969361+02:00 slswn8 heartbeat: [25307]: info: Creating FIFO /var/lib/heartbeat/fifo.
2012-10-15T09:48:06.972583+02:00 slswn8 heartbeat: [25307]: info: glib: UDP Broadcast heartbeat started on port 694 (694) interface bond0
2012-10-15T09:48:06.972952+02:00 slswn8 heartbeat: [25307]: info: glib: UDP Broadcast heartbeat closed on port 694 interface bond0 - Status: 1
2012-10-15T09:48:06.980360+02:00 slswn8 heartbeat: [25307]: info: Local status now set to: 'up'
2012-10-15T09:48:06.980594+02:00 slswn8 heartbeat: [25307]: info: Link slswn8:bond0 up.
2012-10-15T09:48:07.481736+02:00 slswn8 heartbeat: [25307]: info: Link slswn7:bond0 up.
2012-10-15T09:48:07.983100+02:00 slswn8 heartbeat: [25307]: WARN: Late heartbeat: Node slswn8: interval 1010 ms
2012-10-15T09:48:07.983129+02:00 slswn8 heartbeat: [25307]: info: Status update for node slswn7: status active
2012-10-15T09:48:07.983812+02:00 slswn8 heartbeat: [25307]: info: Comm_now_up(): updating status to active
2012-10-15T09:48:07.983865+02:00 slswn8 heartbeat: [25307]: info: Local status now set to: 'active'
2012-10-15T09:48:07.984081+02:00 slswn8 heartbeat: [25307]: ERROR: socket_wait_conn_new: trying to create in /var/run/heartbeat/register bind:: No such file or directory
2012-10-15T09:48:08.009077+02:00 slswn8 harc[25316]: info: Running /etc/ha.d//rc.d/status status
2012-10-15T09:48:08.980643+02:00 slswn8 heartbeat: [25311]: CRIT: Emergency Shutdown: Master Control process died.
2012-10-15T09:48:08.980662+02:00 slswn8 heartbeat: [25311]: CRIT: Killing pid 25307 with SIGTERM
2012-10-15T09:48:08.980672+02:00 slswn8 heartbeat: [25311]: CRIT: Killing pid 25312 with SIGTERM
2012-10-15T09:48:08.980681+02:00 slswn8 heartbeat: [25311]: CRIT: Killing pid 25313 with SIGTERM
2012-10-15T09:48:08.980688+02:00 slswn8 heartbeat: [25311]: CRIT: Emergency Shutdown(MCP dead): Killing ourselves.
2012-10-15T09:48:56.970581+02:00 slswn8 /etc/init.d/heartbeat[25343]: Use of the opts variable is deprecated and will be
2012-10-15T09:48:56.972028+02:00 slswn8 /etc/init.d/heartbeat[25344]: removed in the future.
2012-10-15T09:48:56.973455+02:00 slswn8 /etc/init.d/heartbeat[25345]: Please use extra_commands, extra_started_commands or extra_stopped_commands.
----------------------- cut ------------------------------

emerge --info:
----------------------- cut ------------------------------
Portage 2.1.11.9 (default/linux/amd64/10.0, gcc-4.5.4, glibc-2.15-r2, 3.4.9-gentoo x86_64)
=================================================================
System uname: Linux-3.4.9-gentoo-x86_64-Intel-R-_Xeon-R-_CPU_E5520_@_2.27GHz-with-gentoo-2.1
Timestamp of tree: Thu, 11 Oct 2012 00:45:01 +0000
app-shells/bash: 4.2_p37
dev-lang/python: 2.7.3-r2, 3.2.3
dev-util/cmake: 2.8.9
dev-util/pkgconfig: 0.27.1
sys-apps/baselayout: 2.1-r1
sys-apps/openrc: 0.9.8.4
sys-apps/sandbox: 2.5
sys-devel/autoconf: 2.68
sys-devel/automake: 1.11.6
sys-devel/binutils: 2.22-r1
sys-devel/gcc: 4.5.4
sys-devel/gcc-config: 1.7.3
sys-devel/libtool: 2.4-r1
sys-devel/make: 3.82-r3
sys-kernel/linux-headers: 3.4-r2 (virtual/os-headers)
sys-libs/glibc: 2.15-r2
Repositories: gentoo
ACCEPT_KEYWORDS="amd64"
ACCEPT_LICENSE="* -@EULA"
CBUILD="x86_64-pc-linux-gnu"
CFLAGS="-O2 -pipe"
CHOST="x86_64-pc-linux-gnu"
CONFIG_PROTECT="/etc"
CONFIG_PROTECT_MASK="/etc/ca-certificates.conf /etc/env.d /etc/gconf /etc/gentoo-release /etc/php/apache2-php5.4/ext-active/ /etc/php/cgi-php5.4/ext-active/ /etc/php/cli-php5.4/ext-active/ /etc/revdep-rebuild /etc/sandbox.d /etc/terminfo"
CXXFLAGS="-O2 -pipe"
DISTDIR="/usr/portage/distfiles"
FCFLAGS="-O2 -pipe"
FEATURES="assume-digests binpkg-logs config-protect-if-modified distlocks ebuild-locks fixlafiles news parallel-fetch parse-eapi-ebuild-head protect-owned sandbox sfperms strict unknown-features-warn unmerge-logs unmerge-orphans userfetch"
FFLAGS="-O2 -pipe"
GENTOO_MIRRORS="http://distfiles.gentoo.org"
LDFLAGS="-Wl,-O1 -Wl,--as-needed"
LINGUAS="de en"
MAKEOPTS="-j9"
PKGDIR="/usr/portage/packages"
PORTAGE_CONFIGROOT="/"
PORTAGE_RSYNC_OPTS="--recursive --links --safe-links --perms --times --compress --force --whole-file --delete --stats --human-readable --timeout=180 --exclude=/distfiles --exclude=/local --exclude=/packages"
PORTAGE_TMPDIR="/var/tmp"
PORTDIR="/usr/portage"
PORTDIR_OVERLAY=""
SYNC="rsync://rsync.gentoo.org/gentoo-portage"
USE="acl amd64 apache2 berkdb bzip2 cli cracklib crypt cups cxx dri fortran gdbm iconv mmx modules mudflap multilib mysql ncurses nls nptl openmp pam pcre pppd readline session sse sse2 ssl tcpd unicode zlib" ALSA_CARDS="ali5451 als4000 atiixp atiixp-modem bt87x ca0106 cmipci emu10k1x ens1370 ens1371 es1938 es1968 fm801 hda-intel intel8x0 intel8x0m maestro3 trident usb-audio via82xx via82xx-modem ymfpci" ALSA_PCM_PLUGINS="adpcm alaw asym copy dmix dshare dsnoop empty extplug file hooks iec958 ioplug ladspa lfloat linear meter mmap_emul mulaw multi null plug rate route share shm softvol" APACHE2_MODULES="actions alias auth_basic authn_alias authn_anon authn_dbm authn_default authn_file authz_dbm authz_default authz_groupfile authz_host authz_owner authz_user autoindex cache cgi cgid dav dav_fs dav_lock deflate dir disk_cache env expires ext_filter file_cache filter headers include info log_config logio mem_cache mime mime_magic negotiation rewrite setenvif speling status unique_id userdir usertrack vhost_alias" CALLIGRA_FEATURES="kexi words flow plan sheets stage tables krita karbon braindump" CAMERAS="ptp2" COLLECTD_PLUGINS="df interface irq load memory rrdtool swap syslog" ELIBC="glibc" GPSD_PROTOCOLS="ashtech aivdm earthmate evermore fv18 garmin garmintxt gpsclock itrax mtk3301 nmea ntrip navcom oceanserver oldstyle oncore rtcm104v2 rtcm104v3 sirf superstar2 timing tsip tripmate tnt ubx" INPUT_DEVICES="keyboard mouse evdev" KERNEL="linux" LCD_DEVICES="bayrad cfontz cfontz633 glk hd44780 lb216 lcdm001 mtxorb ncurses text" LIBREOFFICE_EXTENSIONS="presenter-console presenter-minimizer" LINGUAS="de en" PHP_TARGETS="php5-3" PYTHON_TARGETS="python3_2 python2_7" RUBY_TARGETS="ruby18 ruby19" USERLAND="GNU" VIDEO_CARDS="fbdev glint intel mach64 mga neomagic nouveau nv r128 radeon savage sis tdfx trident vesa via vmware dummy v4l" XTABLES_ADDONS="quota2 psd pknock lscan length2 ipv4options ipset ipp2p iface geoip fuzzy condition tee tarpit sysrq steal rawnat logmark ipmark dhcpmac delude chaos account"
Unset: CPPFLAGS, CTARGET, EMERGE_DEFAULT_OPTS, INSTALL_MASK, LANG, LC_ALL, PORTAGE_BUNZIP2_COMMAND, PORTAGE_COMPRESS, PORTAGE_COMPRESS_FLAGS, PORTAGE_RSYNC_EXTRA_OPTS, USE_PYTHON
----------------------- cut ------------------------------
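
For illustration, here is a minimal sketch of how the suggested directory creation could be wrapped so that it runs before heartbeat is launched. The helper name ensure_dir is hypothetical and is not part of the shipped init script; the fallback branch (mkdir/chmod/chown) is an assumption added only so the sketch also runs on a machine without OpenRC's checkpath.

```shell
#!/bin/sh
# Sketch only: create a runtime directory before the daemon starts.
# ensure_dir is a hypothetical helper name, not part of the heartbeat init script.
# Usage: ensure_dir MODE OWNER:GROUP PATH
ensure_dir() {
    mode=$1 owner=$2 path=$3
    if command -v checkpath >/dev/null 2>&1; then
        # OpenRC helper, same kind of call as suggested in this report.
        checkpath -q -d -m "$mode" -o "$owner" "$path"
    else
        # Fallback for systems without OpenRC (assumption, for illustration only).
        mkdir -p "$path" && chmod "$mode" "$path" || return 1
        # Changing ownership needs root; skip it otherwise.
        if [ "$(id -u)" -eq 0 ]; then
            chown "$owner" "$path" || return 1
        fi
    fi
}

# In the init script's start(), this would run before heartbeat is launched:
#   ensure_dir 0755 root:root /var/run/heartbeat
```

Because /var/run is emptied at boot, the call has to run on every start, not once at install time, which is why start() seems the natural place for it.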
Indeed, it's fixed in tree, thanks a lot Petric.

+*heartbeat-3.0.5-r1 (29 Oct 2012)
+
+  29 Oct 2012; Ultrabug <ultrabug@gentoo.org> +heartbeat-3.0.5-r1.ebuild,
+  files/heartbeat-init:
+  Fix init script vital directory creation wrt #438672 thx to Petric Frank.
+
Please reopen this issue. The following addition is required on a heartbeat/pacemaker setup:

checkpath -q -d -m 0750 -o hacluster:hacluster /var/run/heartbeat/ccm
checkpath -q -d -m 0750 -o hacluster:haclient /var/run/crm
(In reply to comment #2)
> Please reopen this issue. The following addition is required on a
> heartbeat/pacemaker setup:
>
> checkpath -q -d -m 0750 -o hacluster:hacluster /var/run/heartbeat/ccm
> checkpath -q -d -m 0750 -o hacluster:haclient /var/run/crm

You mean in the pacemaker init script then, right?
Please reopen this issue. A basic two-node cluster requires another directory:

ndadm@ndspc047:~ $ sudo /etc/ha.d/resource.d/IPaddr 192.168.20.48 start
IPaddr[10503]: INFO: Using calculated nic for 192.168.20.48: eth0
IPaddr[10503]: INFO: Using calculated netmask for 192.168.20.48: 255.255.252.0
IPaddr[10503]: DEBUG: Using calculated broadcast for 192.168.20.48: 192.168.23.255
touch: cannot touch '/var/run/resource-agents/IPaddr-eth0': No such file or directory
IPaddr[10489]: ERROR: Generic error
ERROR: Generic error

Please add

checkpath -q -d -m 0755 -o root:root /var/run/resource-agents

to start() in the heartbeat init script.
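
To see quickly which of the runtime directories mentioned in this report are absent on a given node, a small read-only check along these lines might help. The function name missing_dirs is hypothetical; it changes nothing on disk and only prints the paths that do not exist.

```shell
#!/bin/sh
# Sketch only: report which runtime directories from this bug are absent.
# missing_dirs is a hypothetical helper; it prints one missing path per line.
missing_dirs() {
    for dir in "$@"; do
        [ -d "$dir" ] || printf '%s\n' "$dir"
    done
}

# Example call with the paths named in this report:
#   missing_dirs /var/run/heartbeat /var/run/heartbeat/ccm /var/run/crm /var/run/resource-agents
```

Empty output means every listed directory is present; any path that is printed would still need a checkpath line in the relevant init script.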
I have this issue too, with /var/run/resource-agents and IPaddr.