Bug 159376 - udev 103 + root on RAID goes into infinite loop
Summary: udev 103 + root on RAID goes into infinite loop
Status: RESOLVED DUPLICATE of bug 151414
Alias: None
Product: Gentoo Linux
Classification: Unclassified
Component: [OLD] Core system
Hardware: x86 Linux
Importance: High major
Assignee: udev maintainers
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2006-12-29 07:33 UTC by ivo welch
Modified: 2007-01-26 20:48 UTC
CC List: 1 user

See Also:
Package list:
Runtime testing required: ---


Description ivo welch 2006-12-29 07:33:30 UTC
hi jakub+co:  I know I have reported too many bugs that turned out to be duplicates, but a search for "ALL udev" did not yield anything like the following:


All of a sudden, my celeron system would no longer boot. I am getting very strange udev messages, something about sockets vs. files, but they scroll past on my screen endlessly, so it is hard to tell. in effect, the system no longer boots. the same system did boot up earlier. udev however recently went to version 103, and I believe this was my first boot since then.

my first instinct: boot the gentoo live cd, chroot into the system, and rebuild udev. same error. ok, so I decided to install gentoo on another partition (/dev/hda4), where (after a day of work) it was finally up and running fine.

methinks: good, working again. I made a complete copy of the system onto the RAID device:
  tar cvfl - / | ( cd /mnt/md1 ; tar xpf -)
I compiled a kernel (this time not my usual vanilla, but a gentoo kernel) and set up grub correctly to boot from /dev/md1, all fine. reboot---the same darn scroll-forever error by udev. the system does not boot, even though nothing from my previous gentoo system is still on this new /dev/md1 gentoo build.  I would have thought the issue was hardware related, except here is the odd thing: the new system boots just fine when the root partition sits on /dev/hda4.

what can possibly be different between /dev/md1 and /dev/hda4, and what could possibly have become different?  the only difference seems to be that /dev/md1 is a RAID device. A quick check reveals that RAID support is still in the new kernel. once booted from /dev/hda4, this same kernel can mount /dev/md1 just fine, too.

so, I am now wondering whether udev 103 (which appeared on Dec 11; I had not rebooted since then) has trouble with RAID root devices. does anyone know? (if not, some of you may learn this, too, if you have to reboot! :-( ).

more info: grub boot line =

title gentoo-hd0-md1
root (hd0,0)
kernel /boot/vmlinuz root=/dev/md1 video=vesafb vga=0x31A

and from .config.gz (once the very same kernel is booted from /dev/hda4):

CONFIG_MD=y
CONFIG_BLK_DEV_MD=y
CONFIG_MD_LINEAR=y
CONFIG_MD_RAID0=y
CONFIG_MD_RAID1=y
# CONFIG_MD_RAID10 is not set
# CONFIG_MD_RAID456 is not set
# CONFIG_MD_MULTIPATH is not set
# CONFIG_MD_FAULTY is not set 
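
For readers who want to check the same thing on their own kernel, here is a minimal sketch that parses config text like the excerpt above and reports the md options. The function names, and the assumption that the array is RAID1, are hypothetical and not taken from the report. When root lives on an md device and no initramfs is used, these options generally need to be built in (=y) rather than built as modules.

```python
import re

# The md-related options as quoted in the report.
REPORTED_CONFIG = """\
CONFIG_MD=y
CONFIG_BLK_DEV_MD=y
CONFIG_MD_LINEAR=y
CONFIG_MD_RAID0=y
CONFIG_MD_RAID1=y
# CONFIG_MD_RAID10 is not set
# CONFIG_MD_RAID456 is not set
# CONFIG_MD_MULTIPATH is not set
# CONFIG_MD_FAULTY is not set
"""


def parse_kernel_config(text):
    """Return {option: value}; options marked 'is not set' map to 'n'."""
    options = {}
    for line in text.splitlines():
        line = line.strip()
        unset = re.match(r"# (CONFIG_\w+) is not set$", line)
        if unset:
            options[unset.group(1)] = "n"
            continue
        assigned = re.match(r"(CONFIG_\w+)=(.*)$", line)
        if assigned:
            options[assigned.group(1)] = assigned.group(2)
    return options


def md_root_prerequisites(options, level="RAID1"):
    """Look up the options an md root needs; 'level' is an assumption."""
    needed = ["CONFIG_MD", "CONFIG_BLK_DEV_MD", "CONFIG_MD_" + level]
    return {name: options.get(name, "n") for name in needed}


if __name__ == "__main__":
    opts = parse_kernel_config(REPORTED_CONFIG)
    for name, value in md_root_prerequisites(opts).items():
        print(name, "=", value)
```

For the reported config all three come back as "y", which is consistent with the observation that the same kernel can assemble and mount /dev/md1 once booted from /dev/hda4.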


A minor issue is that it would be nice if udev would sleep for a second after an error message during boot, or stop after it has output too many error messages.  I wish I could tell you exactly what the error says, but I can't figure it out---it scrolls by too quickly.

/iaw



Portage 2.1.1-r2 (default-linux/x86/2006.1/desktop, gcc-4.1.1, glibc-2.4-r3, 2.6.18-gentoo-r6 i686)
=================================================================
System uname: 2.6.18-gentoo-r6 i686 Intel(R) Celeron(R) CPU 2.00GHz
Gentoo Base System version 1.12.1
Last Sync: Thu, 28 Dec 2006 17:00:02 +0000
app-admin/eselect-compiler: [Not Present]
dev-java/java-config: [Not Present]
dev-lang/python: 2.4.3-r4
dev-python/pycrypto: 2.0.1-r5
dev-util/ccache: [Not Present]
dev-util/confcache: [Not Present]
sys-apps/sandbox: 1.2.17
sys-devel/autoconf: 2.13, 2.59-r7
sys-devel/automake: 1.4_p6, 1.5, 1.6.3, 1.7.9-r1, 1.8.5-r3, 1.9.6-r2
sys-devel/binutils: 2.16.1-r3
sys-devel/gcc-config: 1.3.13-r3
sys-devel/libtool: 1.5.22
virtual/os-headers: 2.6.11-r2
ACCEPT_KEYWORDS="x86"
AUTOCLEAN="yes"
CBUILD="i686-pc-linux-gnu"
CFLAGS="-O2 -march=i686 -fomit-frame-pointer"
CHOST="i686-pc-linux-gnu"
CONFIG_PROTECT="/etc"
CONFIG_PROTECT_MASK="/etc/env.d /etc/gconf /etc/revdep-rebuild /etc/terminfo"
CXXFLAGS="-O2 -march=i686 -fomit-frame-pointer"
DISTDIR="/usr/portage/distfiles"
FEATURES="autoconfig distlocks metadata-transfer sandbox sfperms strict"
GENTOO_MIRRORS="ftp://ftp.gtlib.cc.gatech.edu/pub/gentoo http://mirror.clarkson.edu/pub/distributions/gentoo ftp://gentoo.mirrors.pair.com ftp://mirror.nutsmaas.nl/gentoo/"
PKGDIR="/usr/portage/packages"
PORTAGE_RSYNC_OPTS="--recursive --links --safe-links --perms --times --compress --force --whole-file --delete --delete-after --stats --timeout=180 --exclude='/distfiles' --exclude='/local' --exclude='/packages'"
PORTAGE_TMPDIR="/var/tmp"
PORTDIR="/usr/portage"
SYNC="rsync://rsync.gentoo.org/gentoo-portage"
USE="x86 X aalib acpi alsa alsa_cards_ali5451 alsa_cards_als4000 alsa_cards_atiixp alsa_cards_atiixp-modem alsa_cards_bt87x alsa_cards_ca0106 alsa_cards_cmipci alsa_cards_emu10k1x alsa_cards_ens1370 alsa_cards_ens1371 alsa_cards_es1938 alsa_cards_es1968 alsa_cards_fm801 alsa_cards_hda-intel alsa_cards_intel8x0 alsa_cards_intel8x0m alsa_cards_maestro3 alsa_cards_trident alsa_cards_usb-audio alsa_cards_via82xx alsa_cards_via82xx-modem alsa_cards_ymfpci alsa_pcm_plugins_adpcm alsa_pcm_plugins_alaw alsa_pcm_plugins_asym alsa_pcm_plugins_copy alsa_pcm_plugins_dmix alsa_pcm_plugins_dshare alsa_pcm_plugins_dsnoop alsa_pcm_plugins_empty alsa_pcm_plugins_extplug alsa_pcm_plugins_file alsa_pcm_plugins_hooks alsa_pcm_plugins_iec958 alsa_pcm_plugins_ioplug alsa_pcm_plugins_ladspa alsa_pcm_plugins_lfloat alsa_pcm_plugins_linear alsa_pcm_plugins_meter alsa_pcm_plugins_mulaw alsa_pcm_plugins_multi alsa_pcm_plugins_null alsa_pcm_plugins_plug alsa_pcm_plugins_rate alsa_pcm_plugins_route alsa_pcm_plugins_share alsa_pcm_plugins_shm alsa_pcm_plugins_softvol apache2 arts berkdb bitmap-fonts bonobo bzip2 bzlib cairo calendar cdr cli cracklib crypt cups curl dbus directfb dlloader dri dv dvd dvdr eds elibc_glibc emacs emboss encode esd f77 fam fbcon firefox fortran ftp g77 gcj gd gdbm ggi gif gnome gphoto2 gpm gstreamer gtk gtkhtml guile hal hardened i386 iconv ieee1394 imagemagick imap imlib input_devices_evdev input_devices_keyboard input_devices_mouse ipv6 isdnlog java javascript jpeg kde kdeenablefinal kernel_linux ldap libedit libg++ libwww lm_sensors mad mbox mikmod mime mmx motif mozilla mp3 mpeg mval mysql nas ncurses nls nptl nptlonly ogg opengl oss pam pcre pdf perl png postgres ppds pppd python qt qt3 qt4 quicktime readline reflection ruby samba scanner sdl session slang snmp soap spell spl ssl symlink tcltk tcpd tetex threads tidy truetype truetype-fonts type1-fonts udev ungif unicode usb userland_GNU video_cards_video_cards_sis vorbis win32codecs xinerama xml xml2 xorg xv 
zlib"
Unset: CTARGET, EMERGE_DEFAULT_OPTS, INSTALL_MASK, LANG, LC_ALL, LDFLAGS, LINGUAS, MAKEOPTS, PORTAGE_RSYNC_EXTRA_OPTS, PORTDIR_OVERLAY
_________________
Comment 1 Matthias Schwarzott gentoo-dev 2007-01-17 13:24:13 UTC
Thing I saw when looking at udev-104 is:
persistent-storage rules now skip md-devices:

In /etc/udev/rules.d/60-persistent-storage.rules:

 # skip rules for inappropriate block devices
-KERNEL=="ram*|loop*|fd*|nbd*|gnbd*|dm-*", GOTO="persistent_storage_end"
+KERNEL=="ram*|loop*|fd*|nbd*|gnbd*|dm-*|md*", GOTO="persistent_storage_end"
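
To illustrate what the changed rule does, here is a rough sketch that approximates the KERNEL match with Python's fnmatch. udev has its own matcher, so this is only an approximation of its glob handling with "|" separating alternatives; the helper name and the sample device names are chosen for illustration.

```python
from fnmatch import fnmatchcase

# Patterns copied from the two versions of the rule quoted above.
OLD_PATTERN = "ram*|loop*|fd*|nbd*|gnbd*|dm-*"
NEW_PATTERN = "ram*|loop*|fd*|nbd*|gnbd*|dm-*|md*"


def kernel_matches(pattern, name):
    """Approximate a udev KERNEL== match: any '|' alternative may match."""
    return any(fnmatchcase(name, alt) for alt in pattern.split("|"))


if __name__ == "__main__":
    for dev in ("md1", "hda4", "dm-0"):
        print(dev,
              "old:", kernel_matches(OLD_PATTERN, dev),
              "new:", kernel_matches(NEW_PATTERN, dev))
```

With the new pattern, md1 matches and jumps to persistent_storage_end, so the persistent-storage rules are skipped for it; hda4 matches neither pattern and is processed as before.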
Comment 2 ivo welch 2007-01-19 15:25:05 UTC
(In reply to comment #1)
> Thing I saw when looking at udev-104 is:
> persistent-storage rules now skip md-devices:
> 
> In /etc/udev/rules.d/60-persistent-storage.rules:
> 
>  # skip rules for inappropriate block devices
> -KERNEL=="ram*|loop*|fd*|nbd*|gnbd*|dm-*", GOTO="persistent_storage_end"
> +KERNEL=="ram*|loop*|fd*|nbd*|gnbd*|dm-*|md*", GOTO="persistent_storage_end"
> 

Tried it, but this is not the problem.  I did manage to see more of the output before the machine reboots (a control-s pauses the scrolling for a while, but then it reboots anyway):

get_ctrl_msg: unable to receive user udevd message: socket operation on non-socket
get_netlink_msg: unable to receive kernel netlink message: socket operation

I also tried out whether a new kernel (2.6.19.2) would fix it, but it did not.
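
As background on the messages quoted above: "socket operation on non-socket" is the text for the ENOTSOCK error, which a receive call returns when the file descriptor it is given does not refer to a socket. The sketch below reproduces that error in isolation by wrapping a regular file's descriptor as if it were a socket. It only shows what the error means; it does not claim to explain why udevd's descriptors ended up in that state, and the function name is made up for the example.

```python
import errno
import os
import socket
import tempfile


def recv_on_regular_file():
    """Try to receive from a descriptor that is a plain file, not a socket.

    Returns the errno of the resulting OSError, or None if no error occurred.
    """
    with tempfile.TemporaryFile() as f:
        fd = os.dup(f.fileno())  # separate descriptor for the fake socket
        try:
            sock = socket.socket(socket.AF_UNIX, socket.SOCK_DGRAM, 0,
                                 fileno=fd)
        except OSError as exc:
            # Some Python versions already reject the descriptor here.
            os.close(fd)
            return exc.errno
        try:
            sock.recv(16)
        except OSError as exc:
            return exc.errno
        finally:
            sock.close()  # also closes fd
    return None


if __name__ == "__main__":
    code = recv_on_regular_file()
    print(code == errno.ENOTSOCK, os.strerror(errno.ENOTSOCK))
```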
Comment 3 Jakub Moc (RETIRED) gentoo-dev 2007-01-26 20:48:42 UTC

*** This bug has been marked as a duplicate of bug 151414 ***