Gentoo Websites Logo
Go to: Gentoo Home Documentation Forums Lists Bugs Planet Store Wiki Get Gentoo!
Bug 49507 - cannot boot selfbuilt kernels with silo 1.4.4, but can with 1.3.* after installing from livecd
Summary: cannot boot selfbuilt kernels with silo 1.4.4, but can with 1.3.* after insta...
Status: RESOLVED FIXED
Alias: None
Product: Gentoo Linux
Classification: Unclassified
Component: [OLD] Core system (show other bugs)
Hardware: Sparc Linux
: High normal (vote)
Assignee: Sparc Porters
URL: http://forums.gentoo.org/viewtopic.ph...
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2004-04-30 08:28 UTC by Arvid de Jong
Modified: 2004-07-27 13:43 UTC (History)
0 users

See Also:
Package list:
Runtime testing required: ---


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description Arvid de Jong 2004-04-30 08:28:27 UTC
I just installed gentoo on my sparc64, and built my own kernel, 2.6.6-rc2. It wouldn't boot with silo-1.4.4 so I tried sparc-sources-2.4.26. Both gave me "Illegal Instruction". I found other people on the forums with the same problem and downgrading to silo 1.3.* works according to them. So I downgraded to silo-1.3.2-r1 and my 2.4-kernel booted. 2.6.6 paniced in a way it didnt with silo-1.4.4 :).

Reproducible: Couldn't Reproduce
Steps to Reproduce:
gentoo 2004.0 comes with silo 1.4.4
1. reboot: Illegal instruction
2. searched forums: emerge silo 1.3.2-r1
3. silo
/etc/silo.conf seems to be valid (ofcourse did I do this before too)
4. reboot: works
5. try to reproduce through upgrading again... doesnt give me an error, it just boots...




Ultra5 with 400MHz processor

Portage 2.0.50-r6 (default-sparc64-2004.0, gcc-3.3.3, glibc-2.3.2-r9, 2.4.
26-sparc)
=================================================================
System uname: 2.4.26-sparc sparc64 sun4u
Gentoo Base System version 1.4.9
distcc 2.13 sparc-unknown-linux-gnu (protocols 1 and 2) (default port 3632) 
[disabled]
Autoconf: sys-devel/autoconf-2.58-r1
Automake: sys-devel/automake-1.8.3
ACCEPT_KEYWORDS="sparc"
AUTOCLEAN="yes"
CFLAGS="-mcpu=ultrasparc -O3 -pipe"
CHOST="sparc-unknown-linux-gnu"
COMPILER="gcc3"
CONFIG_PROTECT="/etc /usr/kde/2/share/config /usr/kde/3/share/config 
/usr/share/config /var/qmail/control"
CONFIG_PROTECT_MASK="/etc/gconf /etc/terminfo /etc/env.d"
CXXFLAGS="-mcpu=ultrasparc -O3 -pipe"
DISTDIR="/usr/portage/distfiles"
FEATURES="ccache sandbox"
GENTOO_MIRRORS="http://ftp.snt.utwente.nl/pub/os/linux/gentoo/ ftp://sunsite.
informatik.rwth-aachen.de/pub/Linux/gentoo/ http://gentoo.oregonstate.edu http:
//www.ibiblio.org/pub/Linux/distributions/gentoo"
MAKEOPTS="-j5"
PKGDIR="/usr/portage/packages"
PORTAGE_TMPDIR="/var/tmp"
PORTDIR="/usr/portage"
PORTDIR_OVERLAY=""
SYNC="rsync://rsync.europe.gentoo.org/gentoo-portage"
USE="X apache2 arts avi berkdb crypt cups encode esd fbcon foomaticdb gdbm gif 
gtk imlib ipv6 jpeg kde libwww mad mikmod motif mpeg ncurses nls oggvorbis 
opengl oss pam pdflib perl png python qt readline sdl slang sparc spell ssl tcpd 
truetype xml2 xmms xv zlib"
Comment 1 Jason Wever (RETIRED) gentoo-dev 2004-05-01 08:44:33 UTC
Couple of questions;

What stage did you start with (including stage # and release version)?
Did you get the error before or after the SILO prompt?
Did you try power cycling the box before downgrading SILO?
Comment 2 Arvid de Jong 2004-05-01 09:29:30 UTC
I used stage3-sparc64-2004.1.tar.bz2
I got the error after the silo prompt. I got that, pressed enter, it said the kernelversion, and the error. That was it.
I didn't powercycle, I just typed 'reboot'.
Comment 3 Henry Bauer 2004-05-03 08:44:03 UTC
three Sun Ultra 2s:
2x300Mhz, 1 gig ram
2x300Mhz, 1 gig ram
1x300Mhz, 512 Mb ram

new install (disks came as boot-into-solaris-install), started with stage2 of 2004.0.  Built binary packages on one box then distributed to the other two.

Rebooted many, many times before downgrading, always the same illegal instruction error.  1.3.3 worked on the first try.

Other posts on this (including mine):
http://forums.gentoo.org/viewtopic.php?t=159221
Comment 4 Ciaran McCreesh 2004-05-03 16:26:51 UTC
What's the output of .registers immediately after the error?
Comment 5 Marcelo Bartsch 2004-05-05 07:04:19 UTC
My machine SUN ULTRA2 dual UltraSparcI 200MHZ Gentoo 2004.0 netboot image, using silo 1.4* Illegal instruction on boot, 1.3* boot perfects.

stage used for install was stage3-sparc64-2004.1.tar.bz2

I got the error after the silo prompt. I got that, pressed enter, it said the kernelversion, and the error. That was it.  I didn't powercycle, I just typed 'reboot'.


# uname -a
Linux talus 2.4.26-sparc-smp #1 SMP Mon May 3 19:29:48 CLT 2004 sparc64 sun4u TI UltraSparc I   (SpitFire) GNU/Linux

# cat /proc/cpuinfo
cpu             : TI UltraSparc I   (SpitFire)
fpu             : UltraSparc I integrated FPU
promlib         : Version 3 Revision 25
prom            : 3.25.0
type            : sun4u
ncpus probed    : 2
ncpus active    : 2
Cpu0Bogo        : 399.76
Cpu0ClkTck      : 000000000bebdae7
Cpu1Bogo        : 399.76
Cpu1ClkTck      : 000000000bebdae7
MMU Type        : Spitfire
State:
CPU0:           online
CPU1:           online

# emerge info
Portage 2.0.50-r6 (default-sparc64-2004.0, gcc-3.3.3, glibc-2.3.2-r9, 2.4.26-sparc-smp)
=================================================================
System uname: 2.4.26-sparc-smp sparc64 sun4u
Gentoo Base System version 1.4.9
Autoconf: sys-devel/autoconf-2.58-r1
Automake: sys-devel/automake-1.8.3
ACCEPT_KEYWORDS="sparc"
AUTOCLEAN="yes"
CFLAGS="-O2 -mcpu=ultrasparc"
CHOST="sparc-unknown-linux-gnu"
COMPILER="gcc3"
CONFIG_PROTECT="/etc /usr/X11R6/lib/X11/xkb /usr/kde/2/share/config /usr/kde/3/share/config /usr/share/config /var/qmail/control"
CONFIG_PROTECT_MASK="/etc/gconf /etc/terminfo /etc/env.d"
CXXFLAGS="-O2 -mcpu=ultrasparc"
DISTDIR="/usr/portage/distfiles"
FEATURES="ccache sandbox"
GENTOO_MIRRORS="http://mirror.datapipe.net/gentoo ftp://ftp.gtlib.cc.gatech.edu/pub/gentoo http://open-systems.ufl.edu/mirrors/gentoo"
MAKEOPTS="-j2"
PKGDIR="/usr/portage/packages"
PORTAGE_TMPDIR="/var/tmp"
PORTDIR="/usr/portage"
PORTDIR_OVERLAY=""
SYNC="rsync://rsync.gentoo.org/gentoo-portage"
USE="X arts avi berkdb crypt cups encode esd fbcon foomaticdb gdbm gif gnome gtk imlib jpeg kde ldap libwww mad mikmod motif mpeg mysql ncurses nls oggvorbis opengl oss pam pdflib perl png postgres python qt readline ruby sdl slang sparc spell ssl tcltk tcpd truetype xml2 xmms xv zlib"
Comment 6 jason wohlgemuth 2004-05-14 08:28:11 UTC
Sun Sparc Classic. OBP is the last one made for that model I think. I don't have the number at the moment. I'm going to downgrade to Silo 1.3x tonight.
Comment 7 Marcelo Bartsch 2004-05-17 09:50:31 UTC
Today i try to install OpenBSD on my ULTRA2, i dd the miniroot.fs to my swap device, i boot from my swap device, abort install, STOP-A , reboot and silo wont start, i powercycle the ULTRA2, and silo boot again.
Using this aproach i can reproduce the bug ALWAYS, so grab openbsd 3.5 miniroot dd into your swap partition and boot from it :)

here is the output from .register before a failed boot:
{0} ok .registers
        Normal          Alternate       MMU               Vector
0:                 0                0                0                0
1:                 0         fffda6c0                d              7f1
2:          f0000000         f0000000 800001fff00be08e         f0000000
3:          fffe0000                0         fefbe000                0
4:                16                0         fefbe000             4000
5:          f0023b44                f         fffda000          1b0e368
6:          fffeff38       4415001600 8000000067f620b6                e
7:          fffeefc8               4e                2         f00556a8
%PC  f002353c %nPC f0023540
%TBA f0000000 %CCR 44 XCC:nZvc   ICC:nZvc

After the failed boot:
{0} ok boot
Boot device: disk:a  File and args:
SILO Version 1.4.4
boot:
Allocated 8 Megs of memory at 0x40000000 for kernel
Loaded kernel version 2.4.26
Loading initial ramdisk (943694 bytes at 0x40C00000)...
Illegal Instruction
{0} ok .registers
        Normal          Alternate       MMU               Vector
0:                 0                0                0                0
1:          40004000         fffda6c0       4400001600              7f1
2:          40004000         f0000000               14         f0000000
3:            400000                0         fffe0000                0
4:            3a5400                0 ffffffffffffffff             4000
5:  e000000067408077                f         fffda000          1b0e368
6:                 0       8800001607 8000000067f620b6                e
7:               1f8               10                2         f00556a8
%PC  400044cc %nPC 400044d0
%TBA f0000000 %CCR 44 XCC:nZvc   ICC:nZvc

register after the machine was power-cycled and boots ok:

{0} ok .registers
        Normal          Alternate       MMU               Vector
0:                 0                0                0                0
1:          f005eaf0         fffda6c0       4400001e06              7f1
2:          f0000000         f0000000               14         f0000000
3:          fffe0000                7         fffe0000   844809c0210009
4:            3a09e0                7 ffffffffffffffff             4000
5:          f0032714                f         fffda000  200a02081200000
6:          fffefec0       4400001607 8000000067f620b6                e
7:          fffeefb8               4e                2         f00556a8
%PC  f0011520 %nPC f0008014
%TBA f0000000 %CCR 44 XCC:nZvc   ICC:nZvc
{0} ok
Comment 8 Joshua Kinard gentoo-dev 2004-05-18 02:38:18 UTC
This thread on the linux-sparc ML seems to relate to this bug:
http://marc.theaimsgroup.com/?l=linux-sparc&m=108329396131966&w=2
Comment 9 brianakee 2004-06-05 21:00:14 UTC
I was having kernel related booting problems on several Ultra5 systems. kernel versions included 2.4.x and 2.6.x.

problems were noticed with Silo 1.4.4/1.4.5
various kernel versions including 2.6.6 2.6.5 2.4.24 2.4.26
OpenBoot versions 3.19 3.25
errors included "kernel illegal instruction" and something like "Fast Data Access MMU Miss" (sorry, I did not record the last error message exactly).

Some kernels would boot on some (similar) machines, but not others. I upgraded the OBP to 3.31 (latest for Ultra5/10) and voila. All of the kernels the I have had trouble with are now booting without error. It may be worthwhile to try this if you are able to update your OpenBoot Prom.

All of the kernels tested were copied from a system that would not boot without the Install CD. Interestingly, I never had problem with the kernel on the CD-ROM. 

Hop this helps.

TM

Comment 10 Ciaran McCreesh 2004-07-27 13:43:00 UTC
Ben tells us that 1.4.8, which I just marked stable, is fixed. Please reopen if not.