Bug 255235 - sys-kernel/gentoo-sources - tulip driver: NETDEV WATCHDOG: eth0 (tulip): transmit timed out
Status: RESOLVED NEEDINFO
Alias: None
Product: Gentoo Linux
Classification: Unclassified
Component: [OLD] Core system (show other bugs)
Hardware: Sparc Linux
Importance: High normal
Assignee: Gentoo Kernel Bug Wranglers and Kernel Maintainers
URL: http://marc.info/?l=linux-sparc&m=123...
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2009-01-17 04:16 UTC by Jim Gifford
Modified: 2009-11-20 23:38 UTC (History)
3 users (show)

See Also:
Package list:
Runtime testing required: ---


Attachments
kernel config file (2.6.32-rc7.config,38.98 KB, text/plain)
2009-11-20 23:35 UTC, Lasse Jansson
Details

Description Jim Gifford 2009-01-17 04:16:30 UTC
sys-kernel/gentoo-sources-2.6.27-r2

The tulip driver crashes; it appears to be a sched conflict. I have attempted to report this upstream several times with no response.

[   80.042203] EXT3 FS on hda2, internal journal
[   80.952247] Adding 1510088k swap on /dev/hda4.  Priority:-1 extents:1 across:1510088k
[131219.643283] ------------[ cut here ]------------
[131219.643314] WARNING: at net/sched/sch_generic.c:219 dev_watchdog+0x250/0x280()
[131219.643327] NETDEV WATCHDOG: eth0 (tulip): transmit timed out
[131219.643337] Modules linked in: openpromfs tulip
[131219.643353] Call Trace:
[131219.643369]  [00000000005ecf10] dev_watchdog+0x250/0x280
[131219.643392]  [000000000045f64c] run_timer_softirq+0x16c/0x1e0
[131219.643409]  [000000000045ab7c] __do_softirq+0x5c/0xe0
[131219.643435]  [000000000042e3f0] do_softirq+0x70/0xa0
[131219.643450]  [000000000045a764] irq_exit+0x84/0xc0
[131219.643466]  [000000000043327c] timer_interrupt+0x5c/0xc0
[131219.643484]  [00000000004209d4] tl0_irq14+0x14/0x20
[131219.643499]  [0000000000428050] cpu_idle+0x30/0xc0
[131219.643522]  [000000000070c8bc] start_kernel+0x27c/0x2c0
[131219.643538]  [0000000000643744] tlb_fixup_done+0x80/0x9c
[131219.643550]  [0000000000000000] 0x0
[131219.643560] ---[ end trace b95673019322adce ]---
[131219.644916] 0000:00:0c.0: tulip_stop_rxtx() failed (CSR5 0xfc508100 CSR6 0x20ec000)
[131227.640686] 0000:00:0c.0: tulip_stop_rxtx() failed (CSR5 0xfc508100



Reproducible: Always

Steps to Reproduce:
The system just has to transfer data over the network. The crash happens after anywhere between 1 and 14 hours.



Actual Results:  
I use the machine as a mail server. When a big burst of traffic comes in, the system locks up the eth0 interface and sometimes eth1, which has a lighter traffic load. When the issue appears, the log shows the output above.

The load on the two interfaces differs by about 3 to 1.

This error fills up the log files:
[131219.644916] 0000:00:0c.0: tulip_stop_rxtx() failed (CSR5 0xfc508100 CSR6 0x20ec000)
[131227.640686] 0000:00:0c.0: tulip_stop_rxtx() failed (CSR5 0xfc508100 CSR6 0x20ec000)
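
To quantify how quickly these messages fill the log, the repeated failures can be pulled out of a saved dmesg dump and the gap between them measured. The following is only a rough sketch: the function names are made up for illustration, and the sample text is copied from the log above.

```python
import re

# Matches lines such as:
# [131219.644916] 0000:00:0c.0: tulip_stop_rxtx() failed (CSR5 0xfc508100 CSR6 0x20ec000)
# \s+ before "CSR6" also tolerates a line that was wrapped at that point.
STOP_RXTX = re.compile(
    r"\[\s*(?P<ts>\d+\.\d+)\]\s+(?P<dev>\S+): tulip_stop_rxtx\(\) failed "
    r"\(CSR5 (?P<csr5>0x[0-9a-f]+)\s+CSR6 (?P<csr6>0x[0-9a-f]+)\)"
)


def stop_rxtx_events(log_text):
    """Return (timestamp, device, csr5, csr6) for every tulip_stop_rxtx failure."""
    return [
        (float(m["ts"]), m["dev"], int(m["csr5"], 16), int(m["csr6"], 16))
        for m in STOP_RXTX.finditer(log_text)
    ]


def repeat_intervals(events):
    """Return the seconds between consecutive failures, rounded to milliseconds."""
    times = [event[0] for event in events]
    return [round(later - earlier, 3) for earlier, later in zip(times, times[1:])]


sample = """\
[131219.644916] 0000:00:0c.0: tulip_stop_rxtx() failed (CSR5 0xfc508100 CSR6 0x20ec000)
[131227.640686] 0000:00:0c.0: tulip_stop_rxtx() failed (CSR5 0xfc508100 CSR6 0x20ec000)
"""
events = stop_rxtx_events(sample)
print(repeat_intervals(events))
```

On the two lines shown, the message repeats roughly every 8 seconds.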


Expected Results:  
No errors and no system lockup.


System is a Netra X1, using base config.

Portage 2.1.4.5 (default/linux/sparc/2008.0, gcc-4.1.2, glibc-2.6.1-r0, 2.6.27-gentoo-r2 sparc64)
=================================================================
System uname: 2.6.27-gentoo-r2 sparc64 sun4u
Timestamp of tree: Sun, 11 Jan 2009 11:00:01 +0000
distcc 2.18.3 sparc-unknown-linux-gnu (protocols 1 and 2) (default port 3632) [disabled]
ccache version 2.4 [disabled]
app-shells/bash:     3.2_p33
dev-lang/python:     2.5.2-r7
dev-util/ccache:     2.4-r7
sys-apps/baselayout: 1.12.11.1
sys-apps/sandbox:    1.2.18.1-r2
sys-devel/autoconf:  2.63
sys-devel/automake:  1.10.2
sys-devel/binutils:  2.18-r3
sys-devel/gcc-config: 1.4.0-r4
sys-devel/libtool:   1.5.26
virtual/os-headers:  2.6.27-r2
ACCEPT_KEYWORDS="sparc"
CBUILD="sparc-unknown-linux-gnu"
CFLAGS="-O2 -mcpu=ultrasparc -pipe"
CHOST="sparc-unknown-linux-gnu"
CONFIG_PROTECT="/etc"
CONFIG_PROTECT_MASK="/etc/ca-certificates.conf /etc/env.d /etc/gconf /etc/php/apache2-php5/ext-active/ /etc/php/cgi-php5/ext-active/ /etc/php/cli-php5/ext-active/ /etc/revdep-rebuild /etc/terminfo /etc/udev/rules.d"
CXXFLAGS="-O2 -mcpu=ultrasparc -pipe"
DISTDIR="/usr/portage/distfiles"
FEATURES="distlocks metadata-transfer sandbox sfperms strict unmerge-orphans userfetch"
GENTOO_MIRRORS="http://distfiles.gentoo.org http://distro.ibiblio.org/pub/linux/distributions/gentoo"
LDFLAGS="-Wl,-O1"
PKGDIR="/usr/portage/packages"
PORTAGE_RSYNC_OPTS="--recursive --links --safe-links --perms --times --compress --force --whole-file --delete --stats --timeout=180 --exclude=/distfiles --exclude=/local --exclude=/packages"
PORTAGE_TMPDIR="/var/tmp"
PORTDIR="/usr/portage"
SYNC="rsync://rsync.jg555.com/gentoo-portage"
USE="acl apache2 berkdb bzip2 clamav cli cracklib crypt ctype cups dri exiscan-acl fortran gcc64 gdbm gpm iconv ipv6 isdnlog maildir midi mudflap mysql ncurses nls nptl nptlonly openmp pam pcre perl php pop3d pppd python readline reflection session sparc spf spl srs ssl symlink sysfs tcpd unicode xml xorg zlib" ALSA_PCM_PLUGINS="adpcm alaw asym copy dmix dshare dsnoop empty extplug file hooks iec958 ioplug ladspa lfloat linear meter mmap_emul mulaw multi null plug rate route share shm softvol" APACHE2_MODULES="actions alias auth_basic auth_digest authn_anon authn_dbd authn_dbm authn_default authn_file authz_dbm authz_default authz_groupfile authz_host authz_owner authz_user autoindex cache dav dav_fs dav_lock dbd deflate dir disk_cache env expires ext_filter file_cache filter headers ident imagemap include info log_config logio mem_cache mime mime_magic negotiation proxy proxy_ajp proxy_balancer proxy_connect proxy_http rewrite setenvif so speling status unique_id userdir usertrack vhost_alias" ELIBC="glibc" INPUT_DEVICES="keyboard mouse evdev" KERNEL="linux" LCD_DEVICES="bayrad cfontz cfontz633 glk hd44780 lb216 lcdm001 mtxorb ncurses text" USERLAND="GNU" VIDEO_CARDS="fbdev glint mach64 mga r128 radeon sunbw2 suncg14 suncg3 suncg6 sunffb sunleo tdfx voodoo"
Unset:  CPPFLAGS, CTARGET, EMERGE_DEFAULT_OPTS, FFLAGS, INSTALL_MASK, LANG, LC_ALL, LINGUAS, MAKEOPTS, PORTAGE_COMPRESS, PORTAGE_COMPRESS_FLAGS, PORTAGE_RSYNC_EXTRA_OPTS, PORTDIR_OVERLAY
Comment 1 Jim Gifford 2009-01-17 05:48:52 UTC
Error just occurred again.
[   80.646694] Adding 1510088k swap on /dev/hda4.  Priority:-1 extents:1 across:1510088k
[  976.843639] ------------[ cut here ]------------
[  976.843674] WARNING: at net/sched/sch_generic.c:219 dev_watchdog+0x250/0x280()
[  976.843687] NETDEV WATCHDOG: eth0 (tulip): transmit timed out
[  976.843697] Modules linked in: openpromfs tulip
[  976.843713] Call Trace:
[  976.843730]  [00000000005ecf10] dev_watchdog+0x250/0x280
[  976.843753]  [000000000045f64c] run_timer_softirq+0x16c/0x1e0
[  976.843770]  [000000000045ab7c] __do_softirq+0x5c/0xe0
[  976.843797]  [000000000042e3f0] do_softirq+0x70/0xa0
[  976.843813]  [000000000045a764] irq_exit+0x84/0xc0
[  976.843829]  [000000000043327c] timer_interrupt+0x5c/0xc0
[  976.843848]  [00000000004209d4] tl0_irq14+0x14/0x20
[  976.843863]  [000000000042804c] cpu_idle+0x2c/0xc0
[  976.843886]  [000000000070c8bc] start_kernel+0x27c/0x2c0
[  976.843901]  [0000000000643744] tlb_fixup_done+0x80/0x9c
[  976.843914]  [0000000000000000] 0x0
[  976.843923] ---[ end trace 092bbeaef2cbf0f9 ]---
[  976.845279] 0000:00:0c.0: tulip_stop_rxtx() failed (CSR5 0xfc508100 CSR6 0x20e0000)
[  984.840920] 0000:00:0c.0: tulip_stop_rxtx() failed (CSR5 0xfc508100 CSR6 0x20e0000)
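
Since the time to failure varies so much between boots, it may help to extract the uptime at each watchdog timeout from the saved logs. This is a minimal sketch, assuming the dmesg output has been saved as text; the helper names are hypothetical and the sample lines are taken from the two traces posted so far.

```python
import re

# Matches lines such as:
# [  976.843687] NETDEV WATCHDOG: eth0 (tulip): transmit timed out
WATCHDOG = re.compile(
    r"\[\s*(?P<uptime>\d+\.\d+)\]\s+NETDEV WATCHDOG: "
    r"(?P<iface>\S+) \((?P<driver>[^)]+)\): transmit timed out"
)


def watchdog_timeouts(log_text):
    """Yield (uptime_seconds, interface, driver) for each watchdog timeout."""
    for m in WATCHDOG.finditer(log_text):
        yield float(m["uptime"]), m["iface"], m["driver"]


def format_uptime(seconds):
    """Render a dmesg timestamp (seconds since boot) as hours, minutes, seconds."""
    hours, rem = divmod(int(seconds), 3600)
    minutes, secs = divmod(rem, 60)
    return f"{hours}h{minutes:02d}m{secs:02d}s"


sample = """\
[  976.843674] WARNING: at net/sched/sch_generic.c:219 dev_watchdog+0x250/0x280()
[  976.843687] NETDEV WATCHDOG: eth0 (tulip): transmit timed out
[131219.643327] NETDEV WATCHDOG: eth0 (tulip): transmit timed out
"""
for uptime, iface, driver in watchdog_timeouts(sample):
    print(iface, driver, format_uptime(uptime))
```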
Comment 2 Wormo (RETIRED) gentoo-dev 2009-01-17 07:38:36 UTC
Could you provide links to the upstream reports? Also, does this bug appear with all kernels you have tried, or do you know of an earlier version that worked properly?
Comment 3 Jim Gifford 2009-01-17 07:44:42 UTC
Started with the 2.6.18 sources.

In the thread, they confirmed it was an issue, but said it should have been fixed in a later kernel version.

http://marc.info/?l=linux-sparc&m=122305562826696&w=2

At one time I even bought a replacement X1, and it has the same issue.
Comment 4 Wormo (RETIRED) gentoo-dev 2009-01-17 07:54:02 UTC
Have you ever tried a 2.6.18 kernel to see if it actually works for you? Or are you going by what the other guy said, who was guessing that yours was the same problem?
Comment 5 Jeroen Roovers (RETIRED) gentoo-dev 2009-01-17 16:19:10 UTC
Did you replace the tulip NIC at any point? Did you try any other kernel versions?
Comment 6 Jim Gifford 2009-01-17 18:31:38 UTC
The NIC is built onto the board. I have another Netra X1 with the same issue. It was running Solaris with no issues before I switched it to Gentoo. It is showing the same symptoms.
Comment 7 Wormo (RETIRED) gentoo-dev 2009-01-17 19:01:54 UTC
Yes, that's a good sign it's not just a failed card. So does kernel 2.6.18 really work properly -- in which case it should be possible to use git-bisect to find the exact patch that started the problems? That could be a good starting point for trying to fix this.
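
For anyone unfamiliar with the idea behind git-bisect, it is a binary search between a known-good and a known-bad version. The sketch below illustrates only that search. It is not git itself, it works on a plain list rather than on commits, and the release list and the "2.6.23" breakage point are invented purely for the example; nothing in this bug says where the regression actually is.

```python
def first_bad(candidates, is_bad):
    """Return the first entry for which is_bad() is true.

    Assumes candidates are ordered oldest to newest, the first one is
    known good, the last one is known bad, and that once the fault
    appears it stays present in every later entry.
    """
    good, bad = 0, len(candidates) - 1
    while bad - good > 1:
        mid = (good + bad) // 2
        if is_bad(candidates[mid]):
            bad = mid   # fault already present: look earlier
        else:
            good = mid  # still working: look later
    return candidates[bad]


# Hypothetical data: 2.6.18 is reported good and 2.6.27 bad in this bug,
# but the point where it breaks (2.6.23 here) is made up.
releases = [f"2.6.{minor}" for minor in range(18, 28)]
tested = []


def pretend_test(version):
    """Stand-in for building, booting and load-testing one kernel."""
    tested.append(version)
    return int(version.split(".")[2]) >= 23


print(first_bad(releases, pretend_test), "found after testing", tested)
```

With ten candidates, only three builds need testing, which matters when each test can take up to 14 hours to show the failure.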
Comment 8 Daniel Drake (RETIRED) gentoo-dev 2009-02-01 01:17:08 UTC
Please reopen after providing the requested info
Comment 9 Jim Gifford 2009-02-01 04:20:58 UTC
I have spent the last few days going through the tulip git bisect. None of those changes have corrected the issue. I believe the problem is elsewhere and I don't even know where to begin to look for it.
Comment 10 Daniel Drake (RETIRED) gentoo-dev 2009-02-01 14:59:06 UTC
(In reply to comment #9)
> I have spent the last few days going through the tulip git bisect. 

Can you explain what you mean by this?

Did kernel 2.6.18 work properly? Failing that, is there any working kernel at all, or has this always been broken on Linux?

Comment 11 Jim Gifford 2009-02-02 16:06:58 UTC
It does work with 2.6.18. I applied all the tulip updates to 2.6.18, from the current kernel, and it still works. So the problem has to be elsewhere.

Comment 12 Jim Gifford 2009-02-09 22:52:16 UTC
Testing the 2.6.28-r1 kernel release. So far it's looking promising. This may be a moot point now.

I've traced back through all the kernels and have ruled out the tulip driver as the cause. I don't know what's causing it, but the same oops appears every time.

------------[ cut here ]------------
WARNING: at net/sched/sch_generic.c:219 dev_watchdog+0x250/0x280()
NETDEV WATCHDOG: eth0 (tulip): transmit timed out
Modules linked in: openpromfs tulip
Call Trace:
[00000000005ecf10] dev_watchdog+0x250/0x280
[000000000045f64c] run_timer_softirq+0x16c/0x1e0
[000000000045ab7c] __do_softirq+0x5c/0xe0
[000000000042e3f0] do_softirq+0x70/0xa0
[000000000045a764] irq_exit+0x84/0xc0
[000000000043327c] timer_interrupt+0x5c/0xc0
[00000000004209d4] tl0_irq14+0x14/0x20
[0000000000428050] cpu_idle+0x30/0xc0
[000000000070c8bc] start_kernel+0x27c/0x2c0
[0000000000643744] tlb_fixup_done+0x80/0x9c
[0000000000000000] 0x0
 ---[ end trace b95673019322adce ]---
Comment 13 Jim Gifford 2009-02-10 16:14:47 UTC
We are at 18 hours with no issues on 2.6.28-r1.

top - 08:13:39 up 18:11,  1 user,  load average: 0.00, 0.00, 0.00
Tasks:  74 total,   1 running,  73 sleeping,   0 stopped,   0 zombie
Cpu(s):  0.7%us,  0.3%sy,  0.0%ni, 99.0%id,  0.0%wa,  0.0%hi,  0.0%si,  0.0%st
Mem:    501432k total,   398520k used,   102912k free,    50072k buffers
Swap:  1510088k total,        0k used,  1510088k free,   131640k cached
Comment 14 Jim Gifford 2009-02-10 21:03:51 UTC
It just happened again, this time after 23 hours of uptime (top - 13:02:56 up 23:00).

[   79.332063] Adding 1510088k swap on /dev/hda4.  Priority:-1 extents:1 across:1510088k
[82739.527489] ------------[ cut here ]------------
[82739.527512] WARNING: at net/sched/sch_generic.c:226 dev_watchdog+0x250/0x280()
[82739.527525] NETDEV WATCHDOG: eth0 (tulip): transmit timed out
[82739.527535] Modules linked in: openpromfs tulip
[82739.527551] Call Trace:
[82739.527568]  [00000000005f4770] dev_watchdog+0x250/0x280
[82739.527589]  [000000000045cf2c] run_timer_softirq+0x16c/0x1e0
[82739.527613]  [0000000000458374] __do_softirq+0x54/0x120
[82739.527638]  [000000000042e350] do_softirq+0x70/0xa0
[82739.527653]  [0000000000457f04] irq_exit+0x84/0xc0
[82739.527668]  [0000000000431cbc] timer_interrupt+0x5c/0xc0
[82739.527686]  [00000000004209d4] tl0_irq14+0x14/0x20
[82739.527700]  [0000000000427f60] cpu_idle+0x20/0xc0
[82739.527724]  [000000000072e894] start_kernel+0x274/0x2c0
[82739.527739]  [000000000064ac44] tlb_fixup_done+0x80/0x9c
[82739.527751]  [0000000000000000] 0x0
[82739.527761] ---[ end trace d1091490855f4929 ]---
[82739.529118] 0000:00:0c.0: tulip_stop_rxtx() failed (CSR5 0xfc508100 CSR6 0x20ec000)
[82747.524814] 0000:00:0c.0: tulip_stop_rxtx() failed (CSR5 0xfc508100 CSR6 0x20ec000)
[82755.520807] 0000:00:0c.0: tulip_stop_rxtx() failed (CSR5 0xfc508100 CSR6 0x20ec000)

Wonder if it's related to some of the issues that have been discussed on linux-sparc list related to NMI.
Comment 15 Mike Pagano gentoo-dev 2009-05-01 22:35:56 UTC
It's been a while here. Any better results with later kernels?
Comment 16 Mike Pagano gentoo-dev 2009-05-29 12:50:07 UTC
Let us know if this is still an issue with later kernels.
Comment 17 Lasse Jansson 2009-11-20 23:35:58 UTC
Created attachment 210755 [details]
kernel config file
Comment 18 Lasse Jansson 2009-11-20 23:38:02 UTC
(In reply to comment #17)
> Created an attachment (id=210755) [details]
> kernel config file
> 

Hi,

I found this bug report while Googling around in the hope of getting both Ethernet interfaces to work in an old Sun Fire V100 machine.

xxxx ~ # emerge --info
Portage 2.1.6.13 (default/linux/sparc/10.0, gcc-4.3.4, glibc-2.9_p20081201-r2, 2.6.32-rc7 sparc64)
=================================================================                                 
System uname: Linux-2.6.32-rc7-sparc64-sun4u-with-gentoo-1.12.11.1                                
Timestamp of tree: Wed, 18 Nov 2009 08:15:01 +0000                                                
app-shells/bash:     4.0_p28                                                                      
dev-lang/python:     2.6.2-r1                                                                     
sys-apps/baselayout: 1.12.11.1                                                                    
sys-apps/sandbox:    1.6-r2                                                                       
sys-devel/autoconf:  2.63-r1                                                                      
sys-devel/automake:  1.10.2                                                                       
sys-devel/binutils:  2.18-r3                                                                      
sys-devel/gcc-config: 1.4.1                                                                       
sys-devel/libtool:   2.2.6a                                                                       
virtual/os-headers:  2.6.27-r2                                                                    
ACCEPT_KEYWORDS="sparc"                                                                           
CBUILD="sparc-unknown-linux-gnu"                                                                  
CFLAGS="-O2 -mcpu=ultrasparc -pipe"                                                               
CHOST="sparc-unknown-linux-gnu"                                                                   
CONFIG_PROTECT="/etc"                                                                             
CONFIG_PROTECT_MASK="/etc/ca-certificates.conf /etc/env.d /etc/gconf /etc/sandbox.d /etc/terminfo /etc/udev/rules.d"
CXXFLAGS="-O2 -mcpu=ultrasparc -pipe"                                                                               
DISTDIR="/usr/portage/distfiles"                                                                                    
FEATURES="distlocks fixpackages parallel-fetch protect-owned sandbox sfperms strict unmerge-orphans userfetch"
GENTOO_MIRRORS="ftp://mirror.mdfnet.se/gentoo ftp://ftp.ing.umu.se/linux/gentoo/ http://ftp.df.lth.se/pub/gentoo/ ftp://ftp.ds.karen.hj.se/gentoo/ http://ftp.ing.umu.se/linux/gentoo/ http://mirror.mdfnet.se/mirror/gentoo http://ftp.ds.karen.hj.se/gentoo/ ftp://ftp.df.lth.se/pub/gentoo/ "
LDFLAGS="-Wl,-O1"
MAKEOPTS="-j2"
PKGDIR="/usr/portage/packages"
PORTAGE_CONFIGROOT="/"
PORTAGE_RSYNC_OPTS="--recursive --links --safe-links --perms --times --compress --force --whole-file --delete --stats --timeout=180 --exclude=/distfiles --exclude=/local --exclude=/packages"
PORTAGE_TMPDIR="/var/tmp"
PORTDIR="/usr/portage"
SYNC="rsync://rsync.europe.gentoo.org/gentoo-portage"
USE="acl berkdb bzip2 cli cracklib crypt cups dri fortran gcc64 gdbm gpm iconv ipv6 modules mudflap ncurses nls nptl nptlonly openmp pam pcre perl pppd python readline reflection session sparc spl ssl sysfs tcpd unicode xorg zlib" ALSA_PCM_PLUGINS="adpcm alaw asym copy dmix dshare dsnoop empty extplug file hooks iec958 ioplug ladspa lfloat linear meter mmap_emul mulaw multi null plug rate route share shm softvol" APACHE2_MODULES="actions alias auth_basic authn_alias authn_anon authn_dbm authn_default authn_file authz_dbm authz_default authz_groupfile authz_host authz_owner authz_user autoindex cache dav dav_fs dav_lock deflate dir disk_cache env expires ext_filter file_cache filter headers include info log_config logio mem_cache mime mime_magic negotiation rewrite setenvif speling status unique_id userdir usertrack vhost_alias" ELIBC="glibc" INPUT_DEVICES="keyboard mouse evdev" KERNEL="linux" LCD_DEVICES="bayrad cfontz cfontz633 glk hd44780 lb216 lcdm001 mtxorb ncurses text" USERLAND="GNU" VIDEO_CARDS="fbdev glint mach64 mga r128 radeon sunbw2 suncg14 suncg3 suncg6 sunffb sunleo tdfx voodoo"
Unset:  CPPFLAGS, CTARGET, EMERGE_DEFAULT_OPTS, FFLAGS, INSTALL_MASK, LANG, LC_ALL, LINGUAS, PORTAGE_COMPRESS, PORTAGE_COMPRESS_FLAGS, PORTAGE_RSYNC_EXTRA_OPTS, PORTDIR_OVERLAY

xxxx ~ # uname -a
Linux <my machine's FQDN> 2.6.32-rc7 #13 Thu Nov 19 17:52:03 CET 2009 sparc64 sun4u TI UltraSparc IIe (Hummingbird) GNU/Linux

xxxx ~ # lspci
00:03.0 Non-VGA unclassified device: ALi Corporation M7101 Power Management Controller [PMU]
00:05.0 Ethernet controller: Davicom Semiconductor, Inc. 21x4x DEC-Tulip compatible 10/100 Ethernet (rev 31)
00:07.0 ISA bridge: ALi Corporation M1533/M1535/M1543 PCI to ISA Bridge [Aladdin IV/V/V+]
00:0a.0 USB Controller: ALi Corporation USB 1.1 Controller (rev 03)
00:0c.0 Ethernet controller: Davicom Semiconductor, Inc. 21x4x DEC-Tulip compatible 10/100 Ethernet (rev 31)
00:0d.0 IDE interface: ALi Corporation M5229 IDE (rev c3)

xxxx ~ # dmesg | grep eth
[   62.908134] eth0: Davicom DM9102/DM9102A rev 49 at Port 0x1fe02010000, EEPROM not present, 00:03:ba:14:b7:c9, IRQ 9.
[   63.241598] eth1: Davicom DM9102/DM9102A rev 49 at Port 0x1fe02010100, EEPROM not present, 00:03:ba:14:b7:ca, IRQ 10.

xxxx ~ # dmesg | grep tulip
[   62.716219] tulip0: Old style EEPROM with no media selection information.
[   62.805744] tulip0:  MII transceiver #1 config 1000 status 782d advertising 01e1.
[   63.049523] tulip1: Old style EEPROM with no media selection information.
[   63.138982] tulip1:  MII transceiver #1 config 3100 status 7829 advertising 01e1.

-------------

After experiencing the same errors as posted in the original bug report, and additionally only being able to use one of the two interfaces on my machine, I now have both of the interfaces on my machine working with the tulip driver.

-------------

The following information helped me to install Gentoo, and to find a working solution to the Ethernet problems that I experienced to start with:

1. http://www.sheepguardingllama.com/?p=2050

2. http://www.sheepguardingllama.com/?p=2057

3. http://docs.sun.com/app/docs/doc/816-2756-10?l=en

4. http://www.gsp.com/cgi-bin/man.cgi?section=4&topic=dc
- Especially the section "NOTES"

- When installing Gentoo, I used the following command to compile the kernel:
make CROSS_COMPILE=sparc64-unknown-linux-gnu- && make CROSS_COMPILE=sparc64-unknown-linux-gnu- image modules_install
- http://www.gentoo.org/doc/en/handbook/handbook-sparc.xml?part=1&chap=7 is slightly erroneous here.

I went through the following steps to get things working, after installing Gentoo from ...gentoo/releases/sparc/autobuilds/20091102/install-sparc64-minimal-20091102.iso:

* Got to the ok prompt and issued the following commands:

set-defaults

setenv boot-device disk

#hda in SUN's language
nvalias disk /pci@1f,0/ide@d/disk@0,0:a fstype ext2

-------------

#The following command is the key to getting the two Ethernet interfaces working, I think.
setenv local-mac-address? true

-------------

#I also issued the following commands, but I think they do not really matter:
nvalias eth0 /pci@1f,0/ethernet@c
nvalias eth1 /pci@1f,0/ethernet@5
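
As far as I understand the local-mac-address? setting above, when it is false the on-board interfaces all use the single system-wide MAC address, so one quick check after booting is whether the kernel reports a distinct MAC per interface. The following is only a sketch under that assumption: the helper names are made up, and the sample lines are the `dmesg | grep eth` output shown earlier in this comment.

```python
import re

# Matches the interface name and the first MAC address on lines such as:
# [   62.908134] eth0: Davicom DM9102/DM9102A rev 49 at Port ..., 00:03:ba:14:b7:c9, IRQ 9.
ETH_LINE = re.compile(
    r"(?P<iface>eth\d+): .*?(?P<mac>(?:[0-9a-f]{2}:){5}[0-9a-f]{2})",
    re.IGNORECASE,
)


def interface_macs(dmesg_text):
    """Map each interface name to the MAC address reported for it."""
    return {m["iface"]: m["mac"].lower() for m in ETH_LINE.finditer(dmesg_text)}


def shared_macs(macs):
    """Return the MAC addresses that are used by more than one interface."""
    seen = {}
    for iface, mac in macs.items():
        seen.setdefault(mac, []).append(iface)
    return {mac: ifaces for mac, ifaces in seen.items() if len(ifaces) > 1}


sample = """\
[   62.908134] eth0: Davicom DM9102/DM9102A rev 49 at Port 0x1fe02010000, EEPROM not present, 00:03:ba:14:b7:c9, IRQ 9.
[   63.241598] eth1: Davicom DM9102/DM9102A rev 49 at Port 0x1fe02010100, EEPROM not present, 00:03:ba:14:b7:ca, IRQ 10.
"""
macs = interface_macs(sample)
print(macs)
print("shared:", shared_macs(macs))
```

An empty "shared" result means every interface has its own address, which is the case for the output above.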

I have attached the kernel .config I have used.

Regards,

Lasse