Gentoo Websites Logo
Go to: Gentoo Home Documentation Forums Lists Bugs Planet Store Wiki Get Gentoo!
Bug 142439 - Sky2 driver stop working on data transferring
Summary: Sky2 driver stop working on data transferring
Status: RESOLVED FIXED
Alias: None
Product: Gentoo Linux
Classification: Unclassified
Component: [OLD] Core system (show other bugs)
Hardware: x86 Linux
: Highest major (vote)
Assignee: Gentoo Kernel Bug Wranglers and Kernel Maintainers
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2006-08-01 09:16 UTC by Alex B. Bychko
Modified: 2011-10-27 14:17 UTC (History)
0 users

See Also:
Package list:
Runtime testing required: ---


Attachments
My emerge --info output (nerff_einfo,3.65 KB, text/plain)
2006-08-01 09:17 UTC, Alex B. Bychko
Details

Note You need to log in before you can comment on or make changes to this bug.
Description Alex B. Bychko 2006-08-01 09:16:03 UTC
Initial state:

Linux nerff 2.6.17-gentoo-r4 #1 PREEMPT Sun Jul 23 13:14:18 NOVST 2006 i686 AMD Athlon(tm) 64 Processor 3200+ GNU/Linux

gcc version 4.1.1 (Gentoo 4.1.1)

nerff ~ # modinfo sky2
filename:       /lib/modules/2.6.17-gentoo-r4/kernel/drivers/net/sky2.ko
description:    Marvell Yukon 2 Gigabit Ethernet driver
author:         Stephen Hemminger <shemminger@osdl.org>
license:        GPL
version:        1.5
vermagic:       2.6.17-gentoo-r4 preempt mod_unload K8 REGPARM gcc-4.1
depends:
alias:          pci:v00001148d00009000sv*sd*bc*sc*i*
alias:          pci:v00001148d00009E00sv*sd*bc*sc*i*
alias:          pci:v00001186d00004B00sv*sd*bc*sc*i*
alias:          pci:v000011ABd00004340sv*sd*bc*sc*i*
alias:          pci:v000011ABd00004341sv*sd*bc*sc*i*
alias:          pci:v000011ABd00004342sv*sd*bc*sc*i*
alias:          pci:v000011ABd00004343sv*sd*bc*sc*i*
alias:          pci:v000011ABd00004344sv*sd*bc*sc*i*
alias:          pci:v000011ABd00004345sv*sd*bc*sc*i*
alias:          pci:v000011ABd00004346sv*sd*bc*sc*i*
alias:          pci:v000011ABd00004347sv*sd*bc*sc*i*
alias:          pci:v000011ABd00004350sv*sd*bc*sc*i*
alias:          pci:v000011ABd00004351sv*sd*bc*sc*i*
alias:          pci:v000011ABd00004352sv*sd*bc*sc*i*
alias:          pci:v000011ABd00004360sv*sd*bc*sc*i*
alias:          pci:v000011ABd00004361sv*sd*bc*sc*i*
alias:          pci:v000011ABd00004362sv*sd*bc*sc*i*
alias:          pci:v000011ABd00004363sv*sd*bc*sc*i*
alias:          pci:v000011ABd00004364sv*sd*bc*sc*i*
srcversion:     4692B864E577184E726261F
parm:           idle_timeout:Idle timeout workaround for lost interrupts (ms) (int)
parm:           disable_msi:Disable Message Signaled Interrupt (MSI) (int)
parm:           copybreak:Receive copy threshold (int)
parm:           debug:Debug level (0=none,...,16=all) (int)

03:00.0 Ethernet controller: Marvell Technology Group Ltd. 88E8053 PCI-E Gigabit Ethernet Controller (rev 15)
        Subsystem: Micro-Star International Co., Ltd. Marvell 88E8053 Gigabit Ethernet Controller (MSI)
        Flags: bus master, fast devsel, latency 0, IRQ 90
        Memory at fdbfc000 (64-bit, non-prefetchable) [size=16K]
        I/O ports at 9c00 [size=256]
        [virtual] Expansion ROM at fda00000 [disabled] [size=128K]
        Capabilities: [48] Power Management version 2
        Capabilities: [50] Vital Product Data
        Capabilities: [5c] Message Signalled Interrupts: 64bit+ Queue=0/1 Enable+
        Capabilities: [e0] Express Legacy Endpoint IRQ 0
        Capabilities: [100] Advanced Error Reporting

nerff ~ # cat /proc/interrupts
           CPU0
  0:   34812214    IO-APIC-edge  timer
  1:         60    IO-APIC-edge  i8042
  7:          1    IO-APIC-edge  parport0
  8:          2    IO-APIC-edge  rtc
  9:          0   IO-APIC-level  acpi
 15:     323689    IO-APIC-edge  ide1
 50:    2772154   IO-APIC-level  nvidia
 58:      52572   IO-APIC-level  au8830
 66:    3594728   IO-APIC-level  eth0
 74:          2   IO-APIC-level  ohci1394
 90:    2832261         PCI-MSI  sky2
217:     259375   IO-APIC-level  libata
225:       2244   IO-APIC-level  ehci_hcd:usb1
233:     130450   IO-APIC-level  ohci_hcd:usb2
NMI:          0
LOC:   34813251
ERR:          0
MIS:          0

Problem description:
/*Probably this bugreport is duplicate*/

I'm using onboard Marvell 88E8053 Gigabit Ethernet Controller (MSI) with sky2.
Sometimes, network is going down and I need to run "rmmod sky2 && modprobe sky2" to continue working. 

Steps to reproduce:
Reproduceable: sometimes.

Network is going down when I try to copy about 1Gb from my Linux box over network.I can reproduce it on large files (e.g. movie files) and directories with small files.

Problem was discovered when network interface just stop working on files copying from SAMBA(net-fs/samba-3.0.22-r3).

I can try to reproduce it with another initial conditions if it will be nesessary.
(Sorry for my poor english)
Comment 1 Alex B. Bychko 2006-08-01 09:17:24 UTC
Created attachment 93207 [details]
My emerge --info output
Comment 2 Jakub Moc (RETIRED) gentoo-dev 2006-08-01 09:20:52 UTC

*** This bug has been marked as a duplicate of 137504 ***
Comment 3 Daniel Drake (RETIRED) gentoo-dev 2006-08-20 07:55:18 UTC
Reopening to keep one sky2 bug per user
Comment 4 Alex B. Bychko 2006-08-20 09:11:09 UTC
Sorry, I can't reproduce this bug on my home box right now. I run copying from Linux box to Windows  box directory with movies (about 72G) and copying do not stop. I break this process about 50% of all files.
But - i found some strange thing:
test interface is eth1, and TX bytes value goes back from about 3G tranferred files! Please look at my ifconfig output

nerff ~ # ifconfig
eth0      Link encap:Ethernet  HWaddr 00:13:D3:9A:38:9C
          inet addr:192.168.1.2  Bcast:192.168.1.255  Mask:255.255.255.0
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          RX packets:10340 errors:0 dropped:0 overruns:0 frame:0
          TX packets:10679 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000
          RX bytes:7628579 (7.2 Mb)  TX bytes:1785652 (1.7 Mb)
          Interrupt:10 Base address:0xe000

eth1      Link encap:Ethernet  HWaddr 00:13:D3:9A:33:69
          inet addr:192.168.200.1  Bcast:192.168.200.255  Mask:255.255.255.0
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          RX packets:7180558 errors:0 dropped:0 overruns:0 frame:0
          TX packets:13733112 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000
          RX bytes:494644241 (471.7 Mb)  TX bytes:3151262129 (3005.2 Mb)
          Interrupt:10                                        ^^^^^^^^^^

lo        Link encap:Local Loopback
          inet addr:127.0.0.1  Mask:255.0.0.0
          UP LOOPBACK RUNNING  MTU:16436  Metric:1
          RX packets:13622 errors:0 dropped:0 overruns:0 frame:0
          TX packets:13622 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:0
          RX bytes:6241189 (5.9 Mb)  TX bytes:6241189 (5.9 Mb)

nerff ~ # ifconfig
eth0      Link encap:Ethernet  HWaddr 00:13:D3:9A:38:9C
          inet addr:192.168.1.2  Bcast:192.168.1.255  Mask:255.255.255.0
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          RX packets:10353 errors:0 dropped:0 overruns:0 frame:0
          TX packets:10689 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000
          RX bytes:7629619 (7.2 Mb)  TX bytes:1787162 (1.7 Mb)
          Interrupt:10 Base address:0xe000

eth1      Link encap:Ethernet  HWaddr 00:13:D3:9A:33:69
          inet addr:192.168.200.1  Bcast:192.168.200.255  Mask:255.255.255.0
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          RX packets:8339258 errors:0 dropped:0 overruns:0 frame:0
          TX packets:15951136 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000
          RX bytes:574303510 (547.6 Mb)  TX bytes:2141482572 (2042.2 Mb)
          Interrupt:10                                       ^^^^^^^^^^^

lo        Link encap:Local Loopback
          inet addr:127.0.0.1  Mask:255.0.0.0
          UP LOOPBACK RUNNING  MTU:16436  Metric:1
          RX packets:13622 errors:0 dropped:0 overruns:0 frame:0
          TX packets:13622 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:0
          RX bytes:6241189 (5.9 Mb)  TX bytes:6241189 (5.9 Mb)

nerff ~ # ifconfig
eth0      Link encap:Ethernet  HWaddr 00:13:D3:9A:38:9C
          inet addr:192.168.1.2  Bcast:192.168.1.255  Mask:255.255.255.0
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          RX packets:10368 errors:0 dropped:0 overruns:0 frame:0
          TX packets:10703 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000
          RX bytes:7630857 (7.2 Mb)  TX bytes:1789480 (1.7 Mb)
          Interrupt:10 Base address:0xe000

eth1      Link encap:Ethernet  HWaddr 00:13:D3:9A:33:69
          inet addr:192.168.200.1  Bcast:192.168.200.255  Mask:255.255.255.0
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          RX packets:9730557 errors:0 dropped:0 overruns:0 frame:0
          TX packets:18611718 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000
          RX bytes:669921266 (638.8 Mb)  TX bytes:1787593359 (1704.7 Mb)
          Interrupt:10                                        ^^^^^^^^^^

lo        Link encap:Local Loopback
          inet addr:127.0.0.1  Mask:255.0.0.0
          UP LOOPBACK RUNNING  MTU:16436  Metric:1
          RX packets:13664 errors:0 dropped:0 overruns:0 frame:0
          TX packets:13664 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:0
          RX bytes:6247991 (5.9 Mb)  TX bytes:6247991 (5.9 Mb)
Comment 5 Daniel Drake (RETIRED) gentoo-dev 2006-08-20 09:25:31 UTC
How soon were those ifconfigs invoked after each other?

It is normal for the TX bytes counter to wrap around to 0 after it reaches a certain value. It's hard to say if that is what is happening with no idea of timing.
Comment 6 Alex B. Bychko 2006-08-21 18:36:14 UTC
(In reply to comment #5)
> How soon were those ifconfigs invoked after each other?
> 
The time interval was... about 10 - 30 seconds. 

I think I know how to reproduce this bug and I will run such test today.
Initial conditions should include more than 2-3 copy sessions.
Comment 7 Daniel Drake (RETIRED) gentoo-dev 2006-08-27 09:04:53 UTC
Any news? gentoo-sources-2.6.17-r6 includes a new sky2 which may fix any problems you are still seeing.
Comment 8 Alex B. Bychko 2006-08-27 23:10:17 UTC
Thanks, I try to reproduce bug on gentoo-sources-2.6.17-r6 today on my home Linux box.

Problem with gentoo-sources-2.6.17-r4 still reproduceable (sometimes). To reproduce:
Try to copy data by portions (1-3G). At 3th or 4th session driver just stop all traffic on interface.
I can't reproduce this problem if I try to copy files in single session.


Comment 9 Alex B. Bychko 2006-08-28 07:55:30 UTC
looks like fixed
sky2 gentoo-sources-2.6.17-r6

nerff ~ # modinfo sky2
filename:       /lib/modules/2.6.17-gentoo-r6/kernel/drivers/net/sky2.ko
description:    Marvell Yukon 2 Gigabit Ethernet driver
author:         Stephen Hemminger <shemminger@osdl.org>
license:        GPL
version:        1.6
vermagic:       2.6.17-gentoo-r6 preempt mod_unload K8 REGPARM gcc-4.1

WFM using sky2 1.6 driver. Network speed about 10000 Kb/s (+|- 350). All works just fine, thanks. 
P.S. I can't run valid load-testing, sorry. It's home box, not server. 
 
Comment 10 Daniel Drake (RETIRED) gentoo-dev 2006-08-28 08:00:56 UTC
OK - thanks for letting us know.
Comment 11 Alex B. Bychko 2006-09-10 09:20:45 UTC
reopen
I had run to backup all windows box on samba share (about 40G). While data restoring from winbox, networking is unusable again.
Sorry for poor english :(
Comment 12 Daniel Drake (RETIRED) gentoo-dev 2006-09-10 13:11:46 UTC
Please reproduce with 2.6.17-r8
Comment 13 Ludek Stepan 2006-09-23 15:02:53 UTC
(In reply to comment #12)
> Please reproduce with 2.6.17-r8
> 

Hello, exactly the same problem here. Kernel is 2.6.17-r8. The NIC hangs when under load. Disabling "CONFIG_X86_UP_APIC" in the kernel .config fixed the problem for me.

Hope this is helpful.

Ludek Stepan
Comment 14 Daniel Drake (RETIRED) gentoo-dev 2006-10-15 16:39:48 UTC
Please retest with gentoo-sources-2.6.18-r1
Comment 15 Daniel Drake (RETIRED) gentoo-dev 2006-10-20 19:53:54 UTC
Add this patch as well: http://marc.theaimsgroup.com/?l=linux-netdev&m=116138920228073&q=raw
Comment 16 Alex B. Bychko 2006-10-20 21:33:07 UTC
Ok, I'll run some tests this week. 
Comment 17 Daniel Drake (RETIRED) gentoo-dev 2006-11-14 11:31:23 UTC
Please reopen if this is still a problem
Comment 18 Alex B. Bychko 2006-12-28 09:04:07 UTC
sorry for my absence :(

reopen for:

filename:       /lib/modules/2.6.19-gentoo-r2/kernel/drivers/net/sky2.ko
description:    Marvell Yukon 2 Gigabit Ethernet driver
author:         Stephen Hemminger <shemminger@osdl.org>
license:        GPL
version:        1.10
vermagic:       2.6.19-gentoo-r2 mod_unload K8 REGPARM

Reproduced on data copying from DVD-9 disk (files ~ 1G).
Comment 19 Sergiy Borodych 2007-01-31 18:21:40 UTC
See also 

http://bugzilla.kernel.org/show_bug.cgi?id=6839
workaround (Comment #12) and work patch (Comment #55) here

also reload sky2 module is workaround

Why it still exists in stable hardened ???

Bug appear here too :(

kernel: 2.6.18-hardened
Ethernet controller: Marvell Technology Group Ltd. 88E8053 PCI-E Gigabit Ethernet Controller
Comment 20 Daniel Drake (RETIRED) gentoo-dev 2007-03-10 04:43:59 UTC
Should be better in gentoo-sources-2.6.20-r2, please reopen if that is not the case
Comment 21 Sergiy Borodych 2007-03-23 12:47:37 UTC
but what about hardened kernel ?
Comment 22 Daniel Drake (RETIRED) gentoo-dev 2007-03-23 15:23:18 UTC
File a new bug with them if you have confirmed that the newer driver does solve your problem and they can then consider backporting it
Comment 23 Sergiy Borodych 2007-03-23 17:56:55 UTC
sorry but I can't check it now
we change ethernet card to intel at this server

but when I try different version of hardened sources
2 months ago - problem exists

I try 2.6.18-hardened and 2.6.19-hardened-r5