Gentoo Websites Logo
Go to: Gentoo Home Documentation Forums Lists Bugs Planet Store Wiki Get Gentoo!
Bug 56383 - http://packages.gentoo.org is behind a firewall that kills connections from 2.6.7-mm6
Summary: http://packages.gentoo.org is behind a firewall that kills connections from 2...
Status: RESOLVED FIXED
Alias: None
Product: Gentoo Linux
Classification: Unclassified
Component: [OLD] Core system (show other bugs)
Hardware: All All
: High normal (vote)
Assignee: Kurt Lieber (RETIRED)
URL: http://packages.gentoo.org
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2004-07-07 15:11 UTC by bert hubert
Modified: 2004-07-27 22:29 UTC (History)
2 users (show)

See Also:
Package list:
Runtime testing required: ---


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description bert hubert 2004-07-07 15:11:05 UTC
The firewall (or router) behind which packages.gentoo.org hides mangles the 'wscale' TCP option, causing packages.g.o to believe the client has an impossibly small TCP window, leading to a dead connection (due to SWS avoidance).

To fix, recent 2.6.7 users should issue:
echo 0 > /proc/sys/net/ipv4/tcp_default_win_scale 

The real fix is for the network maintainers to fix the firewall or router servicing packages.gentoo.org. 

Relevant thread: 
http://lkml.org/lkml/2004/7/6/146
Comment 1 bert hubert 2004-07-07 16:39:41 UTC
See also: http://lkml.org/lkml/2004/7/7/235
Comment 2 Brian S. Stephan 2004-07-07 18:21:22 UTC
This bug should probably be against "Web-www.gentoo.org"?
Comment 3 Albert Hopkins (RETIRED) gentoo-dev 2004-07-08 04:45:39 UTC
I don't admin this site.  I don't even have root access. And further more, I haven't been able to get to packages.g.o or www.g.o either. ;-)
Comment 4 bert hubert 2004-07-08 05:10:21 UTC
Yet more information and workaround at:
http://lkml.org/lkml/2004/7/8/19

Current theory is that there is some uebersmart firewall in between that tries to verify windows and sequence numbers (and not smart enough to grok window scaling).

Another possibility is that there is a very dumb machine in between that stamps out the wscale option. This latter possibility can be verified by root@packages.gentoo.org performing a tcpdump on a selected IP address that has the problem, and watching the absence or presence of wscale.
Comment 5 Kurt Lieber (RETIRED) gentoo-dev 2004-07-08 06:42:08 UTC
It's definitely not our firewall as we run the same firewall version and script on all our boxes.  It's possible it's the upstream router.  I will contact the ISP and ask them if they know anything about it.
Comment 6 bert hubert 2004-07-08 08:39:24 UTC
You can ask the ISP if they run with 'ip inspect' if they have a Cisco, which they appear to do. 'ip inspect' does things with window sizes and validation.

Thanks!
Comment 7 bert hubert 2004-07-08 08:51:39 UTC
From DaveM:
It's the netfilter patches added to the gentoo WOLK kernel running  
on packages.gentoo.org

Specifically, it's the tcp-window-tracking patch from netfilter's
patch-o-matic.  There's some bug in there wrt. it's window scaling
support.

I bet if the tcp-window-scaling diff is removed from the kernel running
there, the problem will totally go away.
 
I note that it is using a very old version of the tcp-window-tracking
patch, the current version is 2.2 and probably fixes this bug.  The
gentoo linux-2.4.20-wolk-4.14 kernel is using version 1.7 
Comment 8 Lance Albertson (RETIRED) gentoo-dev 2004-07-27 18:20:32 UTC
We upgraded the kernel on vulture today and it seems to have fixed it. I also found out what proc setting will fix this. We had issues with vulture's new kernel today, so I didn't want to try and break finch too. I figured out that the below command fixes the problem by disabling the scaling patch.

echo 0 > /proc/sys/net/ipv4/tcp_window_scaling

Please try the site now, and vulture.gentoo.org to see if the issue is still there. I had one person confirm that it was fixed on both.
Comment 9 Alexander M. Turek 2004-07-27 22:29:51 UTC
Both URLs work fine for me now. Thanks a lot. :-)