Bug 755065 - sys-kernel/gentoo-sources-5.4.72 : RX errors (overruns) with ATL1E driven ethernet controller
Status: RESOLVED FIXED
Alias: None
Product: Gentoo Linux
Classification: Unclassified
Component: Current packages
Hardware: AMD64 Linux
Importance: Normal normal
Assignee: Gentoo Kernel Bug Wranglers and Kernel Maintainers
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2020-11-17 13:12 UTC by Eric F. GARIOUD
Modified: 2020-12-24 15:31 UTC

See Also:
Package list:
Runtime testing required: ---


Description Eric F. GARIOUD 2020-11-17 13:12:39 UTC
Problem :

Since gentoo-sources-5.4.72, I have been seeing a few RX errors (overruns) on my network interface, about half a dozen per 100K packets.
I also experienced a kernel trap once.

RX errors stay at zero when I go back to 5.4.66 or older, and no trap occurs.

RX errors have always been 0 on my roughly 10-year-old, unchanged hardware, from at least gentoo-sources-2.6.38 through every following stable release up to and including 5.4.66.
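
The rate mentioned in the problem statement (about half a dozen overruns per 100K packets) can be tracked from the kernel's per-interface counters. What follows is only a minimal sketch, assuming the counters are read from /sys/class/net/<iface>/statistics, that the interface is named eth0, and that rx_over_errors is the counter of interest. Tools differ in which counter they label as overruns, so rx_fifo_errors may be the relevant one instead. The helper names are hypothetical.

```python
from pathlib import Path


def read_counter(stats_dir, name):
    """Read one integer counter file from a statistics directory."""
    return int((Path(stats_dir) / name).read_text().strip())


def overruns_per_100k(stats_dir, counter="rx_over_errors"):
    """Return RX overruns per 100,000 received packets.

    Returns None when no packet has been received yet.
    The default counter name is an assumption; pass "rx_fifo_errors"
    if that is the one your tool reports as overruns.
    """
    rx_packets = read_counter(stats_dir, "rx_packets")
    overruns = read_counter(stats_dir, counter)
    if rx_packets == 0:
        return None
    return overruns * 100_000 / rx_packets


if __name__ == "__main__":
    # "eth0" is an assumed interface name; adjust it to the real one.
    stats = Path("/sys/class/net/eth0/statistics")
    if stats.is_dir():
        print(overruns_per_100k(stats))
```

Running it once under 5.4.66 and once under 5.4.72, with the same workload, would give two directly comparable figures.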

Hardware : 

02:00.0 Ethernet controller: Qualcomm Atheros AR8121/AR8113/AR8114 Gigabit or Fast Ethernet (rev b0)
Subsystem: ASUSTeK Computer Inc. AR8121/AR8113/AR8114 Gigabit or Fast Ethernet
Kernel driver in use: ATL1E

A possibly related peculiarity of my configuration of this subsystem:

This ethernet controller is advertised as pci-msi capable, and it actually is.
However, since the linux 3.4 series, a patch has been applied that disables pci-msi support for that family of devices: http://lkml.iu.edu/hypermail/linux/kernel/1304.0/01335.html

Because I had never experienced any packet corruption before, and even more because, when the pci-msi capability of that device is disabled, the ethernet controller falls back onto the same hardware interrupt as my sound card, I systematically revert that patch before building any new kernel release.
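
Whether the ethernet controller and the sound card really end up on the same interrupt line can be checked in /proc/interrupts. The following is a rough sketch, assuming the usual layout of that file: a header row of CPU columns, then one row per IRQ whose handler names form a comma-separated list at the end. The function name shared_irqs is hypothetical, and the exact column layout varies between kernel versions.

```python
from pathlib import Path


def shared_irqs(interrupts_text):
    """Map IRQ number -> handler names, for IRQs with more than one handler."""
    lines = interrupts_text.splitlines()
    if not lines:
        return {}
    ncpu = len(lines[0].split())  # header row: CPU0 CPU1 ...
    shared = {}
    for line in lines[1:]:
        label, sep, rest = line.partition(":")
        label = label.strip()
        if not sep or not label.isdigit():
            continue  # skip summary rows such as NMI, LOC, ERR
        fields = rest.split(None, ncpu)  # ncpu counters, then the remainder
        if len(fields) <= ncpu:
            continue  # row without any description
        parts = [p.strip() for p in fields[ncpu].split(",")]
        first_tokens = parts[0].split()
        if not first_tokens:
            continue
        # The first part also carries the chip and trigger columns;
        # only its last token is a handler name.
        handlers = [first_tokens[-1]] + parts[1:]
        if len(handlers) > 1:
            shared[label] = handlers
    return shared


if __name__ == "__main__":
    proc_file = Path("/proc/interrupts")
    if proc_file.is_file():
        for irq, names in shared_irqs(proc_file.read_text()).items():
            print(irq, ", ".join(names))
```

With pci-msi enabled for the controller, its handler would be expected on a row of its own and therefore absent from this output.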

Reproducible: Always




Portage 3.0.9 (python 3.7.9-final-0, default/linux/amd64/17.1/no-multilib, gcc-9.3.0, glibc-2.32-r2, 5.4.66-0707a.0 x86_64)
=================================================================
System uname: Linux-5.4.66-0707a.0-x86_64-Intel-R-_Core-TM-2_Duo_CPU_E8400_@_3.00GHz-with-gentoo-2.7
KiB Mem:     8166964 total,   4819480 free
KiB Swap:    4194300 total,   4194300 free
Timestamp of repository gentoo: Tue, 17 Nov 2020 08:00:01 +0000
Head commit of repository gentoo: d5fb2f328838c51abf685c6b036843eef179f109
sh bash 5.0_p18
ld GNU ld (Gentoo 2.34 p6) 2.34.0
app-shells/bash:          5.0_p18::gentoo
dev-java/java-config:     2.3.1::gentoo
dev-lang/perl:            5.30.3::gentoo
dev-lang/python:          2.7.18-r4::gentoo, 3.7.9::gentoo, 3.8.6::gentoo, 3.9.0::gentoo
dev-util/cmake:           3.17.4-r1::gentoo
sys-apps/baselayout:      2.7::gentoo
sys-apps/openrc:          0.42.1::gentoo
sys-apps/sandbox:         2.20::gentoo
sys-devel/autoconf:       2.13-r1::gentoo, 2.69-r5::gentoo
sys-devel/automake:       1.13.4-r2::gentoo, 1.16.1-r1::gentoo
sys-devel/binutils:       2.34-r2::gentoo
sys-devel/gcc:            9.3.0-r1::gentoo
sys-devel/gcc-config:     2.3.2::gentoo
sys-devel/libtool:        2.4.6-r6::gentoo
sys-devel/make:           4.2.1-r4::gentoo
sys-kernel/linux-headers: 5.4-r1::gentoo (virtual/os-headers)
sys-libs/glibc:           2.32-r2::gentoo
Repositories:

gentoo
    location: /var/db/repos/gentoo
    sync-type: rsync
    sync-uri: rsync://rsync.gentoo.org/gentoo-portage
    priority: -1000
    sync-rsync-verify-metamanifest: no
    sync-rsync-extra-opts: 
    sync-rsync-verify-max-age: 24
    sync-rsync-verify-jobs: 1

Tux
    location: /var/db/repos/Tux
    masters: gentoo

ACCEPT_KEYWORDS="amd64"
ACCEPT_LICENSE="*"
CBUILD="x86_64-pc-linux-gnu"
CFLAGS="-march=native -O2 -pipe"
CHOST="x86_64-pc-linux-gnu"
CONFIG_PROTECT="/etc /usr/lib64/libreoffice/program/sofficerc /usr/share/config /usr/share/gnupg/qualified.txt /usr/share/themes/oxygen-gtk/gtk-2.0"
CONFIG_PROTECT_MASK="/etc/ca-certificates.conf /etc/dconf /etc/env.d /etc/fonts/fonts.conf /etc/gconf /etc/gentoo-release /etc/revdep-rebuild /etc/sandbox.d /etc/terminfo"
CXXFLAGS="-march=native -O2 -pipe"
DISTDIR="/var/cache/distfiles"
EMERGE_DEFAULT_OPTS="--color y --alphabetical --autounmask=n"
ENV_UNSET="CARGO_HOME DBUS_SESSION_BUS_ADDRESS DISPLAY GOBIN GOPATH PERL5LIB PERL5OPT PERLPREFIX PERL_CORE PERL_MB_OPT PERL_MM_OPT XAUTHORITY XDG_CACHE_HOME XDG_CONFIG_HOME XDG_DATA_HOME XDG_RUNTIME_DIR"
FCFLAGS="-march=native -O2 -pipe"
FEATURES="assume-digests binpkg-docompress binpkg-dostrip binpkg-logs config-protect-if-modified distlocks ebuild-locks fail-clean fixlafiles ipc-sandbox merge-sync multilib-strict network-sandbox news parallel-fetch pid-sandbox preserve-libs protect-owned qa-unresolved-soname-deps sandbox sfperms strict unknown-features-warn unmerge-logs unmerge-orphans userfetch userpriv usersandbox usersync"
FFLAGS="-march=native -O2 -pipe"
GENTOO_MIRRORS="http://distfiles.gentoo.org/ "
INSTALL_MASK="/usr/share/locale -/usr/share/locale/en -/usr/share/locale/en_GB -/usr/share/locale/en_US -/usr/share/locale/fr"
LANG="fr_FR.UTF-8"
LDFLAGS="-Wl,-O1 -Wl,--as-needed"
LINGUAS="en fr"
MAKEOPTS="-j2"
PKGDIR="/var/cache/binpkgs"
PORTAGE_CONFIGROOT="/"
PORTAGE_RSYNC_OPTS="--recursive --links --safe-links --perms --times --omit-dir-times --compress --force --whole-file --delete --stats --human-readable --timeout=180 --exclude=/distfiles --exclude=/local --exclude=/packages --exclude=/.git"
PORTAGE_TMPDIR="/var/tmp"
Comment 1 Mike Pagano gentoo-dev 2020-11-17 18:49:14 UTC
Do you have a log with the errors?
Comment 2 Eric F. GARIOUD 2020-12-24 15:31:51 UTC
Thank you, Mike, for still taking care of that package after so many years.

I apologize for the noise and would consider it fair to close this bug (INVALID / WORKSFORME, whatever suits you best), considering:

A/ I made similar observations under vanilla-sources (overruns on ATL1E with versions > 5.4.66).

B/ This bug definitely depends on one workload (my main one). I just cannot reproduce it under more "standard" network use.

C/ Testing under an un-fiddled kernel (keeping pci-msi disabled for ATL1E) with the problematic workload is meaningless: the audio and ethernet devices then share the same IRQ and are therefore handled by the same core, which very logically leads to many more overruns than under the fiddled kernel (where the pci-msi interrupt for ATL1E is handled by a different core than the one handling the audio device IRQ).

D/ My suboptimal ethernet device does not support ethtool's commands for increasing the buffer size.

E/ Since I boot with "mitigations=off", I am not chasing every security patch, and therefore won't complain about having to stick with the perfectly working 5.4.66.

So please don't waste your time with this one, Mike, and take care.