Maybe related to 938952.

I have recently started to upgrade my system. The first step included the upgrade from media-libs/mesa-24.2.7 to media-libs/mesa-24.3.2. I noticed that my laptop crashed; it was no longer reachable over SSH. In one session I noticed a kernel process "amdgpu reset dev" using 100% CPU.

After that I upgraded the kernel (gentoo-sources) from 6.12.6 to 6.12.7. All other packages have been upgraded as well, including chromium. I am now also running the latest mesa-25.0_pre20241224, with clang 19.

I have noticed this crash with both 6.12 kernels. The attached output is from 6.12.6. I'll try to get some additional information.

Reproducible: Always

Steps to Reproduce:
1. Browse for a few hours in Chromium (the first time, browsing YouTube for a few minutes was enough).

Actual Results:
Attached.

Portage 3.0.66.1 (python 3.12.8-final-0, default/linux/amd64/23.0/split-usr/no-multilib, gcc-14, glibc-2.40-r7, 6.12.7-gentoo x86_64)
=================================================================
System uname: Linux-6.12.7-gentoo-x86_64-AMD_Ryzen_5_2500U_with_Radeon_Vega_Mobile_Gfx-with-glibc2.40
KiB Mem: 16055712 total, 10668620 free
KiB Swap: 33554428 total, 33554428 free
Timestamp of repository gentoo: Sun, 29 Dec 2024 11:00:00 +0000
Head commit of repository gentoo: 2d73b90eda2491958590bd7f73c4656c7c153a38
Timestamp of repository bentoo: Wed, 25 Dec 2024 03:18:12 +0000
Head commit of repository bentoo: 6eeb266fd4d12c9b87e9ffe8d0d4c9dc2dc12a17
Timestamp of repository ceamac: Tue, 17 Dec 2024 22:21:17 +0000
Head commit of repository ceamac: 57b447d4e0bb86e8fe4482e886197f50a268f0ca
Timestamp of repository guru: Sun, 29 Dec 2024 09:33:31 +0000
Head commit of repository guru: c6cfafb2eb77f084e023af5e71cbba90b900e9ad
sh bash 5.2_p37
ld GNU ld (Gentoo 2.43 p3) 2.43.1
ccache version 4.10.2 [disabled]
app-misc/pax-utils: 1.3.8::gentoo
app-shells/bash: 5.2_p37::gentoo
dev-build/autoconf: 2.13-r8::gentoo, 2.69-r9::gentoo, 2.71-r7::gentoo, 2.72-r1::gentoo
dev-build/automake: 1.16.5-r2::gentoo, 1.17-r1::gentoo
dev-build/cmake: 3.31.3::gentoo
dev-build/libtool: 2.5.4::gentoo
dev-build/make: 4.4.1-r100::gentoo
dev-build/meson: 1.6.1::gentoo
dev-java/java-config: 2.3.4::gentoo
dev-lang/perl: 5.40.0-r1::gentoo
dev-lang/python: 2.7.18_p16-r2::gentoo, 3.10.16_p1::gentoo, 3.11.11_p1::gentoo, 3.12.8::gentoo
dev-lang/rust: 1.82.0-r100::gentoo, 1.83.0-r1::gentoo
dev-util/ccache: 4.10.2-r1::gentoo
llvm-core/clang: 18.1.8-r6::gentoo, 19.1.6::gentoo
llvm-core/lld: 18.1.8::gentoo, 19.1.6::gentoo
llvm-core/llvm: 18.1.8-r6::gentoo, 19.1.6::gentoo
sys-apps/baselayout: 2.17::gentoo
sys-apps/openrc: 0.41.2::gentoo
sys-apps/sandbox: 2.42::gentoo
sys-apps/systemd: 254.22::gentoo
sys-devel/binutils: 2.37_p1-r2::gentoo, 2.41-r4::gentoo, 2.42-r2::gentoo, 2.43-r2::gentoo
sys-devel/binutils-config: 5.5.2::gentoo
sys-devel/gcc: 13.2.1_p20240210::gentoo, 14.2.1_p20241221::gentoo
sys-devel/gcc-config: 2.12.1::gentoo
sys-kernel/linux-headers: 6.12::gentoo (virtual/os-headers)
sys-libs/glibc: 2.40-r7::gentoo
Repositories:

gentoo
    location: /var/db/repos/gentoo
    sync-type: rsync
    sync-uri: rsync://rsync.gentoo.org/gentoo-portage
    priority: -1000
    volatile: False
    sync-rsync-verify-jobs: 1
    sync-rsync-extra-opts:
    sync-rsync-verify-max-age: 24
    sync-rsync-verify-metamanifest: yes

bentoo
    location: /var/db/repos/bentoo
    sync-type: git
    sync-uri: https://github.com/gentoo-mirror/bentoo.git
    masters: gentoo
    volatile: False

ceamac
    location: /var/db/repos/ceamac
    sync-type: git
    sync-uri: https://github.com/gentoo-mirror/ceamac.git
    masters: gentoo
    volatile: False

guru
    location: /var/db/repos/guru
    sync-type: git
    sync-uri: https://github.com/gentoo-mirror/guru.git
    masters: gentoo
    volatile: False

local
    location: /var/db/repos/local
    masters: gentoo
    volatile: False

ACCEPT_KEYWORDS="amd64 ~amd64"
ACCEPT_LICENSE="*"
CBUILD="x86_64-pc-linux-gnu"
CFLAGS="-O2 -march=znver1 -O2 -pipe -pipe"
CHOST="x86_64-pc-linux-gnu"
CONFIG_PROTECT="/etc /usr/share/gnupg/qualified.txt /usr/share/maven-bin-3.6/conf /usr/share/maven-bin-3.8/conf /usr/share/maven-bin-3.9/conf /var/bind"
CONFIG_PROTECT_MASK="/etc/ca-certificates.conf /etc/dconf /etc/env.d /etc/fonts/fonts.conf /etc/gconf /etc/gentoo-release /etc/revdep-rebuild /etc/sandbox.d /etc/terminfo /etc/texmf/language.dat.d /etc/texmf/language.def.d /etc/texmf/updmap.d /etc/texmf/web2c"
CXXFLAGS="-O2 -march=znver1 -O2 -pipe -pipe"
DISTDIR="/var/cache/distfiles"
ENV_UNSET="CARGO_HOME DBUS_SESSION_BUS_ADDRESS DISPLAY GDK_PIXBUF_MODULE_FILE GOBIN GOPATH PERL5LIB PERL5OPT PERLPREFIX PERL_CORE PERL_MB_OPT PERL_MM_OPT XAUTHORITY XDG_CACHE_HOME XDG_CONFIG_HOME XDG_DATA_HOME XDG_RUNTIME_DIR XDG_STATE_HOME"
FCFLAGS="-O2 -march=znver1 -O2 -pipe -pipe"
FEATURES="assume-digests binpkg-docompress binpkg-dostrip binpkg-logs binpkg-multi-instance buildpkg-live config-protect-if-modified distlocks ebuild-locks fixlafiles ipc-sandbox merge-sync merge-wait multilib-strict network-sandbox news parallel-fetch pid-sandbox pkgdir-index-trusted preserve-libs protect-owned qa-unresolved-soname-deps sandbox sfperms strict unknown-features-warn unmerge-logs unmerge-orphans userfetch userpriv usersandbox usersync xattr"
FFLAGS="-O2 -march=znver1 -O2 -pipe -pipe"
GENTOO_MIRRORS="http://mirror.leaseweb.com/gentoo/ http://ftp.snt.utwente.nl/pub/os/linux/gentoo"
LANG="en_US.utf8"
LDFLAGS="-Wl,-O1 -Wl,--as-needed -Wl,-z,pack-relative-relocs"
LEX="flex"
MAKEOPTS="-j7"
PKGDIR="/var/cache/binpkgs"
PORTAGE_CONFIGROOT="/"
PORTAGE_RSYNC_OPTS="--recursive --links --safe-links --perms --times --omit-dir-times --compress --force --whole-file --delete --stats --human-readable --timeout=180 --exclude=/distfiles --exclude=/local --exclude=/packages --exclude=/.git"
PORTAGE_TMPDIR="/mnt/storage/portage"
SHELL="/bin/bash"
USE="X aac acl adwaita-icon-theme alsa amd64 apng aptx autohint avif bluetooth bluray brotli bzip2 cairo caps cet clang codec2 color-management corefonts cpudetection crypt curl dav1d dbus default-stack-clash-protection default-znow designer dri3 dtrace echo-cancel egl eglfs evdev exif exiv2 ffmpeg fftw font_types_otf font_types_ttf fontconfig fontforge gdal gdbm geos gimp gles2 gold graphite gtk gtk3 hackrf harfbuzz heif http2 hunspell iconv icu infinality ipv6 jack jpeg jpeg2k jumbo-build krita ladspa lapack lcms libdrm libnotify libplacebo libsamplerate libtirpc lm-sensors lv2 lz4 lzma minizip ncurses ntfs ocamlopt ogg opencv openexr opengl openmp optimization opus orc pam pcre pdf pgo pigz pipewire png policykit postgres pulseaudio qml qt6 readline rtlsdr rubberband scripttools seccomp sftp shaderrc sndfile speex split-usr sqlite srt ssl startup-notification strong-optimization svg synaptics system-bootstrap system-jsoncpp system-libvpx system-llvm systemd sysv-utils test-rust theora tiff tracepath truetype udev udisks unicode upower user-session v4l vaapi vdpau vim-syntax vorbis vpx vulkan webp webstart woff2 x264 x265 xattr xinerama xrandr zeromq zlib zstd" ABI_X86="64" ADA_TARGET="gcc_12" APACHE2_MODULES="authn_core authz_core socache_shmcb unixd actions alias auth_basic authn_anon authn_dbm authn_file authz_dbm authz_groupfile authz_host authz_owner authz_user autoindex cache cgi cgid dav dav_fs dav_lock deflate dir env expires ext_filter file_cache filter headers include info log_config logio mime mime_magic negotiation rewrite setenvif speling status unique_id userdir usertrack vhost_alias" CALLIGRA_FEATURES="karbon sheets words" COLLECTD_PLUGINS="df interface irq load memory rrdtool swap syslog" CPU_FLAGS_X86="aes avx avx2 f16c fma3 mmx mmxext pclmul popcnt sha sse sse2 sse3 sse4_1 sse4_2 sse4a ssse3" ELIBC="glibc" GPSD_PROTOCOLS="ashtech aivdm earthmate evermore fv18 garmin garmintxt gpsclock greis isync itrax navcom oceanserver oncore rtcm104v2 rtcm104v3 sirf skytraq superstar2 tsip tripmate tnt ublox" GUILE_SINGLE_TARGET="3-0" GUILE_TARGETS="3-0" INPUT_DEVICES="libinput" KERNEL="linux" L10N="en-US nl-NL nl" LCD_DEVICES="bayrad cfontz glk hd44780 lb216 lcdm001 mtxorb text" LLVM_TARGETS="AMDGPU BPF WebAssembly" LUA_SINGLE_TARGET="lua5-4" LUA_TARGETS="lua5-1" OFFICE_IMPLEMENTATION="libreoffice" PHP_TARGETS="php8-2" POSTGRES_TARGETS="postgres14" PYTHON_SINGLE_TARGET="python3_12" PYTHON_TARGETS="python2_7 python3_11 python3_12" RUBY_TARGETS="ruby32 ruby33 ruby34" UWSGI_PLUGINS="cache carbon cheaper_busyness corerouter fastrouter http logfile logsocket mongodblog nagios ping rawrouter redislog router_basicauth router_cache router_expires router_hash router_http router_memcached router_metrics router_redirect router_redis router_rewrite router_static router_uwsgi rpc signal spooler symcall syslog transformation_chunked transformation_gzip transformation_offload transformation_tofile ugreen zergpool" VIDEO_CARDS="amdgpu radeonsi radeon" XTABLES_ADDONS="quota2 psd pknock lscan length2 ipv4options ipp2p iface geoip fuzzy condition tarpit sysrq proto logmark ipmark dhcpmac delude chaos account"
Unset: ADDR2LINE, AR, ARFLAGS, AS, ASFLAGS, CC, CCLD, CONFIG_SHELL, CPP, CPPFLAGS, CTARGET, CXX, CXXFILT, ELFEDIT, EMERGE_DEFAULT_OPTS, EXTRA_ECONF, F77FLAGS, FC, GCOV, GPROF, INSTALL_MASK, LC_ALL, LD, LFLAGS, LIBTOOL, LINGUAS, MAKE, MAKEFLAGS, NM, OBJCOPY, OBJDUMP, PORTAGE_BINHOST, PORTAGE_BUNZIP2_COMMAND, PORTAGE_COMPRESS, PORTAGE_COMPRESS_FLAGS, PORTAGE_RSYNC_EXTRA_OPTS, PYTHONPATH, RANLIB, READELF, RUSTFLAGS, SIZE, STRINGS, STRIP, YACC, YFLAGS

05:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Raven Ridge [Radeon Vega Series / Radeon Vega Mobile Series] (rev c4)
Created attachment 915491
dmesg
Created attachment 915492
dmesg 6.12.7

The system load goes up to 19. The process causing the load is kworker/u32:0+amdgpu-reset-dev. This time YouTube and SketchUp Web were open.
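For anyone who wants to check over SSH whether the same reset worker is the one spinning, here is a minimal Python sketch. It only parses text that is already in "PID %CPU NAME" form; that layout, the function name `find_reset_workers`, the 50% threshold and the sample listing are all assumptions made for illustration. Only the worker name kworker/u32:0+amdgpu-reset-dev comes from this report. Such text might come from something like `ps -eo pid,pcpu,comm`, but whether the full worker name shows up there on a given system is also an assumption.

```python
import re
from typing import NamedTuple


class BusyWorker(NamedTuple):
    pid: int
    cpu_percent: float
    name: str


# Matches kworker threads whose description mentions an amdgpu reset,
# e.g. "kworker/u32:0+amdgpu-reset-dev" as named in this report.
RESET_WORKER = re.compile(r"kworker/\S*amdgpu-reset")

# Made-up sample listing (hypothetical PIDs and CPU values).
SAMPLE = """\
  PID %CPU COMMAND
 1234  0.3 chrome
 4321 99.7 [kworker/u32:0+amdgpu-reset-dev]
 4400  0.0 [kworker/u32:1-events_unbound]
"""


def find_reset_workers(ps_output: str, min_cpu: float = 50.0) -> list[BusyWorker]:
    """Return amdgpu reset kworkers using at least min_cpu percent CPU.

    Expects one process per line in the assumed form "PID %CPU NAME".
    Lines that do not fit that form (such as the header) are skipped.
    """
    workers = []
    for line in ps_output.splitlines():
        parts = line.split(None, 2)
        if len(parts) != 3 or not parts[0].isdigit():
            continue
        try:
            cpu = float(parts[1])
        except ValueError:
            continue
        # Kernel threads are often shown in square brackets; drop them.
        name = parts[2].strip().strip("[]")
        if cpu >= min_cpu and RESET_WORKER.search(name):
            workers.append(BusyWorker(int(parts[0]), cpu, name))
    return workers


if __name__ == "__main__":
    for worker in find_reset_workers(SAMPLE):
        print(worker.pid, worker.cpu_percent, worker.name)
```

The parser is deliberately kept separate from any call to ps, so it can also be run on a listing saved before the machine became unreachable.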
https://gitlab.freedesktop.org/drm/amd/-/issues/3842 suggests that there may be an issue with the latest linux-firmware. I am currently reverting the firmware.
Reverting to linux-firmware/20241110::gentoo (and recompiling the kernel) does not resolve the issue.
New upstream issue: https://gitlab.freedesktop.org/mesa/mesa/-/issues/12407
So is it a GPU driver bug or a mesa bug? It seems to be a GPU driver bug, no?

On the amd issue tracker they told you to file a new issue, but you filed it in mesa instead of amd?

Does downgrading to mesa 24.2.7 fix the crash?
(In reply to wxviolation from comment #6)
> So is it a gpu driver or mesa bug? Seems it's gpu driver bug no?

A newer version of mesa crashes the gpu. User space obviously should never be able to do that. So I would agree this is a driver bug.

> On the amd issue tracker they told you to file a new issue but you filed it
> in mesa instead of amd?

I thought that was the intention. But if it should also be moved to amd, I might need to ping.

> Does downgrading to mesa 24.2.7 fix the crash?

At this moment running on 24.2.8. Haven't had the crash yet.
(In reply to Stefan de Konink from comment #7)
> (In reply to wxviolation from comment #6)
> > So is it a gpu driver or mesa bug? Seems it's gpu driver bug no?
>
> A newer version of mesa crashes the gpu. User space obviously should never
> be able to do that. So I would agree this is a driver bug.
>

No it can (at least mesa can)
https://gitlab.freedesktop.org/drm/amd/-/issues/3497#note_2533998

>
> > On the amd issue tracker they told you to file a new issue but you filed it
> > in mesa instead of amd?
>
> I thought that was the intention. But if it should also be moved to amd, I
> might need to ping.
>
>
> > Does downgrading to mesa 24.2.7 fix the crash?
>
> At this moment running on 24.2.8. Haven't had the crash yet.

👍
Updated summary; this is exceptionally unlikely to be related to Chromium, regardless of whether or not using that software triggers the crash.

Based on upstream's response this may be related to https://gitlab.freedesktop.org/drm/amd/-/issues/3861

Current upstream ticket for this is https://gitlab.freedesktop.org/drm/amd/-/issues/3874
Hi,

Just a confirmation. I have two AMD systems with an iGPU (2400G with 8 GB memory / 7900X with 32 GB memory), and only the 2400G has the same problem under heavy memory workloads, for example with Chrome, Vivaldi or LibreOffice. But I am also sure it is not caused by them. Going back to mesa-24.2.8 is "solving" the problem for the moment. The 7900X has had only some rare opcode hiccups since >mesa-24.3, but it does not freeze.

Happy new year to all of you,
Karl
Hi, I have a laptop with

05:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Raven Ridge [Radeon Vega Series / Radeon Vega Mobile Series] (rev c4)

and I'm having the same problem: when I use media-libs/mesa-24.2.8, everything works; with a higher version of media-libs/mesa the system freezes at 100% CPU usage by some kworker/*amd*. This is independent of the kernel or firmware version. I use Thunderbird, Firefox and LibreOffice, but it only takes one xterm running emerge to crash.
For those interested: as I reported, those crashes seemed to happen only on older AMD GPUs like Raven Ridge with mesa version 24.x. I updated the system with the 2400G to mesa-25 and changed the desktop from LXQt to kde-plasma-wayland. No crashes anymore. So it might be because of mesa-25 and some change there, or it was a coincidence involving the desktop environment. On top of that, the kernel is now 6.13.x. So it would be interesting to see whether mesa-25 solves this problem on different configurations/desktops.

Have fun,
Karl
(In reply to Karl Ernst Brunk from comment #12)
> For those interested - as i reported those crashes seemed only to happen on
> older AMD_GPUs like Raven Ridge with mesa version 24.x - i updated the
> system with the 2400G to mesa-25 and changed the desktop from lxqt to
> kde-plasma-wayland.
> No crashes anymore.
> So it might be cause of mesa-25 and some kind of change there or it was
> caused because some coincidence with the desktop environment. On top of that
> : kernel now ist 6.13.x
> So it would be interesting to see if mesa-25 is solving this problem on
> different configurations/desktops.
> have fun
> Karl

Confirmed: with media-libs/mesa-25.0.0 the system is fully working. I use the Xfce desktop. Thanks