Summary: | kernel: [drm:amdgpu_cs_ioctl] *ERROR* Failed to initialize parser -125! in gentoo-sources 6.9.x and crashing of Wayland display | ||
---|---|---|---|
Product: | Gentoo Linux | Reporter: | MrSnivvel <gentoobugs> |
Component: | Current packages | Assignee: | Gentoo Kernel Bug Wranglers and Kernel Maintainers <kernel> |
Status: | RESOLVED UPSTREAM | ||
Severity: | critical | CC: | gentoobugs |
Priority: | Normal | ||
Version: | unspecified | ||
Hardware: | All | ||
OS: | Linux | ||
URL: | https://gitlab.freedesktop.org/drm/amd/-/issues/3096 | ||
Whiteboard: | |||
Package list: | Runtime testing required: | --- |
Description
MrSnivvel
2024-06-01 18:12:14 UTC
journalctl:
Jun 01 12:20:12 threadripper kernel: [drm] Fence fallback timer expired on ring comp_1.0.0
Jun 01 12:20:22 threadripper kernel: [drm:amdgpu_job_timedout] *ERROR* ring gfx_0.0.0 timeout, signaled seq=8412938, emitted seq=8412940
Jun 01 12:20:22 threadripper kernel: [drm:amdgpu_job_timedout] *ERROR* Process information: process Indiana-Win64-S pid 768939 thread vkd3d_queue pid 769101
Jun 01 12:20:22 threadripper kernel: amdgpu 0000:63:00.0: amdgpu: GPU reset begin!
Jun 01 12:20:22 threadripper kernel: amdgpu 0000:63:00.0: amdgpu: BACO reset
Jun 01 12:20:24 threadripper kernel: amdgpu 0000:63:00.0: amdgpu: GPU reset succeeded, trying to resume
Jun 01 12:20:24 threadripper kernel: [drm] PCIE GART of 512M enabled (table at 0x00000081FEE00000).
Jun 01 12:20:24 threadripper kernel: [drm] VRAM is lost due to GPU reset!
Jun 01 12:20:24 threadripper kernel: amdgpu 0000:63:00.0: amdgpu: PSP is resuming...
Jun 01 12:20:24 threadripper kernel: amdgpu 0000:63:00.0: amdgpu: reserve 0x900000 from 0x81fd000000 for PSP TMR
Jun 01 12:20:24 threadripper kernel: amdgpu 0000:63:00.0: amdgpu: RAS: optional ras ta ucode is not available
Jun 01 12:20:24 threadripper kernel: amdgpu 0000:63:00.0: amdgpu: RAP: optional rap ta ucode is not available
Jun 01 12:20:24 threadripper kernel: amdgpu 0000:63:00.0: amdgpu: SECUREDISPLAY: securedisplay ta ucode is not available
Jun 01 12:20:24 threadripper kernel: amdgpu 0000:63:00.0: amdgpu: SMU is resuming...
Jun 01 12:20:24 threadripper kernel: amdgpu 0000:63:00.0: amdgpu: use vbios provided pptable
Jun 01 12:20:24 threadripper kernel: amdgpu 0000:63:00.0: amdgpu: smc_dpm_info table revision(format.content): 4.5
Jun 01 12:20:24 threadripper kernel: amdgpu 0000:63:00.0: amdgpu: SMU is resumed successfully!
Jun 01 12:20:25 threadripper kernel: [drm] kiq ring mec 2 pipe 1 q 0
Jun 01 12:20:25 threadripper kernel: [drm] VCN decode and encode initialized successfully(under DPG Mode).
Jun 01 12:20:25 threadripper kernel: [drm] JPEG decode initialized successfully.
Jun 01 12:20:25 threadripper kernel: amdgpu 0000:63:00.0: amdgpu: ring gfx_0.0.0 uses VM inv eng 0 on hub 0
Jun 01 12:20:25 threadripper kernel: amdgpu 0000:63:00.0: amdgpu: ring comp_1.0.0 uses VM inv eng 1 on hub 0
Jun 01 12:20:25 threadripper kernel: amdgpu 0000:63:00.0: amdgpu: ring comp_1.1.0 uses VM inv eng 4 on hub 0
Jun 01 12:20:25 threadripper kernel: amdgpu 0000:63:00.0: amdgpu: ring comp_1.2.0 uses VM inv eng 5 on hub 0
Jun 01 12:20:25 threadripper kernel: amdgpu 0000:63:00.0: amdgpu: ring comp_1.3.0 uses VM inv eng 6 on hub 0
Jun 01 12:20:25 threadripper kernel: amdgpu 0000:63:00.0: amdgpu: ring comp_1.0.1 uses VM inv eng 7 on hub 0
Jun 01 12:20:25 threadripper kernel: amdgpu 0000:63:00.0: amdgpu: ring comp_1.1.1 uses VM inv eng 8 on hub 0
Jun 01 12:20:25 threadripper kernel: amdgpu 0000:63:00.0: amdgpu: ring comp_1.2.1 uses VM inv eng 9 on hub 0
Jun 01 12:20:25 threadripper kernel: amdgpu 0000:63:00.0: amdgpu: ring comp_1.3.1 uses VM inv eng 10 on hub 0
Jun 01 12:20:25 threadripper kernel: amdgpu 0000:63:00.0: amdgpu: ring kiq_0.2.1.0 uses VM inv eng 11 on hub 0
Jun 01 12:20:25 threadripper kernel: amdgpu 0000:63:00.0: amdgpu: ring sdma0 uses VM inv eng 12 on hub 0
Jun 01 12:20:25 threadripper kernel: amdgpu 0000:63:00.0: amdgpu: ring sdma1 uses VM inv eng 13 on hub 0
Jun 01 12:20:25 threadripper kernel: amdgpu 0000:63:00.0: amdgpu: ring vcn_dec uses VM inv eng 0 on hub 8
Jun 01 12:20:25 threadripper kernel: amdgpu 0000:63:00.0: amdgpu: ring vcn_enc0 uses VM inv eng 1 on hub 8
Jun 01 12:20:25 threadripper kernel: amdgpu 0000:63:00.0: amdgpu: ring vcn_enc1 uses VM inv eng 4 on hub 8
Jun 01 12:20:25 threadripper kernel: amdgpu 0000:63:00.0: amdgpu: ring jpeg_dec uses VM inv eng 5 on hub 8
Jun 01 12:20:25 threadripper kernel: amdgpu 0000:63:00.0: amdgpu: recover vram bo from shadow start
Jun 01 12:20:25 threadripper kernel: amdgpu 0000:63:00.0: amdgpu: recover vram bo from shadow done
Jun 01 12:20:25 threadripper kernel: amdgpu 0000:63:00.0: amdgpu: GPU reset(42) succeeded!
Jun 01 12:20:25 threadripper kernel: [drm:amdgpu_cs_ioctl] *ERROR* Failed to initialize parser -125!
Jun 01 12:20:25 threadripper kernel: [drm:amdgpu_cs_ioctl] *ERROR* Failed to initialize parser -125!
Jun 01 12:20:25 threadripper kernel: [drm:amdgpu_cs_ioctl] *ERROR* Failed to initialize parser -125!
Jun 01 12:20:25 threadripper kernel: [drm:amdgpu_cs_ioctl] *ERROR* Failed to initialize parser -125!
Jun 01 12:20:25 threadripper kernel: [drm:amdgpu_cs_ioctl] *ERROR* Failed to initialize parser -125!
Jun 01 12:20:25 threadripper kernel: [drm:amdgpu_cs_ioctl] *ERROR* Failed to initialize parser -125!
Jun 01 12:20:25 threadripper kernel: [drm:amdgpu_cs_ioctl] *ERROR* Failed to initialize parser -125!
Jun 01 12:20:25 threadripper kernel: [drm:amdgpu_cs_ioctl] *ERROR* Failed to initialize parser -125!
Jun 01 12:20:25 threadripper kernel: [drm:amdgpu_cs_ioctl] *ERROR* Failed to initialize parser -125!
Jun 01 12:20:25 threadripper systemd[1]: Started Process Core Dump (PID 769386/UID 0).

[I] media-libs/mesa
     Available versions: 24.0.7^t (~)24.0.8^t (~)24.0.8-r1^t (~)24.1.0^t (~)24.1.0-r1^t **9999*l^t {+X d3d9 debug gles1 +gles2 +llvm lm-sensors opencl +opengl osmesa +proprietary-codecs selinux test unwind vaapi valgrind vdpau vulkan vulkan-overlay wayland xa zink +zstd ABI_MIPS="n32 n64 o32" ABI_S390="32 64" ABI_X86="32 64 x32" CPU_FLAGS_X86="sse2" LLVM_SLOT="15 16 +17" VIDEO_CARDS="d3d12 freedreno intel lavapipe lima nouveau nvk panfrost r300 r600 radeon radeonsi v3d vc4 virgl vivante vmware"}
     Installed versions: 24.1.0-r1^t(17:41:34 05/29/24)(X llvm lm-sensors opengl proprietary-codecs vaapi vdpau vulkan vulkan-overlay wayland zstd -d3d9 -debug -opencl -osmesa -selinux -test -unwind -valgrind -xa -zink ABI_MIPS="-n32 -n64 -o32" ABI_S390="-32 -64" ABI_X86="32 64 -x32" CPU_FLAGS_X86="sse2" LLVM_SLOT="17 -15 -16" VIDEO_CARDS="radeonsi -d3d12 -freedreno -intel -lavapipe -lima -nouveau -nvk -panfrost -r300 -r600 -radeon -v3d -vc4 -virgl -vivante -vmware")

Since I'm loading firmware for the GPU:
[ 2.692347] [drm] amdgpu kernel modesetting enabled.
[ 2.692417] amdgpu: Virtual CRAT table created for CPU
[ 2.692433] amdgpu: Topology: Add CPU node
[ 2.692530] amdgpu 0000:63:00.0: enabling device (0006 -> 0007)
[ 2.697693] amdgpu 0000:63:00.0: amdgpu: Fetched VBIOS from VFCT
[ 2.697698] amdgpu: ATOM BIOS: 113-D3250500-101
[ 2.697707] Loading firmware: amdgpu/navi14_sos.bin
[ 2.697712] Loading firmware: amdgpu/navi14_asd.bin
[ 2.697717] Loading firmware: amdgpu/navi14_ta.bin
[ 2.697722] Loading firmware: amdgpu/navi14_smc.bin
[ 2.697728] Loading firmware: amdgpu/navi14_pfp_wks.bin
[ 2.697732] Loading firmware: amdgpu/navi14_me_wks.bin
[ 2.697736] Loading firmware: amdgpu/navi14_ce_wks.bin
[ 2.697740] Loading firmware: amdgpu/navi14_rlc.bin
[ 2.697744] Loading firmware: amdgpu/navi14_mec_wks.bin
[ 2.697749] Loading firmware: amdgpu/navi14_mec2_wks.bin
[ 2.697753] Loading firmware: amdgpu/navi14_sdma.bin
[ 2.697758] Loading firmware: amdgpu/navi14_sdma1.bin
[ 2.697769] Loading firmware: amdgpu/navi14_vcn.bin
[ 2.697778] amdgpu 0000:63:00.0: vgaarb: deactivate vga console
[ 2.697783] amdgpu 0000:63:00.0: amdgpu: Trusted Memory Zone (TMZ) feature disabled as experimental (default)
[ 2.697840] amdgpu 0000:63:00.0: BAR 2 [mem 0x14020000000-0x140201fffff 64bit pref]: releasing
[ 2.697847] amdgpu 0000:63:00.0: BAR 0 [mem 0x14010000000-0x1401fffffff 64bit pref]: releasing
[ 2.697915] amdgpu 0000:63:00.0: BAR 0 [mem 0x10200000000-0x103ffffffff 64bit pref]: assigned
[ 2.697928] amdgpu 0000:63:00.0: BAR 2 [mem 0x10100000000-0x101001fffff 64bit pref]: assigned
[ 2.698034] amdgpu 0000:63:00.0: amdgpu: VRAM: 8176M 0x0000008000000000 - 0x00000081FEFFFFFF (8176M used)
[ 2.698041] amdgpu 0000:63:00.0: amdgpu: GART: 512M 0x0000000000000000 - 0x000000001FFFFFFF
[ 2.698247] [drm] amdgpu: 8176M of VRAM memory ready
[ 2.698251] [drm] amdgpu: 193292M of GTT memory ready.
[ 2.699190] amdgpu 0000:63:00.0: amdgpu: Will use PSP to load VCN firmware
[ 2.766066] amdgpu 0000:63:00.0: amdgpu: reserve 0x900000 from 0x81fd000000 for PSP TMR
[ 2.836272] amdgpu 0000:63:00.0: amdgpu: RAS: optional ras ta ucode is not available
[ 2.847584] amdgpu 0000:63:00.0: amdgpu: RAP: optional rap ta ucode is not available
[ 2.847591] amdgpu 0000:63:00.0: amdgpu: SECUREDISPLAY: securedisplay ta ucode is not available
[ 2.847658] amdgpu 0000:63:00.0: amdgpu: use vbios provided pptable
[ 2.847664] amdgpu 0000:63:00.0: amdgpu: smc_dpm_info table revision(format.content): 4.5
[ 2.880287] amdgpu 0000:63:00.0: amdgpu: SMU is initialized successfully!
[ 2.972712] amdgpu: HMM registered 8176MB device memory

[I] sys-kernel/linux-firmware
     Available versions: 20240312^bstd 20240410^bstd 20240513^bstd **99999999*l^bstd {bindist compress-xz compress-zstd deduplicate dist-kernel +initramfs +redistributable savedconfig unknown-license}
     Installed versions: 20240513^bstd(15:07:48 05/20/24)(initramfs redistributable savedconfig -bindist -compress-xz -compress-zstd -deduplicate -dist-kernel -unknown-license)

CONFIG_EXTRA_FIRMWARE="amd-ucode/microcode_amd_fam17h.bin amd/amd_sev_fam17h_model0xh.sbin amd/amd_sev_fam17h_model3xh.sbin amd-ucode/microcode_amd_fam19h.bin amd/amd_sev_fam19h_model0xh.sbin amdgpu/navi14_asd.bin amdgpu/navi14_ce.bin amdgpu/navi14_ce_wks.bin amdgpu/navi14_gpu_info.bin amdgpu/navi14_me.bin amdgpu/navi14_mec2.bin amdgpu/navi14_mec2_wks.bin amdgpu/navi14_mec.bin amdgpu/navi14_mec_wks.bin amdgpu/navi14_me_wks.bin amdgpu/navi14_pfp.bin amdgpu/navi14_pfp_wks.bin amdgpu/navi14_rlc.bin amdgpu/navi14_sdma1.bin amdgpu/navi14_sdma.bin amdgpu/navi14_smc.bin amdgpu/navi14_sos.bin amdgpu/navi14_ta.bin amdgpu/navi14_vcn.bin"

[I] sys-kernel/gentoo-sources (6.9.1(6.9.1)@05/20/24 6.9.2(6.9.2)@05/26/24 6.9.3(6.9.3)@05/31/24):

The larger dmesg output:
[124435.999822] [drm] Fence fallback timer expired on ring comp_1.0.0
[124445.954727] [drm:amdgpu_job_timedout] *ERROR* ring gfx_0.0.0 timeout, signaled seq=8412938, emitted seq=8412940
[124445.954749] [drm:amdgpu_job_timedout] *ERROR* Process information: process Indiana-Win64-S pid 768939 thread vkd3d_queue pid 769101
[124445.954756] amdgpu 0000:63:00.0: amdgpu: GPU reset begin!
[124446.064638] amdgpu 0000:63:00.0: amdgpu: BACO reset
[124447.988188] amdgpu 0000:63:00.0: amdgpu: GPU reset succeeded, trying to resume
[124447.988684] [drm] PCIE GART of 512M enabled (table at 0x00000081FEE00000).
[124447.988709] [drm] VRAM is lost due to GPU reset!
[124447.988711] amdgpu 0000:63:00.0: amdgpu: PSP is resuming...
[124448.047723] amdgpu 0000:63:00.0: amdgpu: reserve 0x900000 from 0x81fd000000 for PSP TMR
[124448.118446] amdgpu 0000:63:00.0: amdgpu: RAS: optional ras ta ucode is not available
[124448.129547] amdgpu 0000:63:00.0: amdgpu: RAP: optional rap ta ucode is not available
[124448.129550] amdgpu 0000:63:00.0: amdgpu: SECUREDISPLAY: securedisplay ta ucode is not available
[124448.129554] amdgpu 0000:63:00.0: amdgpu: SMU is resuming...
[124448.129596] amdgpu 0000:63:00.0: amdgpu: use vbios provided pptable
[124448.129599] amdgpu 0000:63:00.0: amdgpu: smc_dpm_info table revision(format.content): 4.5
[124448.132441] amdgpu 0000:63:00.0: amdgpu: SMU is resumed successfully!
[124448.309903] [drm] kiq ring mec 2 pipe 1 q 0
[124448.311410] [drm] VCN decode and encode initialized successfully(under DPG Mode).
[124448.311559] [drm] JPEG decode initialized successfully.
[124448.311575] amdgpu 0000:63:00.0: amdgpu: ring gfx_0.0.0 uses VM inv eng 0 on hub 0
[124448.311578] amdgpu 0000:63:00.0: amdgpu: ring comp_1.0.0 uses VM inv eng 1 on hub 0
[124448.311581] amdgpu 0000:63:00.0: amdgpu: ring comp_1.1.0 uses VM inv eng 4 on hub 0
[124448.311584] amdgpu 0000:63:00.0: amdgpu: ring comp_1.2.0 uses VM inv eng 5 on hub 0
[124448.311586] amdgpu 0000:63:00.0: amdgpu: ring comp_1.3.0 uses VM inv eng 6 on hub 0
[124448.311589] amdgpu 0000:63:00.0: amdgpu: ring comp_1.0.1 uses VM inv eng 7 on hub 0
[124448.311591] amdgpu 0000:63:00.0: amdgpu: ring comp_1.1.1 uses VM inv eng 8 on hub 0
[124448.311594] amdgpu 0000:63:00.0: amdgpu: ring comp_1.2.1 uses VM inv eng 9 on hub 0
[124448.311596] amdgpu 0000:63:00.0: amdgpu: ring comp_1.3.1 uses VM inv eng 10 on hub 0
[124448.311599] amdgpu 0000:63:00.0: amdgpu: ring kiq_0.2.1.0 uses VM inv eng 11 on hub 0
[124448.311601] amdgpu 0000:63:00.0: amdgpu: ring sdma0 uses VM inv eng 12 on hub 0
[124448.311604] amdgpu 0000:63:00.0: amdgpu: ring sdma1 uses VM inv eng 13 on hub 0
[124448.311606] amdgpu 0000:63:00.0: amdgpu: ring vcn_dec uses VM inv eng 0 on hub 8
[124448.311609] amdgpu 0000:63:00.0: amdgpu: ring vcn_enc0 uses VM inv eng 1 on hub 8
[124448.311611] amdgpu 0000:63:00.0: amdgpu: ring vcn_enc1 uses VM inv eng 4 on hub 8
[124448.311614] amdgpu 0000:63:00.0: amdgpu: ring jpeg_dec uses VM inv eng 5 on hub 8
[124448.314681] amdgpu 0000:63:00.0: amdgpu: recover vram bo from shadow start
[124448.332245] amdgpu 0000:63:00.0: amdgpu: recover vram bo from shadow done
[124448.332267] amdgpu 0000:63:00.0: amdgpu: GPU reset(42) succeeded!
[124448.334006] [drm:amdgpu_cs_ioctl] *ERROR* Failed to initialize parser -125!
[124448.334279] [drm:amdgpu_cs_ioctl] *ERROR* Failed to initialize parser -125!
[124448.334780] [drm:amdgpu_cs_ioctl] *ERROR* Failed to initialize parser -125!
[124448.335240] [drm:amdgpu_cs_ioctl] *ERROR* Failed to initialize parser -125!
[124448.335369] [drm:amdgpu_cs_ioctl] *ERROR* Failed to initialize parser -125!
[124448.335932] [drm:amdgpu_cs_ioctl] *ERROR* Failed to initialize parser -125!
[124448.335986] [drm:amdgpu_cs_ioctl] *ERROR* Failed to initialize parser -125!
[124448.336165] [drm:amdgpu_cs_ioctl] *ERROR* Failed to initialize parser -125!
[124448.336330] [drm:amdgpu_cs_ioctl] *ERROR* Failed to initialize parser -125!
[124448.347158] [drm:amdgpu_cs_ioctl] *ERROR* Failed to initialize parser -125!
[124449.409531] xdg-desktop-por[380645]: segfault at 108 ip 00007f6990ded271 sp 00007fff6b16db78 error 4 in libsdbus-c++.so.1.4.0[7f6990de3000+24000] likely on CPU 24 (core 8, socket 0)
[124449.409555] Code: 48 8b 36 48 8b 78 08 48 8b 07 ff a0 08 01 00 00 66 0f 1f 84 00 00 00 00 00 f3 0f 1e fa 48 8b 07 48 8b 36 48 8b 78 08 48 8b 07 <ff> a0 08 01 00 00 66 0f 1f 84 00 00 00 00 00 f3 0f 1e fa 48 8b 07

I see other core dumps in journalctl, but those all come after the kernel error.

Can you add your output to the upstream bug?
https://gitlab.freedesktop.org/drm/amd/-/issues/3096

Added with a link back to this one.

(In reply to MrSnivvel from comment #5)
> Added with a link back to this one.

"... using AMDGPU under kernel 6.8.1 while..." oops

(In reply to Mike Pagano from comment #6)
> "... using AMDGPU under kernel 6.8.1 while..." oops

Since the original poster said it magically stopped happening, I'm doubtful it was looked into deeply. I, on the other hand, have triggered this problem several times with the procedure I outlined, and it reproduces 100% of the time, even if it is a very convoluted way of triggering it. When 6.9.4 is released, I expect I'll be posting a comment to that same report saying it's still happening. While waiting, is there anything I can do to increase the debugging output so it's more than a single-line error, or to help in other ways?

We'll follow the upstream bug and backport any fixes identified.
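
On the "single line error": the -125 in "Failed to initialize parser -125!" is a negated Linux errno, and 125 is ECANCELED in asm-generic/errno.h. The log ordering above (ring timeout, GPU reset, VRAM lost, then the parser errors) would be consistent with submissions being rejected after the reset, but that reading is an assumption here, not something confirmed in this report. A minimal Python sketch for summarizing a saved dmesg or journalctl dump follows; the `summarize` helper, the small `LINUX_ERRNO` table, and the `sample` text are illustrative only and not part of any existing tool.

```python
import re
from collections import Counter

# A few Linux errno values (asm-generic); hard-coded so the result does not
# depend on the platform running the script. Extend as needed.
LINUX_ERRNO = {
    12: "ENOMEM",
    22: "EINVAL",
    125: "ECANCELED",
}

# Patterns match the message formats seen in the logs in this report.
PARSER_ERR = re.compile(
    r"\[drm:amdgpu_cs_ioctl\] \*ERROR\* Failed to initialize parser (-\d+)!"
)
RING_TIMEOUT = re.compile(r"\*ERROR\* ring (\S+) timeout")


def summarize(log_text):
    """Return (timed-out ring names, Counter of errno names) found in log_text."""
    rings = RING_TIMEOUT.findall(log_text)
    codes = Counter()
    for raw in PARSER_ERR.findall(log_text):
        number = -int(raw)  # the kernel prints the negated errno
        codes[LINUX_ERRNO.get(number, f"errno {number}")] += 1
    return rings, codes


# Three lines copied from the dmesg output in this report.
sample = """\
[124445.954727] [drm:amdgpu_job_timedout] *ERROR* ring gfx_0.0.0 timeout, signaled seq=8412938, emitted seq=8412940
[124448.334006] [drm:amdgpu_cs_ioctl] *ERROR* Failed to initialize parser -125!
[124448.334279] [drm:amdgpu_cs_ioctl] *ERROR* Failed to initialize parser -125!
"""

if __name__ == "__main__":
    rings, codes = summarize(sample)
    print("timed-out rings:", rings)
    print("parser errors by errno:", dict(codes))
```

To run it against a real capture, read the file contents into a string and pass that to `summarize` in place of `sample`.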