Bug 927014 - x11-drivers/nvidia-drivers-550: crash with lenovo laptops
Summary: x11-drivers/nvidia-drivers-550: crash with lenovo laptops
Status: RESOLVED UPSTREAM
Alias: None
Product: Gentoo Linux
Classification: Unclassified
Component: Current packages
Hardware: AMD64 Linux
Importance: Normal normal
Assignee: Ionen Wolkens
URL: https://forums.developer.nvidia.com/t...
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2024-03-14 10:38 UTC by Михаил
Modified: 2024-03-23 13:22 UTC

See Also:
Package list:
Runtime testing required: ---


Attachments
kernel config (config.gz,39.73 KB, application/gzip)
2024-03-14 15:31 UTC, Михаил
Description Михаил 2024-03-14 10:38:09 UTC
I have not found a reliable way to reproduce this issue.

It has occurred with kernel versions 6.6, 6.7 and 6.8.
A new systemd version may be the cause...


kernel: BUG: unable to handle page fault for address: 0000000000004bfa
kernel: #PF: supervisor read access in kernel mode
kernel: #PF: error_code(0x0000) - not-present page
kernel: PGD 0 P4D 0 
kernel: Oops: 0000 [#1] PREEMPT SMP PTI
kernel: CPU: 0 PID: 604 Comm: systemd Tainted: P     U     O       6.6.21-gentoo #1
kernel: Hardware name: LENOVO 20H9CTO1WW/20H9CTO1WW, BIOS N1VET64W (1.54 ) 07/14/2023
kernel: RIP: 0010:rb_first+0xb/0x30
kernel: Code: 66 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 48 8b 07 48 85 c0 74 18 48 89 c2 <48> 8b 40 10 48 85 c0 75 f4 48 89 d0 31 d2 31 ff c3 cc cc cc cc 31
kernel: RSP: 0018:ffffc90001df7ca8 EFLAGS: 00010202
kernel: RAX: 0000000000004bea RBX: ffff88815b645480 RCX: 0000000000000000
kernel: RDX: 0000000000004bea RSI: 0000000000000000 RDI: ffff88815b6656c8
kernel: RBP: ffff88815b74b800 R08: 0000000000000000 R09: 0000000000000000
kernel: R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
kernel: R13: ffff88815b6656c8 R14: 0000000000000000 R15: ffffffffa332e9f8
kernel: FS:  00007f1650c34900(0000) GS:ffff888490600000(0000) knlGS:0000000000000000
kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
kernel: CR2: 0000000000004bfa CR3: 000000011537c001 CR4: 00000000003706f0
kernel: Call Trace:
kernel:  <TASK>
kernel:  ? __die+0x1f/0x70
kernel:  ? page_fault_oops+0x15d/0x460
kernel:  ? __kmem_cache_alloc_node+0x2aa/0x430
kernel:  ? cgroup_file_open+0x4b/0x130
kernel:  ? exc_page_fault+0x387/0x5f0
kernel:  ? asm_exc_page_fault+0x22/0x30
kernel:  ? rb_first+0xb/0x30
kernel:  simple_xattrs_free+0x25/0xa0
kernel:  kernfs_put.part.0+0x60/0x150
kernel:  kernfs_remove_by_name_ns+0x7d/0xd0
kernel:  cgroup_addrm_files+0x28e/0x320
kernel:  ? step_into+0x10/0x750
kernel:  ? __pfx_percpu_ref_switch_to_atomic_rcu+0x10/0x10
kernel:  ? rcu_is_watching+0xd/0x40
kernel:  css_clear_dir+0x4b/0xd0
kernel:  cgroup_destroy_locked+0xb1/0x1c0
kernel:  cgroup_rmdir+0x27/0xe0
kernel:  kernfs_iop_rmdir+0x4c/0x80
kernel:  vfs_rmdir+0x76/0x1f0
kernel:  do_rmdir+0x13c/0x180
kernel:  __x64_sys_rmdir+0x3e/0x80
kernel:  do_syscall_64+0x59/0x90
kernel:  ? do_syscall_64+0x68/0x90
kernel:  entry_SYSCALL_64_after_hwframe+0x6e/0xd8
kernel: RIP: 0033:0x7f165172c507
kernel: Code: f0 ff ff 73 01 c3 48 8b 0d 1e 99 0c 00 f7 d8 64 89 01 48 83 c8 ff c3 66 2e 0f 1f 84 00 00 00 00 00 66 90 b8 54 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 01 c3 48 8b 15 f1 98 0c 00 f7 d8 64 89 02 b8
kernel: RSP: 002b:00007ffcf6c399d8 EFLAGS: 00000246 ORIG_RAX: 0000000000000054
kernel: RAX: ffffffffffffffda RBX: 0000000000000000 RCX: 00007f165172c507
kernel: RDX: 0000000000000000 RSI: 0000000000000001 RDI: 000055f4cef1eb10
kernel: RBP: 000055f4ceecd840 R08: 00000000ffffffff R09: 0000000000000000
kernel: R10: 0000000000000000 R11: 0000000000000246 R12: 00007f1651adcc41
kernel: R13: 0000000000000001 R14: 0000000000000000 R15: 000055f4cee6e080
kernel:  </TASK>
kernel: Modules linked in: udp_diag tls tcp_diag inet_diag usbhid wireguard curve25519_x86_64 libchacha20poly1305 chacha_x86_64 poly1305_x86_64 libcurve25519_generic libchacha tun snd_seq_dummy snd_hrtimer snd_s>
kernel:  intel_pch_thermal roles ledtrig_audio platform_profile usb_common snd tpm_tis soundcore i2c_hid_acpi tpm_tis_core i2c_hid tpm intel_wmi_thunderbolt wmi_bmof think_lmi firmware_attributes_class i915 cec >
kernel: CR2: 0000000000004bfa
kernel: ---[ end trace 0000000000000000 ]---
kernel: RIP: 0010:rb_first+0xb/0x30
kernel: Code: 66 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 48 8b 07 48 85 c0 74 18 48 89 c2 <48> 8b 40 10 48 85 c0 75 f4 48 89 d0 31 d2 31 ff c3 cc cc cc cc 31
kernel: RSP: 0018:ffffc90001df7ca8 EFLAGS: 00010202
kernel: RAX: 0000000000004bea RBX: ffff88815b645480 RCX: 0000000000000000
kernel: RDX: 0000000000004bea RSI: 0000000000000000 RDI: ffff88815b6656c8
kernel: RBP: ffff88815b74b800 R08: 0000000000000000 R09: 0000000000000000
kernel: R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
kernel: R13: ffff88815b6656c8 R14: 0000000000000000 R15: ffffffffa332e9f8
kernel: FS:  00007f1650c34900(0000) GS:ffff888490600000(0000) knlGS:0000000000000000
kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
kernel: CR2: 0000000000004bfa CR3: 000000011537c001 CR4: 00000000003706f0
kernel: note: systemd[604] exited with irqs disabled
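
An editorial reading of the oops above, offered as a sketch rather than a diagnosis: in the `Code:` bytes, the marked instruction `<48> 8b 40 10` appears to be a load from `RAX + 0x10`, and the reported `CR2` is exactly `RAX + 0x10`. If `rb_left` sits at offset 0x10 in `struct rb_node` (an assumption about the x86-64 layout, not something the log states), this is consistent with `rb_first()` following a root pointer that held the bogus value 0x4bea. The snippet below only checks that arithmetic against the register values copied from the log; the helper name `parse_registers` is made up for illustration.

```python
import re

# Register values and CR2 copied verbatim from the oops above.
OOPS = """\
RAX: 0000000000004bea RBX: ffff88815b645480 RCX: 0000000000000000
RDX: 0000000000004bea RSI: 0000000000000000 RDI: ffff88815b6656c8
CR2: 0000000000004bfa CR3: 000000011537c001 CR4: 00000000003706f0
"""

# Assumption: rb_left is at offset 0x10 in struct rb_node on x86-64
# (after __rb_parent_color and rb_right, 8 bytes each).
RB_LEFT_OFFSET = 0x10


def parse_registers(text):
    """Return a dict mapping register names to their integer values."""
    pairs = re.findall(r"\b([A-Z0-9]+): ([0-9a-f]{16})\b", text)
    return {name: int(value, 16) for name, value in pairs}


regs = parse_registers(OOPS)
fault_from_rax = regs["RAX"] + RB_LEFT_OFFSET

# Compare the address derived from RAX with the faulting address in CR2.
print(hex(fault_from_rax), hex(regs["CR2"]), fault_from_rax == regs["CR2"])
```

A match only says the faulting pointer was garbage; it does not say which code corrupted it.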
Comment 1 Mike Pagano gentoo-dev 2024-03-14 11:44:49 UTC
Full dmesg, .config.

Are you running the nvidia drivers?
Comment 2 Михаил 2024-03-14 15:31:57 UTC
Created attachment 887647 [details]
kernel config
Comment 3 Михаил 2024-03-14 15:39:43 UTC
(In reply to Mike Pagano from comment #1)
> Full dmesg, .config.
I can't get dmesg after the crash.
The log above is from systemd-journald.

> Are you running the nvidia drivers?
Yes. I will now try to reproduce it without them.
Comment 4 Mike Pagano gentoo-dev 2024-03-21 18:05:36 UTC
(In reply to Михаил from comment #3)
> (In reply to Mike Pagano from comment #1)
> > Full dmesg, .config.
> I can't get dmesg after the crash.
> The log above is from systemd-journald.
> 
> > Are you running the nvidia drivers?
> Yes. I will now try to reproduce it without them.

Anything to report here?
Comment 5 Михаил 2024-03-23 09:42:04 UTC
(In reply to Mike Pagano from comment #4)
> Anything to report here?

The issue reproduces with nvidia-drivers-550.

The system is stable without nvidia-drivers or with nvidia-drivers-535.
Comment 6 Mike Pagano gentoo-dev 2024-03-23 13:00:13 UTC
Known issue with nvidia drivers
Comment 7 Ionen Wolkens gentoo-dev 2024-03-23 13:22:34 UTC
>kernel: Hardware name: LENOVO 20H9CTO1WW/20H9CTO1WW,
Nothing I can do about this, but I'm pretty sure it's the same issue as $URL, so you can track progress there.

Lenovo laptop users should stick to 535 in the meantime, even after 550 is stabilized (which will happen: it is a production branch, stabilization is becoming overdue, and this does not affect most hardware).

Note that 535 will be around for a long time: NVIDIA still supports it with bugfix and security updates, and we are not removing it until that stops.
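
Editorial sketch, not part of the maintainer's comment: one way to stay on the 535 branch with a typical Portage setup is to mask 550 and later. This assumes `/etc/portage/package.mask` is a plain file; if it is a directory on your system, the same line would go in a file inside it.

```
# /etc/portage/package.mask
# Hold back the 550 branch on affected Lenovo laptops until upstream fixes the crash.
>=x11-drivers/nvidia-drivers-550
```

Remove the entry once the upstream issue linked in the URL field is resolved.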