Gentoo Websites Logo
Go to: Gentoo Home Documentation Forums Lists Bugs Planet Store Wiki Get Gentoo!
Bug 949581 - x11-drivers/nvidia-drivers-570.86.16-r1: kernel oops and black screen with gnome+wayland
Summary: x11-drivers/nvidia-drivers-570.86.16-r1: kernel oops and black screen with gn...
Status: RESOLVED UPSTREAM
Alias: None
Product: Gentoo Linux
Classification: Unclassified
Component: Current packages (show other bugs)
Hardware: AMD64 Linux
: Normal normal
Assignee: Ionen Wolkens
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2025-02-10 19:07 UTC by daniel
Modified: 2025-02-11 08:04 UTC (History)
3 users (show)

See Also:
Package list:
Runtime testing required: ---


Attachments
emerge --info (emerge_info.txt,8.78 KB, text/plain)
2025-02-10 19:10 UTC, daniel
Details
kernel oops (kernel-oops.txt,16.25 KB, text/plain)
2025-02-10 19:11 UTC, daniel
Details

Note You need to log in before you can comment on or make changes to this bug.
Description daniel 2025-02-10 19:07:28 UTC
I'm running Gnome 47 with GDM and wayland, but happens with X11 too.
After installing nvidia-drivers-570.86.16 (and -r1), GDM Login screen shows up normally. After typing my credentials and hitting enter, screen goes black and system becomes unusable. Even switching to a console isn't possible. 
I can, however, login via ssh to gain access.
Looking at the journal, I can see a "kernel oops" with "kernel: BUG: kernel NULL pointer dereference" right after gdm-password. Attached it as a file.
reverting to nvidia-drivers-565.77 (which unfortunately has been removed from tree) manually, everything works again. I tried this several times with kernels 6.13.0, 6.13.1 and 6.13.2 without success. I'm actually running kernel 6.12.10 and nvidia-drivers-565.77 without problems.

Reproducible: Always

Steps to Reproduce:
1. install kernel 6.13.* and nvidia-drivers-570.86.16(-r1)
2. restart
3. try to login
Actual Results:  
kernel oops and black screen

Expected Results:  
successfully loading Gnome desktop environment
Comment 1 daniel 2025-02-10 19:10:59 UTC
Created attachment 918621 [details]
emerge --info
Comment 2 daniel 2025-02-10 19:11:28 UTC
Created attachment 918622 [details]
kernel oops
Comment 3 Ionen Wolkens gentoo-dev 2025-02-11 08:04:00 UTC
Can't reproduce with my hardware (a single 1070 card), used gentoo-kernel-bin-6.13.2 + nvidia-drivers-570.86.16-r1 and then logged into Gnome+Wayland and everything "seems" fine.

First time I hear such report, so I assume it's not widespread. Maybe there's something in the nvidia 570 feedback thread[1] or other distros' bug trackers, but I haven't looked closely.

Could be hardware specific (this happens often, esp. with laptops), specific to some kernel option (try kernel-bin if in doubt), or "maybe" some gnome settings (I'm using stock settings for testing).

Either way, do not think anything that we can do downstream about this and will need to report to nvidia instead (or wait for the next version while hoping it got fixed).

I "would" restore 565.77 for convenience, but that branch is:
  1. dead (no longer supported by nvidia)
  2. broken with kernel 6.13.x (bug #948478)
  3. affected by known security vulnerabilities (bug #948212), and we try to avoid keeping vulnerable packages in the tree, can't really suggest using it

Instead should use the production branch (stable 550.x) meanwhile if have issues (which works with 6.13.x and has fixed the security issues, it's *newer* than 565.77), albeit it does have worse wayland support and may have to use Xorg for now depending on setup (esp. Plasma, gnome might be fine).

That aside, to be sure, your user is in the video group right? It "shouldn't" cause a kernel OOPS but when the login manager works (which is in video group) but the DE itself doesn't, it's often related and I'm not sure what effect not being in the group have on wayland esp. with the newer related changes in 570.

[1] https://forums.developer.nvidia.com/t/570-release-feedback-discussion/321956