Gentoo Websites Logo
Go to: Gentoo Home Documentation Forums Lists Bugs Planet Store Wiki Get Gentoo!
Bug 922234 - sys-kernel/gentoo-sources-6.7.0 OCFS2 and NBD Driver (Kernel Configuration)
Summary: sys-kernel/gentoo-sources-6.7.0 OCFS2 and NBD Driver (Kernel Configuration)
Status: RESOLVED UPSTREAM
Alias: None
Product: Gentoo Linux
Classification: Unclassified
Component: Current packages (show other bugs)
Hardware: All Linux
: Normal critical (vote)
Assignee: Gentoo Kernel Bug Wranglers and Kernel Maintainers
URL: https://bugzilla.kernel.org/show_bug....
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2024-01-16 16:57 UTC by Urban Oettli
Modified: 2024-03-01 14:56 UTC (History)
2 users (show)

See Also:
Package list:
Runtime testing required: ---


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description Urban Oettli 2024-01-16 16:57:40 UTC
OCFS2 and the NBD driver statically configured in the kernel, then the OCFS2 hangs on mount.

NBD Driver configured as a module, then mounting the OCFS2 file system works well.

Filesystems
<*> OCFS2 file system support                                                                                                                  
<*> O2CB Kernelspace Clustering

Device Drivers > Block devices
<*>   Network block device support

[Jan16 23:54] rcu: INFO: rcu_preempt self-detected stall on CPU
[  +0.000005] rcu:      2-....: (15197 ticks this GP) idle=8c94/1/0x4000000000000000 softirq=3376/3377 fqs=7006
[  +0.000004] rcu:      (t=15000 jiffies g=2009 q=65 ncpus=4)
[  +0.000003] CPU: 2 PID: 11 Comm: kworker/u8:0 Not tainted 6.7.0-gentoo #18
[  +0.000002] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 4.2023.08-2 11/24/2023
[  +0.000001] Workqueue: o2net o2net_accept_many
[  +0.000012] RIP: 0010:queued_spin_lock_slowpath+0x12/0x192
[  +0.000004] Code: 02 00 00 e9 1b a0 5d ff 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 48 89 fa 66 90 b9 01 00 00 00 8b 02 85 c0 74 04 f3 90 <eb> f6 f0 0f b1 0a 85 c0 75 ee c3 cc cc cc cc 81 fe 00 01 00 00 75
[  +0.000014] RSP: 0018:ffffc9000006fdc8 EFLAGS: 00000202
[  +0.000002] RAX: 0000000000000001 RBX: ffff88800a849b00 RCX: 0000000000000001
[  +0.000001] RDX: ffff88800a849b98 RSI: 0000000000000001 RDI: ffff88800a849b98
[  +0.000001] RBP: ffff88800a849b98 R08: 00000000801a0019 R09: 00000000801a0019
[  +0.000001] R10: 0000000000000000 R11: 0000000000000000 R12: ffff888006855180
[  +0.000001] R13: ffff888009b12a00 R14: ffffffff82b93a88 R15: ffff888001978e00
[  +0.000001] FS:  0000000000000000(0000) GS:ffff88807bd00000(0000) knlGS:0000000000000000
[  +0.000003] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  +0.000001] CR2: 00005643f05c83c8 CR3: 0000000009e60000 CR4: 00000000000006b0
[  +0.000001] Call Trace:
[  +0.000008]  <IRQ>
[  +0.000003]  ? rcu_dump_cpu_stacks+0x8f/0xb3
[  +0.000004]  ? rcu_sched_clock_irq+0x2ff/0x927
[  +0.000002]  ? update_load_avg+0x327/0x350
[  +0.000003]  ? timekeeping_update+0xe2/0x110
[  +0.000002]  ? __pfx_tick_nohz_highres_handler+0x10/0x10
[  +0.000003]  ? update_process_times+0x56/0x75
[  +0.000002]  ? tick_nohz_highres_handler+0x38/0x68
[  +0.000002]  ? __hrtimer_run_queues+0x90/0xf3
[  +0.000002]  ? ktime_get_update_offsets_now+0x3d/0xa3
[  +0.000002]  ? hrtimer_interrupt+0x8d/0x15f
[  +0.000003]  ? __sysvec_apic_timer_interrupt+0x5b/0x6a
[  +0.000003]  ? sysvec_apic_timer_interrupt+0x5f/0x79
[  +0.000003]  </IRQ>
[  +0.000000]  <TASK>
[  +0.000001]  ? asm_sysvec_apic_timer_interrupt+0x16/0x20
[  +0.000003]  ? queued_spin_lock_slowpath+0x12/0x192
[  +0.000002]  lock_sock_nested+0x19/0x3e
[  +0.000003]  inet_accept+0x57/0x91
[  +0.000003]  o2net_accept_many+0xac/0x398
[  +0.000002]  ? __schedule+0x9b9/0x9f3
[  +0.000003]  ? flush_to_ldisc+0x8e/0x14b
[  +0.000002]  process_scheduled_works+0x152/0x208
[  +0.000004]  worker_thread+0x1c1/0x21b
[  +0.000002]  ? __pfx_worker_thread+0x10/0x10
[  +0.000003]  kthread+0xe3/0xeb
[  +0.000002]  ? __pfx_kthread+0x10/0x10
[  +0.000002]  ret_from_fork+0x1f/0x31
[  +0.000003]  ? __pfx_kthread+0x10/0x10
[  +0.000002]  ret_from_fork_asm+0x1b/0x30
[  +0.000003]  </TASK>
[  +0.000003] Sending NMI from CPU 2 to CPUs 3:
[  +0.000018] NMI backtrace for cpu 3
[  +0.000018] CPU: 3 PID: 0 Comm: swapper/3 Not tainted 6.7.0-gentoo #18
[  +0.000002] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 4.2023.08-2 11/24/2023
[  +0.000001] RIP: 0010:queued_read_unlock+0x9/0xe
[  +0.000004] Code: 45 85 e4 0f 94 c0 5b 5d 41 5c 41 5d c3 cc cc cc cc 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 b8 00 fe ff ff f0 0f c1 07 <c3> cc cc cc cc 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 31
[  +0.000002] RSP: 0018:ffffc90000144aa8 EFLAGS: 00000247
[  +0.000001] RAX: 0000000000000200 RBX: ffff88800a849b00 RCX: 00000000ab69bed1
[  +0.000001] RDX: 0000000000000001 RSI: 00000000fffffe00 RDI: ffff88800a849d18
[  +0.000001] RBP: ffffffff814c4e81 R08: 0000000000000430 R09: 0000000000000000
[  +0.000001] R10: 000000000e00a8c0 R11: 0000000000000006 R12: ffff88800a849d18
[  +0.000001] R13: 00000000ab69bed1 R14: 0000000000000020 R15: 0000000000000020
[  +0.000001] FS:  0000000000000000(0000) GS:ffff88807bd80000(0000) knlGS:0000000000000000
[  +0.000002] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  +0.000001] CR2: 00007ffc7e51af44 CR3: 000000001cb4c000 CR4: 00000000000006b0
[  +0.000001] Call Trace:
[  +0.000002]  <NMI>
[  +0.000001]  ? nmi_cpu_backtrace+0x94/0xc1
[  +0.000003]  ? nmi_cpu_backtrace_handler+0x8/0x10
[  +0.000002]  ? nmi_handle+0x4f/0xd6
[  +0.000002]  ? default_do_nmi+0x3f/0x1ff
[  +0.000003]  ? exc_nmi+0x9e/0x107
[  +0.000002]  ? end_repeat_nmi+0xf/0x60
[  +0.000002]  ? __pfx_o2net_listen_data_ready+0x10/0x10
[  +0.000002]  ? queued_read_unlock+0x9/0xe
[  +0.000002]  ? queued_read_unlock+0x9/0xe
[  +0.000002]  ? queued_read_unlock+0x9/0xe
[  +0.000002]  </NMI>
[  +0.000000]  <IRQ>
[  +0.000001]  _raw_read_unlock_bh+0x5/0x13
[  +0.000002]  o2net_listen_data_ready+0x56/0x73
[  +0.000002]  tcp_data_queue+0x416/0x90f
[  +0.000004]  tcp_rcv_established+0x2f1/0x3a3
[  +0.000002]  tcp_v4_do_rcv+0xb8/0x18f
[  +0.000003]  tcp_v4_rcv+0x6ca/0x938
[  +0.000002]  ip_protocol_deliver_rcu+0x97/0x134
[  +0.000002]  ip_local_deliver_finish+0x79/0x87
[  +0.000002]  ip_sublist_rcv_finish+0x28/0x38
[  +0.000002]  ip_sublist_rcv+0x166/0x18e
[  +0.000002]  ? __netif_receive_skb_core.constprop.0+0x689/0x6fc
[  +0.000003]  ip_list_rcv+0xe7/0x10f
[  +0.000002]  __netif_receive_skb_list_core+0xf1/0x119
[  +0.000002]  netif_receive_skb_list_internal+0x1cd/0x206
[  +0.000002]  gro_normal_list+0x1d/0x3f
[  +0.000002]  napi_complete_done+0x76/0x10b
[  +0.000007]  virtnet_poll+0x274/0x394 [virtio_net]
[  +0.000007]  __napi_poll.constprop.0+0x2a/0x111
[  +0.000002]  net_rx_action+0xff/0x1f6
[  +0.000002]  __do_softirq+0xb8/0x1b6
[  +0.000003]  __irq_exit_rcu+0x3d/0x7c
[  +0.000002]  common_interrupt+0x73/0x8b
[  +0.000002]  </IRQ>
[  +0.000001]  <TASK>
[  +0.000000]  asm_common_interrupt+0x22/0x40
[  +0.000002] RIP: 0010:default_idle+0xb/0x11
[  +0.000002] Code: 00 4d 29 c8 4c 01 c7 4c 29 c2 e9 6e ff ff ff 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 eb 07 0f 00 2d f7 b0 38 00 fb f4 <fa> c3 cc cc cc cc 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90
[  +0.000001] RSP: 0018:ffffc900000b7ee8 EFLAGS: 00000206
[  +0.000001] RAX: ffff88807bda00e0 RBX: ffff888001988e00 RCX: 000000cd42e4dffb
[  +0.000001] RDX: 0000000000000000 RSI: 00000014d791c300 RDI: 0000000000028504
[  +0.000001] RBP: 0000000000000000 R08: 0000000000000004 R09: 00000014d8fff900
[  +0.000000] R10: 0000000000000202 R11: 000000000000000f R12: 0000000000000000
[  +0.000001] R13: 0000000000000003 R14: 0000000000000000 R15: 0000000000000000
[  +0.000001]  default_idle_call+0x1f/0x2a
[  +0.000002]  do_idle+0xc6/0x1d3
[  +0.000002]  cpu_startup_entry+0x25/0x27
[  +0.000002]  start_secondary+0xf3/0xf3
[  +0.000003]  secondary_startup_64_no_verify+0x165/0x16b
[  +0.000002]  </TASK>
Comment 1 Urban Oettli 2024-01-19 12:16:02 UTC
(In reply to Urban Oettli from comment #0)
> OCFS2 and the NBD driver statically configured in the kernel, then the OCFS2
> hangs on mount.
> 
> NBD Driver configured as a module, then mounting the OCFS2 file system works
> well.
> 
> Filesystems
> <*> OCFS2 file system support                                               
> 
> <*> O2CB Kernelspace Clustering
> 
> Device Drivers > Block devices
> <*>   Network block device support
> 

The same effect (as described above) is achieved if
Networking support -> [*] Wireless --->
is switched on.

There is no possibility to compile this as a module and that is fatal.

Networking support -> [ ] Wireless (OCFS2 can be mounted)
Networking support -> [* ] Wireless (OCFS2 hangs forever on mount)

--- config-6.7.0-gentoo.old     2024-01-19 12:13:39.119598742 +0100
+++ config-6.7.0-gentoo         2024-01-19 12:31:38.593967801 +0100
@@ -1731,7 +1731,13 @@
 CONFIG_STREAM_PARSER=y
 # CONFIG_MCTP is not set
 CONFIG_FIB_RULES=y
-# CONFIG_WIRELESS is not set
+CONFIG_WIRELESS=y
+# CONFIG_CFG80211 is not set
+
+#
+# CFG80211 needs to be enabled for MAC80211
+#
+CONFIG_MAC80211_STA_HASH_MAX_SIZE=0
 # CONFIG_RFKILL is not set
 # CONFIG_NET_9P is not set
 # CONFIG_CAIF is not set
Comment 2 Thomas Oettli 2024-01-19 19:19:11 UTC
I am also directly affected by this issue. Many thanks to Urban to find out the dependency on other modules being compiled static into the kernel binary or loaded as module.
A fix would be greatly appreciated.
Comment 3 Mike Pagano gentoo-dev 2024-03-01 14:56:43 UTC
We're watching the upstream bug and will backport any fixes identified.  Please contribute there if you have information to add