188125 – sys-cluster/drbd-8.0.4 waits forever for connection on startup

Bug 188125 - sys-cluster/drbd-8.0.4 waits forever for connection on startup

Summary: sys-cluster/drbd-8.0.4 waits forever for connection on startup

Status:	RESOLVED FIXED

Alias:	None

Product:	Gentoo Linux
Classification:	Unclassified
Component:	[OLD] Core system (show other bugs)
Hardware:	All Linux

Importance:	High critical
Assignee:	Gentoo Cluster Team

URL:	http://lists.linbit.com/pipermail/drb...
Whiteboard:
Keywords:

Depends on:
Blocks:

Reported:	2007-08-08 16:07 UTC by Luca Lesinigo
Modified:	2010-09-10 18:51 UTC (History)
CC List:	0 users

See Also:
Package list:
Runtime testing required:	---

Attachments
Add an attachment (proposed patch, testcase, etc.)

Note You need to log in before you can comment on or make changes to this bug.

Description Luca Lesinigo 2007-08-08 16:07:00 UTC

I'm experiencing exactly the same problem Werner Fisher explained on DRBD-User for his 0.7.x setup: the drbd startup waits forever for a connection to the other node, ignoring the wfc-timeout and degr-wfc-timeout values specified in drbd.conf.

As you can see at the above URL, it was a bug and it should have been already corrected in 0.7.x. My suspect is that the same bug propagated to the 0.8.x codebase and hasn't been corrected there.

I've already contacted upstream, I'll add to this bug if I get any response.

If I correctly identified this bug, drbd-0.8.4 shouldn't be used since it's b0rked and will hang your cluster forever should a node reboot with the other being down/absent/hang. Has anyone else tried drbd-0.8.4?

Reproducible: Always

Steps to Reproduce:
1. emerge drbd-0.8.4
2. degrade cluster (turn off a node)
3. start drbd on the remaining node

Actual Results:  
drbd startup says wfc-timeout and degr-wfc-timeout are 0 (-> infinite) and waits forever, even if the user specified different values in drbd.conf

Expected Results:  
should read correct values from config and honor them

Comment 1 Luca Lesinigo 2007-08-09 00:48:53 UTC

The bug is fixed in drbd-8.0.5.

I just copied the ebuild to drbd-8.0.5.ebuild, made digest, emerged.
It compiles and works ok. It honors the [degr-]wfc-timeout values both in single resources and in common section. I made some tests and everything works ok on x86_64 hardened profile, I'm going to use it for production systems starting tomorrow and suggest a version bump in portage.
As far as I'm concerned, it should also be unmasked since it's stable.

Comment 2 Christian Zoffoli (RETIRED) gentoo-dev

2007-08-09 09:18:30 UTC

8.0.5 in portage.

I'm stress testing 8.0.x from the begin of June, 8.0.3 has some stability problems, 8.0.4 has some bugs in drbdmeta that affects large storages but seems to be stable ...I'll mark 8.0.x stable when all the relevant parts seems to be stable.