Bug 53607

Summary:	anydbm requires external locking
Product:	Gentoo Linux	Reporter:	Jason Wever (RETIRED) <weeve>
Component:	[OLD] Core system	Assignee:	Portage team <dev-portage>
Status:	RESOLVED FIXED
Severity:	normal	Keywords:	InVCS
Priority:	High
Version:	unspecified
Hardware:	Sparc
OS:	All
See Also:	https://bugs.gentoo.org/show_bug.cgi?id=736473
Whiteboard:
Package list:		Runtime testing required:	---
Bug Depends on:
Bug Blocks:	349307

Description Jason Wever (RETIRED) gentoo-dev

2004-06-10 19:56:24 UTC

When usinng the anydbm module feature for portage, repoman has a tendency to crash by not being able to open the database while an emerge is running.  The following is the backtrace repoman tosses out;

excelsior wterm # repoman

RepoMan scours the neighborhood...
Traceback (most recent call last):
  File "/usr/bin/repoman", line 667, in ?
    myaux=portage.db["/"]["porttree"].dbapi.aux_get(catdir+"/"+y,allvars,strict=1)
  File "/usr/lib/portage/pym/portage.py", line 4496, in aux_get
    self.auxdb[cat] = self.auxdbmodule(self.cachedir,cat,auxdbkeys,uid,portage_gid)
  File "/usr/lib/portage/pym/portage_db_anydbm.py", line 22, in __init__
    self.db = anydbm.open(self.filename, "n", 0664)
  File "/usr/lib/python2.3/anydbm.py", line 83, in open
    return mod.open(file, flag, mode)
gdbm.error: (11, 'Resource temporarily unavailable')


The contents of my /etc/portage/modules file are below;
portdbapi.auxdbmodule="portage_db_anydbm.database"
eclass_cache.dbmodule="portage_db_anydbm.database"

Emerge info is as follows;

Portage 2.0.50-r8 (default-sparc64-1.4, gcc-3.3.3, glibc-2.3.3_pre20040420-r0, 2.6.6)
=================================================================
System uname: 2.6.6 sparc64 sun4u
Gentoo Base System version 1.4.16
distcc 2.14 sparc-unknown-linux-gnu (protocols 1 and 2) (default port 3632) [disabled]
ccache version 2.3 [enabled]
Autoconf: sys-devel/autoconf-2.59-r4
Automake: sys-devel/automake-1.8.5
ACCEPT_KEYWORDS="sparc ~sparc"
AUTOCLEAN="yes"
CFLAGS="-mcpu=ultrasparc -mtune=ultrasparc -O2 -g -pipe"
CHOST="sparc-unknown-linux-gnu"
COMPILER="gcc3"
CONFIG_PROTECT="/etc /etc/tomcat /usr/X11R6/lib/X11/xkb /usr/kde/2/share/config /usr/kde/3.1/share/config /usr/kde/3.2/share/config /usr/kde/3/share/config /usr/lib/mozilla/defaults/pref /usr/share/config /usr/share/texmf/dvipdfm/config/ /usr/share/texmf/dvips/config/ /usr/share/texmf/tex/generic/config/ /usr/share/texmf/tex/platex/config/ /usr/share/texmf/xdvi/ /var/bind /var/qmail/control"
CONFIG_PROTECT_MASK="/etc/gconf /etc/terminfo /etc/env.d"
CXXFLAGS="-mcpu=ultrasparc -mtune=ultrasparc -O2 -g -pipe"
DISTDIR="/opt/gentoo/distfiles"
FEATURES="autoaddcvs buildpkg ccache cvs digest nostrip sandbox"
GENTOO_MIRRORS="ftp://gentoo.mirrors.pair.com/ http://gentoo.seren.com/gentoo"
MAKEOPTS="-j2"
PKGDIR="/usr/portage/packages"
PORTAGE_TMPDIR="/var/tmp"
PORTDIR="/usr/gentoo-x86"
PORTDIR_OVERLAY="/opt/gentoo/overlay"
SYNC="rsync://rsync.gentoo.org/gentoo-portage"
USE="X Xaw3d aac aalib acl adns apache2 arts artswrappersuid athena audiofile avi berkdb bidi bonobo cap caps cdr chroot cjk clamav crypt cups curl dga dillo doc drac dvd dvdr emacs emacs-w3 encode esd etwin expat f77 faad fam fbcon fftw flac flash foomaticdb freetds freetype freewnn gd gd-external gdbm ggi gif ginac gmp gnome gnutls gphoto2 gpm gstreamer gtk gtk2 gtkhtml guile hbci hdf5 ieee1394 imap imlib imlib2 innodb jabber jack java javamail jikes joystick jpeg junit kde kerberos krb4 ldap libcaca libwww live mad maildir matroska mcal md5sum memlimit mikmod minimal monkey motif mozilla mpeg mpi msn mule mysql ncurses neXt nls nntp ntlm oav objc odbc ofx oggvorbis openal opengl oscar oss pam pda pdflib pear-db perl physfs png portaudio postgres ppds prelude python qhull qt quicktime quotes readline rplay ruby samba sasl scanner sdl skey slang slp socks5 sparc speex spell ssl stroke tcltk tcpd tetex theora threads tiff truetype unicode usb v4l vanilla wxwindows xface xgetdefault xml xml2 xmms xosd xv yahoo yaz zlib"

Comment 1 Brian Harring (RETIRED) gentoo-dev

2004-06-10 20:21:54 UTC

running multiple emerges by chance?

Comment 2 Brian Harring (RETIRED) gentoo-dev

2004-06-10 20:37:00 UTC

Err, bah, pardon- pays to read the summary careful :)
need try/except locks in portage_db_anydbm.database for when anydbm throws the exception- lockfiles possibly?
Would be nice if the actual dbm instance would just block till it has access...

Comment 3 Brian Harring (RETIRED) gentoo-dev

2004-06-10 21:17:37 UTC

Jason, line 22 of portage_db_anydbm.py (roughly, dependant on version), if you could chuck this in-

-except:
+except Exception, e:
+print "exception caught e(%s)" % str(e)

(obviously adjust whitespace)
Looks like the initial open fails, although I'd be curious what the specific exception is- after the open fails, portage tries to create a new db, resulting in GDB throwing a GDBM_READER_CANT_DELETE (11) error.

The initial open could be failing due to access denied, or some other crazy crap.  Adding the print for the exception will help to identify the issue.

Comment 4 Jason Wever (RETIRED) gentoo-dev

2004-06-11 04:48:25 UTC

Will do once this emerge finishes (it's still going).

I've noticed in the past that when I run into this, typically it is during an emerge -uvD world that has package updates that will take some time (like KDE).  When the emerge first starts  up, I can still use repoman, but after roughly an hour or two I cannot until the emerge finishes.  Hopefully that will be of some help/

Comment 5 Jason Wever (RETIRED) gentoo-dev

2004-06-12 16:16:04 UTC

Here are the results after changing/usr/lib/portage/pym/portage_db_anydbm.py.

excelsior webmin # repoman

RepoMan scours the neighborhood...
exception caught e((11, 'Resource temporarily unavailable'))
Traceback (most recent call last):
  File "/usr/bin/repoman", line 667, in ?
    myaux=portage.db["/"]["porttree"].dbapi.aux_get(catdir+"/"+y,allvars,strict=
1)
  File "/usr/lib/portage/pym/portage.py", line 4496, in aux_get
    self.auxdb[cat] = self.auxdbmodule(self.cachedir,cat,auxdbkeys,uid,portage_g
id)
  File "/usr/lib/portage/pym/portage_db_anydbm.py", line 23, in __init__
    self.db = anydbm.open(self.filename, "n", 0664)
  File "/usr/lib/python2.3/anydbm.py", line 83, in open
    return mod.open(file, flag, mode)
gdbm.error: (11, 'Resource temporarily unavailable')

Comment 6 Brian Harring (RETIRED) gentoo-dev

2004-08-02 10:00:04 UTC

Jason, haven't forgot about this bug, although addressing it is going to be a bit of a pita.
Long story short, portage_db_anybm is rather flawed- anydbm either uses bsddb, or gdbm.

gdbm flat out denies multiple writers from working on a file.  Good behaviour, annoying for our needs though (this is what you're hitting btw).

bsddb on the other hand, is dumb.  Really, *really* dumb.
Multiple writers can work on a file, problem is there isn't any coordination, so pretty much the last writer to close the file is the one that gets it's changes into the file.  This of course is assuming that the file hasn't been corrupted beyond belief.  Changes made by writer A, do not show up in reader B, unless A syncs, and reader B close's/reopens the db in question.

Pretty much, we ought to either A) default to bsddbm, and uses locks to signal when readers are out of sync w/ what's on disk (and must close/reopen), or B) add a use flag, dbm, and have it pull in dev-python/bsddb3 , a reworked bsddb module that allows for concurrent access sanely.

Comment 7 Brian Harring (RETIRED) gentoo-dev

2004-08-02 10:06:36 UTC

Gah... s/bsddbm/bsddb/ for above.
To verify what I'm saying,
for gdbm-
echo "import gdbm; db=gdbm.open('dar','n',0664);db2=gdbm.open('dar','r',0664);" | python
implodes when attempting to open a reader- you *cannot* open a writer when readers have the db open, and vice versa, no opening reader's when a writer is fooling with it.

for bsddb
just pop open two python instances, and run this in both-
import anydbm
db=anydbm.open("dar.bsddb","c",0644)

now start assigning records into the db (calling db.sync() as you please), and check the available keys via db.keys()
You'll note they differ.  Annoying, but can be dealt with.  sync the db's (noting the order they've been synced), and close them.  Reopen the db, and note they weren't merged, one writer overwrote the other.

If you're lucky, you'll get to see it corrupt the db, which is *really* fun :)

Comment 8 Brian Harring (RETIRED) gentoo-dev

2004-08-04 21:07:41 UTC

*** Bug 54280 has been marked as a duplicate of this bug. ***

Comment 9 Matt Turner gentoo-dev

2010-12-26 06:53:52 UTC

Can someone confirm that this is a long-since-dead bug? (Last modified: 2005-10-07)

Comment 10 Zac Medico gentoo-dev

2010-12-26 20:14:31 UTC

The gdbm module still raises 'Resource temporarily unavailable' if the db is already opened for write by another process. Python's gdbm module docs show a 'u' flag that can be used to open the database without locking, which might work for our purposes.

It seems that bsddb deprecated in python-2.x, and removed in python-3.x, so that's not worth supporting.

Comment 11 Zac Medico gentoo-dev

2010-12-26 21:26:17 UTC

It's fixed here to use the "u" flag with gdbm, for concurrent writers, and it always creates gdbm type databases when python is built with gdbm suport:

http://git.overlays.gentoo.org/gitweb/?p=proj/portage.git;a=commit;h=460fa368db599a21769e1be267d19cd3a5bd9572

Comment 12 Zac Medico gentoo-dev

2010-12-31 10:02:22 UTC

This is fixed in 2.1.9.27.