[Date Prev][Date Next] [Chronological] [Thread] [Top]

back-mdb locking strategies

To: OpenLDAP Devel <openldap-devel@openldap.org>
Subject: back-mdb locking strategies
From: Howard Chu <hyc@symas.com>
Date: Thu, 24 Sep 2009 16:31:07 -0700
User-agent: Mozilla/5.0 (X11; U; Linux x86_64; rv:1.9.1b5pre) Gecko/20090909 SeaMonkey/2.0a1pre Firefox/3.0.3

There are two main problems being addressed by the mdb approach - eliminatingmultiple layers of cache copying, and eliminating multiple layers of locking.

Currently we have per-entry mutexes and also BDB locks for every cached item.The mutex overhead is ridiculous, 40-some bytes per mutex x 10 million entriesis a lot of wasted storage. Especially since a single slapd can only actuallylock 2-3 items per thread - i.e., we only need on the order of 100 locks,ever. The sane thing to do here is continue to use the BDB approach tolocking, but with fewer options.

By default BDB allocates a new locker for every database handle and everycursor that's created. This actually causes us a lot of trouble since itallows two cursors created by the same thread to deadlock each other. (Andit's why we had to go to the trouble of wrapping read operations in a TXN,even though we don't need any of the ACID properties for reads.) For mdb wewill want an API for which (essentially) threadID == lockerID and cursors andsuch will not have their own lockerIDs.

Actual locks will be managed in separate tables. The most common case will belocks indexed by entryID. For this case we can easily structure the table tominimize lock and cache contention. E.g., for 16 threads, use a 16x16 array ofstructures. Each element contains a vector of 16 (mutex,locker,entryID)tuples. The element to lock for a given entryID is ((entryID & 0xff0) >> 4).In the worst case of all 16 threads going after the same entry, they will allbe waiting on a single mutex which is no worse than what we have now. For all16 threads going after the same slot, they will simply have to use other slotsin the lock vector.

But in the typical case, all threads will be going after different mutexesresiding in different processor cachelines. And, a thread that is iteratingsequentially through the database will be re-using the same mutex/cachelinefor 16 entries at a time.

(16 was chosen just for the purpose of this example. The actual numbers shouldbe derived from the maximum number of threads allowed to access the database.)


--
  -- Howard Chu
  CTO, Symas Corp.           http://www.symas.com
  Director, Highland Sun     http://highlandsun.com/hyc/
  Chief Architect, OpenLDAP  http://www.openldap.org/project/

Follow-Ups:
- Re: back-mdb locking strategies
  - From: Howard Chu <hyc@symas.com>

Prev by Date: Re: Problems compiling OpenLDAP in 64 bit due to old libtool version included
Next by Date: Re: back-mdb locking strategies
Index(es):
- Chronological
- Thread