[Date Prev][Date Next] [Chronological] [Thread] [Top]

mmap'd DB

To: OpenLDAP-devel@openldap.org
Subject: mmap'd DB
From: Howard Chu <hyc@symas.com>
Date: Wed, 22 Jun 2011 18:26:51 -0700
User-agent: Mozilla/5.0 (X11; Linux x86_64; rv:7.0a1) Gecko/20110612 Firefox/7.0a1 SeaMonkey/2.4a1

The main goal of mdb is to eliminate extra copies of data in various caches. Asecondary goal is simplifying config (by eliminating extra caches, there'snothing left to tune/configure). One of the key requirements for this to workis to make the on-disk Entry format identical to the in-memory format. Sincethe in-memory format is based on pointers, we seem to need a way to nail downthe memory map to a fixed address that is constant from run to run. (Justre-capping...)

The tricky part is that Entries contain AttributeDescriptions and theseobviously depend on whatever schema is loaded on a particular run. If you takeno special steps, then adding or deleting schema elements from run to run willinvalidate their addresses, and make the on-disk pointers useless.

I had a notion to replace AttributeDescription pointers with indices into anarray. The array would be persisted into a file, and would only ever grow;items would never be deleted from it. Then Entries could reference the arrayindices instead of the pointers. A scheme like this could work, but the sheervolume of code that directly (de-)references AD pointers is pretty huge. Notsure if it's the right path to take.

Another option was to setup a dedicated ad_alloc() function that carves up itsown mmap'd file. Then AD pointers will always be constant from run to run. Thetrouble here is how to configure the path to this file; all of the hard-codedschema is generated before we ever get to reading the config.

A possible approach is to first allocate the AD pointers in an anonymousmmap'd region, then once the config file is read we can open the configuredfile and pull from there. For this to work we have to reserve a fixed sizechunk of memory for the hard-coded schema, so that if we add/delete otherhard-coded schema elements down the road, the map remains useful. All in allit seems extremely fragile and a major pain for compatibility between revisions.

Another possibility is to collect all the schema elements at build time, andgenerate a .c file containing a seed AD map which is hard-coded (and again, isonly allowed to grow, no elements modified in place or deleted) so that thecore AD map is always the same from run to run. The more I think of it, themore I think this will work. It's essentially the same solution we used to getrid of the ucdata-path item...

--
  -- Howard Chu
  CTO, Symas Corp.           http://www.symas.com
  Director, Highland Sun     http://highlandsun.com/hyc/
  Chief Architect, OpenLDAP  http://www.openldap.org/project/

Prev by Date: RE24 testing call #2 (2.4.26)
Next by Date: Re: RE24 testing call #2 (2.4.26)
Index(es):
- Chronological
- Thread