[Date Prev][Date Next] [Chronological] [Thread] [Top]

Re: LMDB Fixed memory address mapping documentation and example(s)?

To: John Daly <john.p.daly1@gmail.com>, "OpenLDAP-devel@openldap.org" <OpenLDAP-devel@openldap.org>
Subject: Re: LMDB Fixed memory address mapping documentation and example(s)?
From: Howard Chu <hyc@symas.com>
Date: Wed, 13 Jun 2018 21:36:45 +0100
In-reply-to: <CAMVaYPCyZh2XLx0h8pv+=ykhZ7L2jCCjWASM5Ob37ssUUcc7QQ@mail.gmail.com>
References: <CAMVaYPCyZh2XLx0h8pv+=ykhZ7L2jCCjWASM5Ob37ssUUcc7QQ@mail.gmail.com>
User-agent: Mozilla/5.0 (X11; Linux x86_64; rv:56.0) Gecko/20100101 Firefox/56.0 SeaMonkey/2.53a1

John Daly wrote:

Hi Howard --


(cc'ing openldap-devel for general interest)

Hi John,

thanks for your interest. Unfortunately, this is a feature that we neverfully implemented. I can give you a rundown of how it was intended to be used,though. You have the overall concept - you must make sure that any objectyou want to store is marshalled into a single contiguous blob.

In OpenLDAP, the back-mdb backend already does this for us when storing LDAPentries, but it only goes half way.


Follow along in slap.h:
http://www.openldap.org/devel/gitweb.cgi?p=openldap.git;a=blob;f=servers/slapd/slap.h;h=b51b5571b652e508aed5b31eee5b2ce9e930cc24;hb=HEAD#l1207

We have a struct Entry with a couple of struct berval names and then a linkedlist of Attributes.


http://www.openldap.org/devel/gitweb.cgi?p=openldap.git;a=blob;f=servers/slapd/slap.h;h=b51b5571b652e508aed5b31eee5b2ce9e930cc24;hb=HEAD#l1167

Attributes point to an AttributeDescription and then have two arrays of structbervals for values, and a next pointer.

So to serialize all of this into an LMDB blob we would malloc a single blob tohold an Entry struct contiguous with all of its Attribute structs and followedby all of its value arrays. The various struct fields would then point toaddresses within this contiguous blob. This would be passed into mdb_put ormdb_cursor_put. Laid out appropriately, the object could be used as-is with nodeserialization needed, on read.

The MDB_rel_func callback on the DB would have previously been set to afunction that knows the layout of this Entry blob, and would increment ordecrement all of these blob-internal pointers whenever the blob itself wasmoved by an LMDB operation. The instances where this occurs are when copying aread-only page to make a writable copy, when moving items in a page to fillthe gap from deleting a node, and when splitting a page to insert a new node.

This latter part has never been implemented, because it turned out we neverreally needed it. Most LDAP entries are larger than half a page, so they windup being written to an overflow page. Once on an overflow page, they are nevermoved or relocated.

In my experience, C++ doesn't give you much control over object storagelayout, particularly not if you're using standard templates and other suchstuff. I'm not sure how well you could leverage this capability from C++.

If you're looking at our implementation in back-mdb/id2entry.c to see how weactually store the complete Entry's, you'll notice that it's more complicatedthan I just described. The main reason here is that the AttributeDescriptionis a pointer into the slapd schema, and schema elements aren't (currently)stored in LMDB so we can't guarantee constant pointer values for thoseobjects. Instead we had to use a mapping table from 16-bit integers to schemainstances. Because of this, back-mdb still has to do some minor processing toturn an on-disk entry into a slapd in-memory entry. Doing the completetransition to persistent schema was deferred to OpenLDAP 2.6 or later.

We're investigating database (persistence engines) for use in an embeddedenvironment and stumbled across your LMDB database. The features of LMDB mapquite nicely on to our needs and the fact that its very small, very fast, andhighly configurable are also extremely attractive.
Our current (home-grown) OO persistence solution/framework has a long list ofproblems (complicated, large, slow, etc.) that I won't bore you with. Onefeature/mode of your LMDB that piqued our interest is the use of 'fixedaddress memory mapping'. If we understand the feature correctly, we couldeffectively eliminate all our OO serialization/deserialization transformationsby using a custom memory allocator to ensure objects-to-be-persisted aremapped into LMDB memory-mapped page space, thereby allowing LMDB (and theunderlying OS virtual memory management system) to handle all our persistenceneeds. Is this correct?
I've watched several talks you've given on LMDB, read thru the documentation,plus many articles & blogs on LMDB, but I haven't come across much information(or explicit examples) on using the 'fixed memory mapped feature', which leadsme to my questions:
Are we interpreting this feature correctly? Can LMDB be used as a persistenceengine as described? If so, can you point us to documentation and/or examplesthat illustrate this use case? If not, any pointers on LMDBs application asan OO persistence engine would be much appreciated.
Thanks in advance,

-John
P.S. Our application is currently written in a combination of C# and C++.We're in the process of rearchitecting to address a number of issues and willbe moving to a completely native (C++) implementation in the future.



--
  -- Howard Chu
  CTO, Symas Corp.           http://www.symas.com
  Director, Highland Sun     http://highlandsun.com/hyc/
  Chief Architect, OpenLDAP  http://www.openldap.org/project/

Prev by Date: Config questions for back-ldap, back-meta, and back-asyncmeta
Next by Date: [LMDB] how does reader thread read meta page without locking and avoid data race at the same time?
Index(es):
- Chronological
- Thread