
Re: Issues arising from creating powerdns backend based on LMDB



On 23/08/13 04:55, Howard Chu wrote:
> Howard Chu wrote:
>> Yes, I see it here, and I see the problem. LMDB was not originally
>> designed to handle transactions of unlimited size. It originally had a
>> txn size limit of about 512MB. In 0.9.7 we added some code to raise
>> this limit, and it's performing quite poorly here. I've tweaked my copy
>> of the code to alleviate that problem, but your test program still
>> fails here because the volume of data being written also exceeds the
>> map size. You were able to run this to completion?
>
> Two things... I've committed a patch to mdb.master to help this case
> out. It sped up my run of your program, using only 10M records, from
> 19min to 7min.
>
> Additionally, if you change your test program to commit every 2M
> records, and avoid running into the large txn situation, then the 10M
> records are stored in only 1m51s.
>
> Running it now with the original 100M count. Will see how it goes.
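To make sure I've understood the commit-every-2M suggestion, it amounts to roughly this (a minimal sketch, not my actual test program: the path, map size, and key/value construction are made up, and error handling is mostly omitted):

    #include <stdio.h>
    #include "lmdb.h"

    #define COMMIT_EVERY  2000000L     /* commit every 2M records */
    #define TOTAL_RECORDS 10000000L    /* 10M records */

    int main(void)
    {
        MDB_env *env;
        MDB_dbi dbi;
        MDB_txn *txn;
        MDB_val key, val;
        char kbuf[32], vbuf[64];
        long i;
        int rc;

        mdb_env_create(&env);
        /* map size must cover the whole dataset; 10GB is a guess here
           (assumes a 64-bit build) */
        mdb_env_set_mapsize(env, 10UL * 1024 * 1024 * 1024);
        /* ./testdb must already exist as a directory */
        rc = mdb_env_open(env, "./testdb", MDB_NOSYNC, 0664);
        if (rc) { fprintf(stderr, "mdb_env_open: %s\n", mdb_strerror(rc)); return 1; }

        mdb_txn_begin(env, NULL, 0, &txn);
        mdb_dbi_open(txn, NULL, 0, &dbi);

        for (i = 0; i < TOTAL_RECORDS; i++) {
            key.mv_size = snprintf(kbuf, sizeof kbuf, "key%ld", i);
            key.mv_data = kbuf;
            val.mv_size = snprintf(vbuf, sizeof vbuf, "value%ld", i);
            val.mv_data = vbuf;
            rc = mdb_put(txn, dbi, &key, &val, 0);
            if (rc) { fprintf(stderr, "mdb_put: %s\n", mdb_strerror(rc)); return 1; }

            /* commit and reopen periodically so no single txn grows huge */
            if ((i + 1) % COMMIT_EVERY == 0) {
                mdb_txn_commit(txn);
                mdb_txn_begin(env, NULL, 0, &txn);
            }
        }
        mdb_txn_commit(txn);
        mdb_env_close(env);
        return 0;
    }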

I never actually ran it to completion (hence the map size issue); the 100M count was just an effectively unlimited number to investigate the slowdown, and 10M seems fine. I just pulled from git (assuming that is more current than the patch you sent) and rebuilt. It certainly seems a bit better now, although at around 6M records (ext4) the IO becomes awful: throughput drops to 1MB/s in places on our normal disk (the first few writes run at 100MB/s, then it starts writing all over the place). I've tried both ext4 and xfs with no special tuning and see much the same behaviour, although the dropoff happens closer to 7M records on xfs. This is with the MDB_NOSYNC option, too.

If I set the commit interval to 1M records, performance is fine up to around 8.4M records on ext4 and then the program just stops for a minute or two doing small writes; the same thing happens at about 9.4M. So it seems the patch has pushed the performance dropoff back a bit and perhaps softened it, but as far as I can see there is still an issue there.

The test program with 10M records, committing every 1M, completes in 1m10s user time but 5m30s real time because of all the pausing for disk writes (this was ext4, but as above the filesystem doesn't seem to make much difference compared to xfs). The same program and latest git on an SSD-backed system (i.e. where a massive number of small write transactions doesn't cause any issues) with a slightly faster CPU: 47s user time, 1min real time. On the SSD-backed box without any intermediate commits: 5m30s user time, 6min real time.

So committing every 1-2M records is much better. I don't mind using short transactions (in fact the program doesn't actually need transactions at all). Perhaps it would be good to have an "allow LMDB to automatically commit and reopen this transaction for optimal performance" flag, or some way of easily knowing when the txn should be committed and reopened, rather than trying to guess roughly how many bytes I've written since the last commit and committing once that exceeds some magic number like 400MB?
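For now, the workaround I have in mind is just a byte-counting heuristic along these lines (the helper and the 400MB threshold are mine, not anything LMDB exposes):

    #include <stddef.h>
    #include "lmdb.h"

    /* Guessed threshold: commit once roughly this many bytes have been
       written since the last commit. A magic number on my side; LMDB
       doesn't expose its internal txn size limit. */
    #define TXN_BYTE_LIMIT (400UL * 1024 * 1024)

    /* Call after each mdb_put(): commits and reopens *txn once the byte
       estimate passes the threshold. Ignores page overhead; error
       handling omitted. */
    static void maybe_commit(MDB_env *env, MDB_txn **txn, size_t *bytes,
                             const MDB_val *key, const MDB_val *val)
    {
        *bytes += key->mv_size + val->mv_size;
        if (*bytes > TXN_BYTE_LIMIT) {
            mdb_txn_commit(*txn);
            mdb_txn_begin(env, NULL, 0, txn);
            *bytes = 0;
        }
    }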

Also, I don't know how intentional the 512MB limit you mention is, but perhaps it could be made settable at runtime. That way I could just set it to half the box's memory and ensure nothing needs to be written out until the whole dataset has been generated.
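Something along these lines, say (entirely hypothetical, no such call exists in LMDB today):

    /* hypothetical API: raise the internal txn size limit at runtime */
    mdb_env_set_txnsize(env, total_ram_bytes / 2);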

By the way, looking at `free` output seems to imply that `top` is lying about how much memory the program is using: resident looks like it is capped at 500MB, but it keeps rising along with shared, which is presumably the pages of the mmap that are currently in memory.

Regarding the SSD vs HDD performance differences: I did see similar disk write stalls with Kyoto Cabinet, and for that we generate onto a RAM disk. It seems a bit strange to have to do the same with LMDB, though, given it's advertised as a memory-mapped database.

Mark