
Re: slow slapadd?

To: Diego Figueroa <dfiguero@yorku.ca>
Subject: Re: slow slapadd?
From: Oskar Pearson <oskar@deckle.co.za>
Date: Sun, 17 May 2009 10:31:40 +0100
Cc: openldap-software@openldap.org
In-reply-to: <OFB3ADCF08.3B7B3B16-ON852575B7.006D2BAF-852575B7.006D5BEE@yorku.ca>
References: <OFE5EA9700.A6186731-ON852575B7.00621719-852575B7.0063CDA6@yorku.ca> <2C5E4A88E474B99E3142A701@STONEKING-LM.CORP.YAHOO.COM> <OFB3ADCF08.3B7B3B16-ON852575B7.006D2BAF-852575B7.006D5BEE@yorku.ca>

Hi Diego

On 15 May 2009, at 20:54, Diego Figueroa wrote:

Thanks for your input Quanah,
I also just noticed that top is reporting 50-90% I/O waiting times. I might have to look at my disks to further improve things.

You may be right, but blaming the disks alone could be an over-simplification.

Random seeks will always create a performance slowdown on physical disks. If you optimise the DB so that you reduce the number of random seeks, you'll get dramatically faster performance.

Realise that if your db is, say, 200MB, you could probably write the whole file contiguously in 3-4 seconds on most server PCs. But if you do 1 seek per object in your 500k item database with reasonable seek-time disks (say 6.5ms), you'll be doing 500000 seeks * 6.5ms = 3250000ms = 3250 seconds = 54 minutes.
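
The back-of-envelope figure above can be reproduced with a few lines of Python. This is only a sketch: the function name is made up for illustration, and the 500000 entries and 6.5ms seek time are the example figures from this mail, not measurements.

```python
def seek_bound_seconds(entries, seeks_per_entry=1, seek_ms=6.5):
    """Estimate wall-clock time spent purely on disk seeks."""
    return entries * seeks_per_entry * seek_ms / 1000.0


if __name__ == "__main__":
    total = seek_bound_seconds(500_000)
    print(f"{total:.0f} seconds, roughly {total / 60:.0f} minutes")
```

Raising seeks_per_entry shows how quickly the estimate grows if each object needs more than one seek.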

http://www.oracle.com/technology/documentation/berkeley-db/db/ref/transapp/throughput.html says that every write can involve the following steps:

	1 Disk seek to database file
	2 Database file read
	3 Disk seek to log file
	4 Log file write
	5 Flush log file information to disk
	6 Disk seek to update log file metadata (for example, inode information)
	7 Log metadata write
	8 Flush log file metadata to disk

So, what to do? Well, if you tune the cache values so that more of the DB stays in memory, you'll see fewer reads. If you assume each item above is equal in wall-clock time, a cache hit removes the first 3 items, which cuts the time for that write by 37.5% (3 of the 8 steps).
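
To see how the cache hit rate feeds into that estimate, here is a small sketch under the same equal-cost assumption. The function name and the hit rates tried are hypothetical; the 8 steps and the 3 skipped steps come from the list above.

```python
def relative_write_cost(hit_rate, total_steps=8, steps_skipped_on_hit=3):
    """Average cost of a write relative to an uncached write (1.0).

    Assumes every step costs the same, which is a simplification.
    """
    if not 0.0 <= hit_rate <= 1.0:
        raise ValueError("hit_rate must be between 0 and 1")
    saved = hit_rate * steps_skipped_on_hit / total_steps
    return 1.0 - saved


if __name__ == "__main__":
    for rate in (0.0, 0.5, 1.0):
        cost = relative_write_cost(rate)
        print(f"hit rate {rate:.0%}: {cost:.4f} of the uncached cost")
```

In practice the steps will not cost the same (a flush is usually far more expensive than a cached read), so treat the result as a rough guide only.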

You could also put the log file on a separate disk. Or you could perhaps put the log file on a ramdisk for your build, and move it to a stable disk after it completes. I'm assuming you don't have sufficient RAM to store the whole db on a ramdisk, which would be the ideal for the build process.

You can also mount your filesystems with -o noatime, which should help by cutting down the metadata updates in step 6. Note you'll have to check whether this breaks other things on your system.
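
If you want to check which filesystems are currently mounted without noatime, something like the following would do. It is a sketch that assumes a Linux system exposing /proc/mounts; the function name is hypothetical.

```python
def mounts_without_noatime(mounts_text):
    """Return mount points whose options do not include noatime.

    mounts_text is expected in /proc/mounts format:
    device mountpoint fstype options dump pass
    """
    missing = []
    for line in mounts_text.splitlines():
        fields = line.split()
        if len(fields) < 4:
            continue  # skip blank or malformed lines
        mount_point, options = fields[1], fields[3].split(",")
        if "noatime" not in options:
            missing.append(mount_point)
    return missing


if __name__ == "__main__":
    try:
        with open("/proc/mounts") as handle:
            print(mounts_without_noatime(handle.read()))
    except OSError:
        print("/proc/mounts is not readable on this system")
```

Only the filesystem holding the database and log files matters for the load, so look for that mount point in the output.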

You could also try fiddling with the DB_TXN_WRITE_NOSYNC and DB_TXN_NOSYNC flags. I've not done that, and you'd have to be 100% sure that once your db goes live, these flags are turned off, or you risk disaster if your db server reboots. I wonder if it's possible for slapadd to turn these on automatically for the load process (perhaps it already does - I'm ignorant on that point, unfortunately).

If you're feeling brave, and are building on a throwaway system (where you can reinstall due to filesystem corruption), you could also use something like hdparm under Linux to change the disks so that they always return writes as successful immediately, even if the data hasn't been written to disk. I don't recommend this, but I've been known to do it when testing on a dev system. I don't have any stats on how much it'd help.

Another thing: I read an article a while back where someone found that InnoDB file fragmentation on MySQL dbs created a massive slowdown over time with random small writes to the file. The solution was fairly simple - move the files to a different directory, make a copy back into the original directory, and start the db again running off the copy. The new files will be written contiguously with very little fragmentation. It's not possible to do this mid-stream in the load on a new DB, but it may be a good practice once you have a very large complete DB file that's been built over time.
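
The move-and-copy-back step could be scripted roughly like this. It is a sketch, not something I've run against a real LDAP or MySQL directory: the function name and both directory arguments are hypothetical, the database must be stopped first, and whether the fresh copy really ends up contiguous depends on the filesystem and how full it is.

```python
import shutil
from pathlib import Path


def rewrite_files(db_dir, staging_dir):
    """Move every regular file out of db_dir, then copy it back.

    The originals stay in staging_dir as a backup until you remove
    them yourself. Returns the names of the files that were rewritten.
    """
    db_dir, staging_dir = Path(db_dir), Path(staging_dir)
    staging_dir.mkdir(parents=True, exist_ok=True)
    copied = []
    # Build the file list up front so moving files does not disturb iteration.
    for original in sorted(p for p in db_dir.iterdir() if p.is_file()):
        staged = staging_dir / original.name
        shutil.move(str(original), str(staged))   # move the old file away
        shutil.copy2(str(staged), str(original))  # write a fresh copy in one pass
        copied.append(original.name)
    return copied
```

Keeping the staged originals around means you can fall back to them if the copy is interrupted.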

I'd be really interested in which of those items helps the most. Let people know if you play and find something interesting. Hopefully it helps someone else!


Oskar
