
Re: syncrepl: large datasets and expediting consumer's initialization

To: openldap-software@openldap.org
Subject: Re: syncrepl: large datasets and expediting consumer's initialization
From: Paul Fardy <paul.fardy@utoronto.ca>
Date: Mon, 5 Apr 2010 21:46:53 -0230
Cc: Paul Fardy <paul.fardy@utoronto.ca>
In-reply-to: <561A3CA2A80CF9F95461E17D@[192.168.1.2]>
References: <84E1440F-999F-4820-BD70-8A8A1DC41C4E@utoronto.ca> <561A3CA2A80CF9F95461E17D@[192.168.1.2]>

My DB_CONFIG:

set_cachesize 0 268435456 1
set_lg_regionmax 262144
set_lg_bsize 2097152
set_lg_dir logs
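
As a sanity check on those sizes, here is a minimal Python sketch that decodes the size directives into bytes. It assumes the usual Berkeley DB meaning of the set_cachesize arguments (gigabytes, additional bytes, number of cache segments); the helper name decode_sizes is hypothetical and only for illustration.

```python
# Decode the size-valued directives of a Berkeley DB DB_CONFIG into bytes.
# The text below is copied from the DB_CONFIG shown above.
DB_CONFIG = """\
set_cachesize 0 268435456 1
set_lg_regionmax 262144
set_lg_bsize 2097152
set_lg_dir logs
"""

GIB = 1024 ** 3


def decode_sizes(text):
    """Return {directive: size_in_bytes} for the size-valued directives."""
    sizes = {}
    for line in text.splitlines():
        parts = line.split()
        if not parts or parts[0].startswith("#"):
            continue  # skip blank lines and comments
        name, args = parts[0], parts[1:]
        if name == "set_cachesize":
            # Assumed argument order: gigabytes, extra bytes, cache segments.
            gbytes, extra_bytes, _ncache = (int(a) for a in args)
            sizes[name] = gbytes * GIB + extra_bytes
        elif name in ("set_lg_regionmax", "set_lg_bsize"):
            sizes[name] = int(args[0])
    return sizes


if __name__ == "__main__":
    for name, size in decode_sizes(DB_CONFIG).items():
        print(f"{name}: {size} bytes = {size / 1024 ** 2:g} MiB")
```

Under that reading, the settings amount to a 256 MiB cache in a single segment, a 256 KiB log region, and a 2 MiB log buffer.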



The filesystem is ext3 on RHEL5.

-q enable quick (fewer integrity checks) mode. Does fewer consistency checks on the input data, and no consistency checks when writing the database. Improves the load time but if any errors or interruptions occur the resulting database will be unusable.

That last bit was enough for me not to use -q, but it did reduce load time to 17 minutes.

The performance of slapadd is significant, but what about syncrepl? Why is the consumer reviewing every object? Reviewing "-q", I discovered

-w write syncrepl context information. After all entries are added, the contextCSN will be updated with the greatest CSN in the database.


And that looks like an option that would prime my syncrepl info. So

	slapadd -q -w -l SLAPCAT.LDIF

took 14 minutes to build and then 3 minutes to close the databases. This consumer has the same hardware as the provider that took 35 minutes to rebuild the database.

That "slapadd -w" looks like the fix. Would someone confirm or reject that?

The provider's log file still shows it's reviewing many records. I guess it's not returning them. Will the log file show the DNs of results (as opposed to visited)?

I restarted the provider with less logging; the logs of full syncrepl scans are sucking up disk space. Only 5 or 6 records would have changed.

Is it normal for the provider to visit many (all?) objects even when the consumer would have a very current CSN?
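
One way to check whether the consumer really is current is to read the contextCSN attribute from the suffix entry on both servers (a base-scope search) and compare the values. The Python sketch below is only an illustration. It assumes the OpenLDAP 2.4 CSN layout of timestamp#count#sid#mod with hexadecimal counters and a single provider (one server ID). The sample CSN values and the function names are made up.

```python
from typing import NamedTuple


class CSN(NamedTuple):
    timestamp: str  # fixed-width UTC time, e.g. "20100405211653.000000Z"
    count: int      # change counter within the same timestamp
    sid: int        # server ID that generated the change
    mod: int        # modification counter


def parse_csn(value: str) -> CSN:
    """Split a CSN string into its four '#'-separated fields."""
    timestamp, count, sid, mod = value.strip().split("#")
    # Assumption: the counter, SID and mod fields are hexadecimal.
    return CSN(timestamp, int(count, 16), int(sid, 16), int(mod, 16))


def consumer_is_current(provider_csn: str, consumer_csn: str) -> bool:
    """True if the consumer's contextCSN is at least the provider's."""
    p, c = parse_csn(provider_csn), parse_csn(consumer_csn)
    if p.sid != c.sid:
        raise ValueError("CSNs carry different server IDs; compare per SID")
    # Timestamps are fixed-width, so plain string comparison orders them.
    return (c.timestamp, c.count, c.mod) >= (p.timestamp, p.count, p.mod)


if __name__ == "__main__":
    # Hypothetical values, not taken from any real server.
    provider = "20100405211653.000000Z#000000#000#000000"
    consumer = "20100405201500.000000Z#000000#000#000000"
    print(consumer_is_current(provider, consumer))
```

If the two values match right after "slapadd -q -w", the consumer starts with the same context information as the provider.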


Thanks for your help,

Paul

