[Date Prev][Date Next] [Chronological] [Thread] [Top]

Re: syncrepl problem?

To: openldap-technical@openldap.org
Subject: Re: syncrepl problem?
From: Matthew Backes <mbackes@symas.com>
Date: Thu, 13 Aug 2009 16:23:55 -0700
Cc: tyler@beloit.edu
In-reply-to: <007e01ca1c4c$7541cfe0$5fc56fa0$@edu>
References: <007e01ca1c4c$7541cfe0$5fc56fa0$@edu>

We are running 2.3.43 Openldap on Centos 5.3 systems. I have oneprovider and two consumers. I believe the consumers were workingfine in terms of receiving replication data and staying synchronizeduntil today. I have this entry in slapd.conf


Consider upgrading, but that should be unrelated.

syncrepl rid=102
    type=refreshAndPersist
    interval=00:01:00:00


interval= is for refreshOnly

You want retry= to specify a retry period, or else any interruptionwill halt replication.

The problem is that I had to completely restore the provider’sentire ldap database from a backup ldif file after screwing up over200 accounts. I got the provider back to the way I wanted, but nowthe consumers won’t synchronize (replicate) any more.

Hopefully that was a backup taken with slapcat, or preserving all ofthe metadata using a careful search. (Check for entryUUID/entryCSN)Worst case you can pull that from the replica.

Did you remember to slapadd on the master side with -w so thatcontextCSN exists and is up to date?

What do you see in the logs? Does your restored database still haveyour replication account, sufficient ACLs/limits, etc in theconfiguration? What does contextCSN look like on each side? DoentryUUIDs match on objects with matching DNs?

1. Should syncrepl ultimately be able to replicate after amajor change to the provider such as a ldif restoration? Or shouldI expect to have to reload the consumer entries from scratch from aprovider generated ldif in situations like this?

If you loaded the right LDIF, (i.e. didn't generate entirely new andunrelated data with different uuid/csn info) then this really shouldnot be a major change.

If you loaded correct but old data with a lower contextCSN than thecontextCSN on the replica, then you will probably lose all of thechanges still present on the replica.

I see no reason why you would want to reload the consumer. In theevent of catastrophic master failure like you describe (lost alldrives in your RAID set, someone did rm -rf /, building fire, etc),you should use the data from the replica. That's one of the mainreasons for having a replica in the first place.

2. I thought I read once that the interval settings was stillimportant for when refreshandpersist missed an update. Is that true?


No.  See retry=

Matthew Backes
Symas Corporation
mbackes@symas.com

Follow-Ups:
- RE: syncrepl problem?
  - From: "Tim Tyler" <tyler@beloit.edu>

References:
- syncrepl problem?
  - From: "Tim Tyler" <tyler@beloit.edu>

Prev by Date: syncrepl problem?
Next by Date: Re: OpenLDAP + Kerberos on FreeBSD 7.2, close to working but not quite
Index(es):
- Chronological
- Thread