[Date Prev][Date Next] [Chronological] [Thread] [Top]

Re: (ITS#4088) hdb deadlock between syncrepl/slapcat



richton@nbcs.rutgers.edu wrote:
> Full_Name: Aaron Richton
> Version: 2.3.11
> OS: Solaris 9
> URL: ftp://ftp.openldap.org/incoming/richton-051014-debuginfo.txt
> Submission from: (NULL) (67.85.180.27)
>
>
> Kicking the tires on 2.3.11 on a test machine. I've got a slapd doing syncrepl
> (consumer), and a slapcat, that appear deadlocked:
>
> slapd/6:lwp_cond_wait(0xFDE492F0, 0xFDE492D8, 0x00000000) (sleeping...)     
> slapcat:lwp_cond_wait(0xFE8FBB30, 0xFE8FBB18, 0x00000000) (sleeping...)
>
> I don't want to spam a 15k e-mail out; see FTP URL for debug traces.
>
>   
Interesting. Your current slapcat is using locker ID ec, the page in 
conflict is held by locker ID ef. A slapcat process only uses a single 
cursor to dump the id2entry DB so it probably is not the owner of ef. 
Locker ef has no other locks and is not waiting for any other locks; my 
guess is that it's a stale lock from a previously aborted slapcat run. I 
don't see why any part of the slapd task would have acquired that lock. 
Unfortunately BDB doesn't record process or thread IDs with locker IDs 
so there's no quick way to find the culprit.

-- 
  -- Howard Chu
  Chief Architect, Symas Corp.  http://www.symas.com
  Director, Highland Sun        http://highlandsun.com/hyc
  OpenLDAP Core Team            http://www.openldap.org/project/