[Date Prev][Date Next] [Chronological] [Thread] [Top]

(ITS#6158) syncprov: assert causing slapd to core dump



Full_Name: Jonathan Clarke
Version: 2.3.43
OS: Solaris
URL: ftp://ftp.openldap.org/incoming/
Submission from: (NULL) (213.41.243.192)


Hi,

I have a 2.3.43 running on a Solaris Sparc server, which crashes occasionally -
once every week or two, always during the night. At this particular time a large
number of operations are performed, including mass deletes and adds. I haven't
been able to reproduce this bug, just watch it happen on the production server
every now and again...

I managed to obtain a coredump, and a backtrace (at the end of this message). I
realize this isn't much to go on, but I'm rather unfamiliar with this part of
the code, so I wondered if anyone has an idea what's going on here?

FWIW, the dynlist and chain overlays are in use on the server, and the database
is bdb, with a syncrepl consumer as well as syncprov overlay.


Backtrace follows:
8<-------------------------------------------------------------
Thread 1 (process 1054014    ):
#0  0xfee4aa58 in _lwp_kill () from /lib/libc.so.1
#1  0xfede5a64 in raise () from /lib/libc.so.1
#2  0xfedc1954 in abort () from /lib/libc.so.1
#3  0xfedc1b90 in _assert () from /lib/libc.so.1
#4  0xff30ef44 in ldap_pvt_runqueue_resched (rq=0x16c630, entry=0xee6c0a0,
defer=0) at rq.c:165
#5  0xfe7f4a94 in syncprov_qstart (so=0x10acb540) at syncprov.c:933
#6  0xfe7f4d6c in syncprov_qresp (opc=0x1b1bfaf8, so=0x10acb540, mode=2) at
syncprov.c:982
#7  0xfe7f5aa4 in syncprov_matchops (op=0xf6bffa50, opc=0x1b1bfaf8, saveit=0) at
syncprov.c:1175
#8  0xfe7f7490 in syncprov_op_response (op=0xf6bffa50, rs=0xf6bff644) at
syncprov.c:1561
#9  0x000575cc in ?? ()
#10 0x000575cc in ?? ()
8<-------------------------------------------------------------

Thanks in advance for any pointers!

Regards,
Jonathan