Re: 2.4.14 prerelease call for testing



On 09.02.2009 17:11, Quanah Gibson-Mount wrote:
> Please test current RE24 CVS.

test049-sync-config says it succeeds, but then hangs forever with the consumer slapd never stopping.

Output is:
8<----------------------------------------------
>>>>> Starting test049-sync-config ...
running defines.sh
Starting producer slapd on TCP/IP port 9011...
Using ldapsearch to check that producer slapd is running...
Inserting syncprov overlay on producer...
Starting consumer slapd on TCP/IP port 9012...
Using ldapsearch to check that consumer slapd is running...
Configuring syncrepl on consumer...
Waiting 10 seconds for syncrepl to receive changes...
Using ldapsearch to check that syncrepl received config changes...
Adding schema and databases on producer...
Using ldapadd to populate producer...
Waiting 20 seconds for syncrepl to receive changes...
Using ldapsearch to check that syncrepl received database changes...
Replacing olcSyncrepl on producer...
Waiting seconds for syncrepl to receive changes...
sleep: missing operand
Try `sleep --help' for more information.
Using ldapsearch to read config from the producer...
Using ldapsearch to read config from the consumer...
Filtering producer results...
Filtering consumer results...
Comparing retrieved configs from producer and consumer...
Using ldapsearch to read all the entries from the producer...
Using ldapsearch to read all the entries from the consumer...
Filtering producer results...
Filtering consumer results...
Comparing retrieved entries from producer and consumer...
>>>>> Test succeeded
[hangs forever]
8<----------------------------------------------
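
The "Waiting seconds" line and the sleep error in the output above look like an empty shell variable being expanded as the sleep duration. A minimal sketch of that failure mode and a defensive alternative, assuming a hypothetical variable name SLEEP_SECS (the name actually used by the test script is not shown here):

```shell
#!/bin/sh
# SLEEP_SECS is a hypothetical variable name used only for this illustration.

# Fragile: if SLEEP_SECS is unset or empty, the message loses its number
# and sleep is invoked with no operand, which makes it fail.
wait_fragile() {
    echo "Waiting $SLEEP_SECS seconds for syncrepl to receive changes..."
    sleep $SLEEP_SECS
}

# Defensive: fall back to a default when the variable is unset or empty,
# and quote the expansion so sleep always gets exactly one operand.
wait_safe() {
    secs=${SLEEP_SECS:-1}
    echo "Waiting $secs seconds for syncrepl to receive changes..."
    sleep "$secs"
}
```

With SLEEP_SECS unset, wait_fragile prints the message without a number and returns the non-zero status from sleep, while wait_safe sleeps for the one-second default.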

ps shows:
16443 pts/1 Sl+ 0:00 | \_ /home/jclarke/cvs/openldap-RE24/openldap-src/tests/../servers/slapd/slapd -s0 -F ./slapd.d -h ldap://localhost:9012/ -d 261


And the end of slapd.2.log is:
8<----------------------------------------------
daemon: shutdown requested and initiated.
=>do_syncrepl rid=001
=>do_syncrepl rid=002
=>do_syncrepl rid=002
connection_get(19)
connection_get(13)
connection_get(13): got connid=0
connection_get(19): got connid=0
ldap_free_request (origid 2, msgid 2)
ldap_free_request (origid 2, msgid 2)
ldap_free_connection 1 1
ldap_send_unbind
slapd shutdown: waiting for 4 operations/tasks to finish
ldap_free_connection 1 1
ldap_send_unbind
ber_flush2: 7 bytes to sd 13
ber_flush2: 7 bytes to sd 19
ldap_free_connection: actually freed
ldap_free_connection: actually freed
8<----------------------------------------------

Note the "waiting for 4 operations/tasks to finish".

I get this on two different boxes, and have observed similar behaviour with RE24 yesterday while testing on yet another box (not the test suite). If it works OK for you, let me know what data I can provide from my hanging slapd. GDB backtraces are below - dunno if they'll help.

Regards,
Jonathan

From GDB:
(gdb) thread apply all bt

Thread 4 (Thread 0x40b7cb90 (LWP 16444)):
#0 0x4001a430 in __kernel_vsyscall ()
#1 0x40170075 in pthread_cond_wait@@GLIBC_2.3.2 () from /lib/tls/i686/cmov/libpthread.so.0
#2 0x0818d44f in ldap_pvt_thread_pool_destroy (tpool=0x82917e4, run_pending=1) at tpool.c:570
#3 0x080634d7 in slapd_daemon_task (ptr=0x0) at daemon.c:2598
#4 0x4016c50f in start_thread () from /lib/tls/i686/cmov/libpthread.so.0
#5 0x4044ea0e in clone () from /lib/tls/i686/cmov/libc.so.6


Thread 3 (Thread 0x40f7db90 (LWP 16447)):
#0 0x4001a430 in __kernel_vsyscall ()
#1 0x40172d09 in __lll_lock_wait () from /lib/tls/i686/cmov/libpthread.so.0
#2 0x4016e114 in _L_lock_89 () from /lib/tls/i686/cmov/libpthread.so.0
#3 0x4016da42 in pthread_mutex_lock () from /lib/tls/i686/cmov/libpthread.so.0
#4 0x080c8e11 in syncrepl_updateCookie (si=0x8fe4720, op=0x40f7cd44, pdn=<value optimized out>, syncCookie=0x40f7c9ec) at syncrepl.c:2891
#5 0x080cf40a in do_syncrep2 (op=0x40f7cd44, si=0x8fe4720) at syncrepl.c:894
#6 0x080d1e76 in do_syncrepl (ctx=0x40f7d218, arg=0x8fe3a88) at syncrepl.c:1333
#7 0x0806907b in connection_read_thread (ctx=0x40f7d218, argv=0x8) at connection.c:1225
#8 0x0818d93e in ldap_int_thread_pool_wrapper (xpool=0x8f97580) at tpool.c:663
#9 0x4016c50f in start_thread () from /lib/tls/i686/cmov/libpthread.so.0
#10 0x4044ea0e in clone () from /lib/tls/i686/cmov/libc.so.6


Thread 2 (Thread 0x4147fb90 (LWP 16449)):
#0 0x4001a430 in __kernel_vsyscall ()
#1 0x40172d09 in __lll_lock_wait () from /lib/tls/i686/cmov/libpthread.so.0
#2 0x4016e114 in _L_lock_89 () from /lib/tls/i686/cmov/libpthread.so.0
#3 0x4016da42 in pthread_mutex_lock () from /lib/tls/i686/cmov/libpthread.so.0
#4 0x080d1c56 in do_syncrepl (ctx=0x4147f218, arg=0x8fe3a88) at syncrepl.c:1250
#5 0x0818d93e in ldap_int_thread_pool_wrapper (xpool=0x8f97580) at tpool.c:663
#6 0x4016c50f in start_thread () from /lib/tls/i686/cmov/libpthread.so.0
#7 0x4044ea0e in clone () from /lib/tls/i686/cmov/libc.so.6


Thread 1 (Thread 0x405a9c00 (LWP 16443)):
#0 0x4001a430 in __kernel_vsyscall ()
#1 0x4016cbf7 in pthread_join () from /lib/tls/i686/cmov/libpthread.so.0
#2 0x080620eb in slapd_daemon () at daemon.c:2663
#3 0x0804edf9 in main (argc=8, argv=0xbfd9ddf4) at main.c:948
#0 0x4001a430 in __kernel_vsyscall ()
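
For anyone wanting to collect the same data from a hung slapd, here is a minimal sketch of grabbing all thread backtraces non-interactively. It assumes gdb is installed and allowed to attach to the process; the function name dump_threads and the GDB override variable are made up for this example:

```shell
#!/bin/sh
# dump_threads PID
# Attach gdb to a running process, print a backtrace of every thread,
# then detach and exit. The GDB variable (hypothetical, for this sketch
# only) selects an alternative gdb binary; it defaults to "gdb".
dump_threads() {
    pid=$1
    "${GDB:-gdb}" -p "$pid" -batch -ex "thread apply all bt"
}
```

For the process in the ps output above this would be called as `dump_threads 16443 > slapd-threads.txt`, run as the user that owns the slapd process.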