[Date Prev][Date Next] [Chronological] [Thread] [Top]

slapd shuts down unexpectedly



Hi

We are running OpenLDAP v 2.1.22 with bdb 4.2.52 on Solaris 9. It was
installed two years ago and we didn't have any problems until recently after
we installed an application that does a lot of search operations on ldap.
Everything looked fine for 5 days then slapd stopped. It was restarted by
the cluster but after that it would stop twice a day.
At some point we ran db_recover, restarted the server. It ran for five days
with no problems and then stopped again.
It looks like we are running out of some resource. 

This is what we get in log file - looks like the process is killed by
something:

Nov 10 08:19:19 up26 slapd[22985]: conn=673562 fd=10 closed
Nov 10 08:19:19 up26 slapd[22985]: conn=673353 fd=15 closed
Nov 10 08:19:19 up26 slapd[22985]: slapd shutdown: waiting for 0 threads to
term
inate
Nov 10 08:19:22 up26 slapd[22985]: slapd stopped.
Nov 10 08:19:35 up26 slapd[3074]: bdb_initialize: Sleepycat Software:
Berkeley D
B 4.2.52: (December  3, 2003)
Nov 10 08:19:35 up26 slapd[3074]: bdb_db_init: Initializing BDB database
Nov 10 08:19:35 up26 slapd[3075]: slapd starting

This is DB_CONFIG:

set_cachesize 1 0 0
set_lg_max 209715200
set_lg_bsize 52428800

This is the output from dbstat -m:

1GB   Total cache size.
1 Number of caches.
1GB   Pool individual cache size.
0 Requested pages mapped into the process' address space.
2255077 Requested pages found in the cache (99%).
14989 Requested pages not found in the cache.
58    Pages created in the cache.
14989 Pages read into the cache.
2 Pages written from the cache to the backing file.
0 Clean pages forced from the cache.
0 Dirty pages forced from the cache.
0 Dirty pages written by trickle-sync thread.
15047 Current total page count.
12782 Current clean page count.
2265  Current dirty page count.
131071  Number of hash buckets used for page location.
2284990 Total number of times hash chains searched for a page.
3 The longest hash chain searched for a page.
2291024 Total number of hash buckets examined for page location.
4593195 The number of hash bucket locks granted without waiting.
3 The number of hash bucket locks granted after waiting.
3 The maximum number of times any hash bucket lock was waited for.
30177 The number of region locks granted without waiting.
6 The number of region locks granted after waiting.
15063 The number of page allocations.
0 The number of hash buckets examined during allocations
0 The max number of hash buckets examined for an allocation
0 The number of pages examined during allocations
0 The max number of pages examined for an allocation
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
Pool File: objectClass.bdb
4096  Page size.
0 Requested pages mapped into the process' address space.
138012  Requested pages found in the cache (100%).
161   Requested pages not found in the cache.
0 Pages created in the cache.
161   Pages read into the cache.
1 Pages written from the cache to the backing file.
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
Pool File: uid.bdb
4096  Page size.
0 Requested pages mapped into the process' address space.
225558  Requested pages found in the cache (100%).
259   Requested pages not found in the cache.
0 Pages created in the cache.
259   Pages read into the cache.
1 Pages written from the cache to the backing file.
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
Pool File: dn2id.bdb
4096  Page size.
0 Requested pages mapped into the process' address space.
744661  Requested pages found in the cache (99%).
7141  Requested pages not found in the cache.
30    Pages created in the cache.
7141  Pages read into the cache.
0 Pages written from the cache to the backing file.
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
Pool File: id2entry.bdb
16384 Page size.
0 Requested pages mapped into the process' address space.
1146846 Requested pages found in the cache (99%).
7428  Requested pages not found in the cache.
28    Pages created in the cache.
7428  Pages read into the cache.
0 Pages written from the cache to the backing file.


dbstat -m output:

37    Last allocated locker ID.
2147M Current maximum unused locker ID.
9 Number of lock modes.
1000  Maximum number of locks possible.
1000  Maximum number of lockers possible.
1000  Maximum number of lock objects possible.
6 Number of current locks.
31    Maximum number of locks at any one time.
33    Number of current lockers.
34    Maximum number of lockers at any one time.
6 Number of current lock objects.
28    Maximum number of lock objects at any one time.
2996109 Total number of locks requested.
2996103 Total number of locks released.
0 Total number of lock requests failing because DB_LOCK_NOWAIT was set.
153   Total number of locks not immediately available due to conflicts.
0 Number of deadlocks.
0 Lock timeout value.
0 Number of locks that have timed out.
0 Transaction timeout value.
0 Number of transactions that have timed out.
400KB The size of the lock region..
834   The number of region locks granted after waiting.
4669455 The number of region locks granted without waiting.

I realize that we are having pretty old version and believe that bdb was
never patched - we probably need to upgrade to a more
recent version. But still what can it be? I am quite new to this so I'll
appreciate any help or advise.

Thank you
Alla


-----------------------------------------
The information in this message may be proprietary and/or confidential,
and protected from disclosure.  If the reader of this message is not
the intended recipient, or an employee or agent responsible for
delivering this message to the intended recipient, you are hereby
notified that any dissemination, distribution or copying of this
communication is strictly prohibited. If you have received this
communication in error, please notify First Data immediately by
replying to this message and deleting it from your computer.