[Date Prev][Date Next] [Chronological] [Thread] [Top]

Strange hang scenario, resumes after idletimeout, but plenty of FDs available

To: openldap-technical@openldap.org
Subject: Strange hang scenario, resumes after idletimeout, but plenty of FDs available
From: Kartik Subbarao <subbarao@computer.org>
Date: Wed, 01 Jun 2011 08:43:13 -0400
User-agent: Mozilla/5.0 (X11; U; Linux x86_64; en-US; rv:1.9.2.17) Gecko/20110428 Fedora/3.1.10-1.fc14 Thunderbird/3.1.10

I'm running into the following scenario. Shortly after slapd getsbombarded by a burst of operations (from several different clients) onexisting connections (well under the max number of connections, about3000 out of 16384), it suddenly hangs. It's not responsive to any newconnections, and doesn't process operations on existing connections.Load average is near zero during this time, so it's not doing anything.After 20 minutes (idletimeout), slapd frees several connections (maybesay 1000), and resumes working again as if nothing happened.

The load pattern that gets it into this state happens every hour, almoston the hour (most likely associated with nslcd and cron jobs, whichwe're looking to mitigate elsewise). Another strange thing is that slapdwill survive one instance's worth of bombardment without hanging, butthe *next* hour will go into a hang state.

Are there any resources other than file descriptors that are freed upduring the idletimeout processing? Are there any other parameters thatcan be tuned besides idletimeout here? Could it possibly be a case ofdeadlock somewhere, something grabbing all the locks? Would things likeset_lk_max_locks be relevant to investigate here? Any log level settingsthat might reveal more of what's happening here?


Thanks for any suggestions on things to look at and try.

	-Kartik

Follow-Ups:
- Re: Strange hang scenario, resumes after idletimeout, but plenty of FDs available
  - From: Kartik Subbarao <subbarao@computer.org>
- Re: Strange hang scenario, resumes after idletimeout, but plenty of FDs available
  - From: David Hawes <dhawes@vt.edu>

Prev by Date: Valid reasons to choose OpenLDAP over Oracle Directory Server for Linux clients?
Next by Date: Re: Strange hang scenario, resumes after idletimeout, but plenty of FDs available
Index(es):
- Chronological
- Thread