Issue 77 - Deadlock
Summary: Deadlock
Status: VERIFIED FIXED
Alias: None
Product: OpenLDAP
Classification: Unclassified
Component: slapd (show other issues)
Version: unspecified
Hardware: All All
: --- normal
Target Milestone: ---
Assignee: OpenLDAP project
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 1999-02-18 17:17 UTC by gpayet@oceangroup.com
Modified: 2014-08-01 21:06 UTC (History)
0 users

See Also:


Attachments

Note You need to log in before you can comment on or make changes to this issue.
Description gpayet@oceangroup.com 1999-02-18 17:17:39 UTC
Full_Name: Guillermo Payet
Version: 1.2
OS: Red Hat Linux 5.1
URL: 
Submission from: (NULL) (208.242.218.40)



Hello,

We've been running OpenLDAP 1.0.3 over dn 1.85 for a while, 
with no major probelms.   Last Friday, I downloaded 1.2 and 
DB 2.3.16.  After some initial trouble compiling the new 
version (problems with config.cache), I got a system up and 
running.

Now slapd will run fine for a few hours, and then it will 
deadlock.  I haven't been able to trigger the problem at will.  
Sometimes it hangs, sometimes it doesn't, and it seems to 
be independent of load.  Once it hangs I have to killall -9 
slapd.

I thought it might be a threading issue, but if I pound the 
server hard with a few clients at the same time, it will run 
fine, and then it will freeze a few hours later with only one 
user.

How should I approach this?  what loglevel should I use to 
obtain a meaningful log file?

    thanks

    --Guillermo Payet


Comment 1 gpayet@oceangroup.com 1999-02-18 22:22:06 UTC
Froze again!!

This time I managed to get some more info...

We were running slapd with errorlevel 4.  Now every time I attempt a
call to the server, it logs:

	ldap_pvt_thread_create failed (1)

If I attempt a "killall slapd", slapd logs:

	slapd shutting down - waiting for 10 threads to terminate

and just waits...  only way to kill it is with "kill -9"

I have gdb attached to slapd, but in my ignorance, I have no clue as to 
what to do next!

	any hints?

	--G



At 09:17 AM 2/18/99 , you wrote:
>Full_Name: Guillermo Payet
>Version: 1.2
>OS: Red Hat Linux 5.1
>URL: 
>Submission from: (NULL) (208.242.218.40)
>
>
>
>Hello,
>
>We've been running OpenLDAP 1.0.3 over dn 1.85 for a while, 
>with no major probelms.   Last Friday, I downloaded 1.2 and 
>DB 2.3.16.  After some initial trouble compiling the new 
>version (problems with config.cache), I got a system up and 
>running.
>
>Now slapd will run fine for a few hours, and then it will 
>deadlock.  I haven't been able to trigger the problem at will.  
>Sometimes it hangs, sometimes it doesn't, and it seems to 
>be independent of load.  Once it hangs I have to killall -9 
>slapd.
>
>I thought it might be a threading issue, but if I pound the 
>server hard with a few clients at the same time, it will run 
>fine, and then it will freeze a few hours later with only one 
>user.
>
>How should I approach this?  what loglevel should I use to 
>obtain a meaningful log file?
>
>    thanks
>
>    --Guillermo Payet
>
>
>


~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Guillermo Payet                         voice:       408.466.0700
O C E A N    G R O U P                  http://www.oceangroup.com
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Comment 2 Kurt Zeilenga 1999-02-18 22:57:06 UTC
moved from Incoming to Software Bugs
Comment 3 gpayet@oceangroup.com 1999-02-24 23:32:26 UTC
The problem was due to the buggy phtreads implementation in glibc inb Red Hat 
linux 5.1. (http://www.openldap.com/faq/index.cgi?file=92&showLastModified=1)

I upgraded glibc, recompiled, and that fixed it...

   --G


~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Guillermo Payet                         voice:       831.466.0700
O C E A N    G R O U P                  http://www.oceangroup.com
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Comment 4 Kurt Zeilenga 1999-03-01 20:01:19 UTC
changed notes
changed state Open to Closed
Comment 5 Peter Duffy 1999-03-02 09:56:01 UTC
gpayet@oceangroup.com wrote:
> 
> The problem was due to the buggy phtreads implementation in glibc inb Red Hat
> linux 5.1. (http://www.openldap.com/faq/index.cgi?file=92&showLastModified=1)
> 
> I upgraded glibc, recompiled, and that fixed it...
> 
>    --G
> 
> ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
> Guillermo Payet                         voice:       831.466.0700
> O C E A N    G R O U P                  http://www.oceangroup.com
> ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

-- 
peter@netline.net.uk
Peter Duffy
Unix Developer
Netline (UK) Ltd.
Comment 6 Peter Duffy 1999-03-02 10:06:36 UTC
We're using openldap 1.2.0 and Redhat 5.2 (with glibc-2.0.7-29,
including libpthread-0.7.so, apparently at the same level as described.)
We're still seeing an intermittent deadlock problem which seems to match
this description.

(Sorry about the previous blank followup to this: I hit the send button
prematurely...)


gpayet@oceangroup.com wrote:
> 
> The problem was due to the buggy phtreads implementation in glibc inb Red Hat
> linux 5.1. (http://www.openldap.com/faq/index.cgi?file=92&showLastModified=1)
> 
> I upgraded glibc, recompiled, and that fixed it...
> 
>    --G
> 
> ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
> Guillermo Payet                         voice:       831.466.0700
> O C E A N    G R O U P                  http://www.oceangroup.com
> ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Regards,
Peter

-- 
peter@netline.net.uk
Peter Duffy
Unix Developer
Netline (UK) Ltd.
Comment 7 OpenLDAP project 2014-08-01 21:06:51 UTC
Vendor bug.  see follup #2.