Re: (ITS#3665) Multi-Listener Thread Support

To: openldap-devel@openldap.org
Subject: Re: (ITS#3665) Multi-Listener Thread Support
From: Emmanuel Lécharny <elecharny@apache.org>
Date: Mon, 02 Aug 2010 08:34:15 +0200
In-reply-to: <4C563AE2.20705@symas.com>
Organization: The Apache Software Foundation
References: <4C537C35.2000909@symas.com> <4C538213.1040801@gmail.com> <4C563AE2.20705@symas.com>
User-agent: Mozilla/5.0 (Macintosh; U; Intel Mac OS X 10.6; en-US; rv:1.9.2.7) Gecko/20100713 Thunderbird/3.1.1

 On 8/2/10 5:26 AM, Howard Chu wrote:

<snip/>
Why would you have more than one select()? Wouldn't it be better to
have one thread processing the select() and dispatching the operation to
a pool of threads?
That's what we have right now, and as far as I can see it's a bottleneck that prevents us from utilizing more than 12 cores. (I could be wrong, and the bottleneck might actually be in the thread pool manager. I haven't got precise enough measurements yet to know for sure.)
Here's the situation: suppose you have thousands of clients connected and active. Even if you have CPUs to spare, the number of connections you can acknowledge and dispatch is limited by the speed of the single thread that's processing select(). Even if all it does is walk thru the list of active descriptors and dispatch a job to the thread pool for each one, it's only possible to dispatch a fixed number of ops/second, no matter how many other CPUs there are.
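
The single-listener arrangement described above can be sketched roughly as follows. This is a minimal illustration in Python, not slapd's actual code: `handle_read` and `listener_loop` are hypothetical names, and the "request handling" is a stand-in that upper-cases the bytes it receives. The point is only that every ready descriptor passes through the one loop before any worker sees it.

```python
import selectors
from concurrent.futures import ThreadPoolExecutor

def handle_read(conn):
    """Worker-side job (hypothetical): read one request and send a reply."""
    data = conn.recv(4096)
    if data:
        conn.sendall(data.upper())  # stand-in for real request processing
    return len(data)

def listener_loop(sel, pool, rounds):
    """Single listener: wait for readable descriptors, dispatch one job each."""
    futures = []
    for _ in range(rounds):
        for key, _mask in sel.select(timeout=0.5):
            # Unregister first so the next select() call does not report the
            # same descriptor again while a worker is still reading from it.
            sel.unregister(key.fileobj)
            futures.append(pool.submit(handle_read, key.fileobj))
    return futures
```

However many workers the pool has, the number of jobs submitted per second is bounded by how fast this one loop can run.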

I'm a bit surprised that the select() processing *is* the bottleneck... All in all, it's just, internally, a matter of processing a bit field to see which bit is set to 1, and then getting back the FD that is associated with this bit. You must have some other tasks running that create this bottleneck.
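
To make the bit-field point concrete, here is a small sketch, assuming the ready set is modelled as a plain integer where bit N stands for descriptor N. The function name `ready_descriptors` is hypothetical, and a real fd_set is an array of machine words rather than a Python integer, but the scan has the same shape.

```python
def ready_descriptors(bitmask):
    """Return the descriptor numbers whose bit is set in an fd_set-like integer."""
    ready = []
    fd = 0
    while bitmask:
        if bitmask & 1:        # this descriptor was reported as ready
            ready.append(fd)
        bitmask >>= 1          # move on to the next descriptor's bit
        fd += 1
    return ready
```

The work per call grows with the highest descriptor number, not with the number of ready descriptors, which is cheap per bit but still serial.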


I will have to check OpenLDAP code here...

Right now on a 24 core server I'm seeing 48,000 searches/second and 50% CPU utilization. Adding more clients only seems to increase the overall latency, but CPU usage and throughput don't increase any further.

Have you tried something we did on ADS: remove all the processing so that you simply have a mock LDAP server, where only the network part is studied?


I.e., we just send back a mock response when a request has been received.

It helps to focus only on the network layer.

(Sadly, as the PDU decoding is a costly operation, you may have to take that into account.)
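
A rough sketch of what such a mock could look like, in Python for brevity. `mock_serve` and `CANNED_RESPONSE` are hypothetical names, and the reply bytes are a placeholder: a real experiment would send a well-formed LDAP response PDU so that the clients accept it.

```python
# Placeholder reply; NOT a valid LDAP PDU, only a marker for illustration.
CANNED_RESPONSE = b"MOCK-RESPONSE"

def mock_serve(conn, max_requests):
    """Answer each incoming request with a canned reply, without decoding it."""
    served = 0
    while served < max_requests:
        request = conn.recv(4096)   # read the request bytes but never parse them
        if not request:             # peer closed the connection
            break
        conn.sendall(CANNED_RESPONSE)
        served += 1
    return served
```

Because nothing is decoded and no backend is touched, whatever throughput limit remains comes from the network and dispatch path.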

Also in the test I'm looking at, some of the issues I pointed to above aren't even happening. E.g., none of the operations are blocking on write. It's all about handling read events, nothing else. (So again, maybe it's OK to leave the current write behavior as-is.)

One idea Jean-François Arcand devised was to have two select(): one for the reads, one for the writes. I'm not sure it's really interesting, but it may be worth investigating.

Another idea, and this is what we have implemented in ADS, is to have more than one select() thread for reads/writes (in fact, we compute the number of processors, and create as many threads as we have cores, plus one. Each thread processes a select() of course). I'm not convinced that, in our case, it helps a lot, but it may be helpful in your case if you don't want to, or can't, modify the way requests are being processed in OpenLDAP.
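
As an illustration of the multi-selector layout, here is a minimal sketch in Python. It is not the ADS implementation: all function names are hypothetical, connections are assigned round-robin as an assumption, the platform's default selector stands in for select(), and the handler simply echoes the request. Python threads also will not scale across cores the way native threads do, so this shows the structure only.

```python
import os
import selectors
import threading

def selector_count():
    """Number of selector threads: one per core, plus one."""
    return (os.cpu_count() or 1) + 1

def select_loop(sel, stop, counts, index):
    """Each thread runs its own select loop over its own set of descriptors."""
    while not stop.is_set():
        for key, _mask in sel.select(timeout=0.1):
            data = key.fileobj.recv(4096)
            if data:
                key.fileobj.sendall(data)    # stand-in for real request handling
                counts[index] += 1           # only this thread writes this slot
            else:
                sel.unregister(key.fileobj)  # peer closed the connection

def start_selector_threads(conns, n_threads, stop):
    """Spread connections round-robin over n_threads independent selectors."""
    sels = [selectors.DefaultSelector() for _ in range(n_threads)]
    for i, conn in enumerate(conns):
        sels[i % n_threads].register(conn, selectors.EVENT_READ)
    counts = [0] * n_threads
    threads = [
        threading.Thread(target=select_loop, args=(sel, stop, counts, i), daemon=True)
        for i, sel in enumerate(sels)
    ]
    for t in threads:
        t.start()
    return threads, counts
```

In a real server one would call `start_selector_threads(conns, selector_count(), stop)`; since no selector is shared between threads, no lock is needed around the select call itself.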

As I've noted in the past, I can get double the throughput by running two slapds concurrently, so it's not a question of network or I/O resources. Just that a single listener thread (and/or a single mutex controlling the thread pool) will only scale so far, and no further.
All in all, what consumes CPU in a server is most certainly the
processing of incoming and outgoing requests, not the processing of the
select() operation, no?
In relative costs, sure, but eventually even the small costs add up.
I'm maybe a bit tainted by Java, but it's really based on the same
mechanisms under the hood...
Heh. Indeed. Concurrency issues are the same, no matter what language you use to dress them up.
One other realization I had is that the current design makes it very easy to build slapd as either threaded or non-threaded. Pushing too much activity into other threads would break the non-threaded builds. But at this point, with even cellphones going dual-core, I have to wonder how important it is to maintain compatibility for non-threaded slapd builds.

Java does not suffer from such a limitation, that's for sure, thus we don't have to care about primitive devices which don't support threads... As Android (and supposedly iOS) supports threads, it should not be such a constraint.

Now, if you have an installed base of users who still depend on a single-threaded server, then maybe it's time to split the server in two, and give the single-threaded server its own branch? Even if you fix some bugs in trunk, it's very likely that you will be able to apply the patch to the single-threaded branch, and I'm not sure that it has a long life expectancy anyway :)


Interesting problems :)

--
Regards,
Cordialement,
Emmanuel Lécharny
www.iktek.com
