[Date Prev][Date Next] [Chronological] [Thread] [Top]

LDIF parser performance (was: write performance)

To: Howard Chu <hyc@symas.com>, openldap-devel@OpenLDAP.org
Subject: LDIF parser performance (was: write performance)
From: Michael Ströder <michael@stroeder.com>
Date: Thu, 23 Nov 2006 15:56:05 +0100
In-reply-to: <456589FE.3090804@symas.com>
References: <200608282343.k7SNhOjt061559@cantor.openldap.org> <44F3896A.9080002@symas.com> <Pine.SOC.4.64.0608282050580.7225@toolbox.rutgers.edu> <44F64AB0.7080007@symas.com> <4563F8F2.8090109@symas.com> <4564DBF8.8010809@symas.com> <4564E1C9.4030804@stroeder.com> <4564E33F.1080208@symas.com> <4564EA4D.3070705@stroeder.com> <456589FE.3090804@symas.com>
User-agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.7.13) Gecko/20060417

Howard,

(Cc:-ed openldap-devel@OpenLDAP.org in opposite to our off-line
conversation).

Howard Chu wrote:
> I'm wondering if it's worth the effort
> to rewrite the client's LDIF parser as I did for slapadd -q.

As I said I cannot test on the machine where I did the original tests.
But I tried to test OpenLDAP's LDIF parser with -n.

$ time ldapadd -f test.ldif -n
[..]
real    1m30.402s
user    1m29.090s
sys     0m0.376s

Now I have a small Python script which uses the module 'ldif' from
python-ldap for reading in the LDIF file. I've implemented module 'ldif'
in pure Python but off course the string module in the underlying Python
standard lib is implemented in C. And the Python runtime environment
does all the ugly memory management. :-)

$ time python count_members.py < test.ldif
[..]
real    0m19.145s
user    0m18.349s
sys     0m0.568s

I re-ran the tests twice, so test.ldif should have been in the
filesystem cache.

How does that sound to you? I'm not sure what different actions
ldapadd -n does in comparison to my simple script. But at least
count_members.py also reads the complete entries into a tuple containing
the DN as string and the entry as so-called dictionary.

Ciao, Michael.

Follow-Ups:
- Re: LDIF parser performance
  - From: Howard Chu <hyc@symas.com>

References:
- Re: better malloc strategies
  - From: Howard Chu <hyc@symas.com>
- Re: better malloc strategies
  - From: Howard Chu <hyc@symas.com>

Prev by Date: Re: better malloc strategies
Next by Date: Re: LDIF parser performance
Index(es):
- Chronological
- Thread