
Re: Indexing revisited

To: Quanah Gibson-Mount <quanah@zimbra.com>, OpenLDAP-devel@openldap.org
Subject: Re: Indexing revisited
From: Howard Chu <hyc@symas.com>
Date: Tue, 18 Mar 2014 05:28:03 -0700
In-reply-to: <532837DF.8040109@symas.com>
References: <5322CF11.50500@symas.com> <EE940B73385F105D7D82DE67@[192.168.1.46]> <532837DF.8040109@symas.com>
User-agent: Mozilla/5.0 (X11; Linux x86_64; rv:29.0) Gecko/20100101 Firefox/29.0 SeaMonkey/2.26a1

Howard Chu wrote:

Quanah Gibson-Mount wrote:

--On Friday, March 14, 2014 3:42 AM -0700 Howard Chu <hyc@symas.com> wrote:

A few thoughts occurred to me today about our indexing code:
     1) we compute a hash preset for each invocation, crunching the syntax
and matching rule OIDs, among other things. (It used to be worse: we
used to recompute this for each individual value, even though it's a
constant.) There's no need to recompute this on every invocation;
we can compute it once at first use and reuse the result. It should
speed up index generation, particularly on smaller attribute values. I'm
preparing a patch to test this now.
     2) using this precomputed hash, we can drop the syntax, mr, and prefix
arguments from the indexer function signature. That will also speed
things up.
     3) I note that the 'use' argument is also never used in our indexer
functions. Will drop this as well.
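
To make points 1) and 2) concrete, here is a minimal sketch of the idea, not the code from the patch. Every name in it (index_info, index_preset, index_values, hash_update) is hypothetical, and FNV-1a is only a stand-in for whatever hash the indexer really uses. It assumes the hash is a streaming one, so that a state already fed with the prefix and the two OIDs can be saved and later continued with each value. It also ignores threading: a real lazy initialization inside slapd would need to be made safe for concurrent callers.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* FNV-1a constants; the hash is a stand-in chosen for brevity. */
#define FNV_OFFSET 2166136261u
#define FNV_PRIME  16777619u

/* Continue a running FNV-1a hash state h over len bytes of data. */
static uint32_t hash_update(uint32_t h, const void *data, size_t len)
{
    const unsigned char *p = data;
    for (size_t i = 0; i < len; i++) {
        h ^= p[i];
        h *= FNV_PRIME;
    }
    return h;
}

/* Hypothetical per-index descriptor. The constant inputs live here, so
 * the indexer no longer needs them as separate arguments. */
typedef struct index_info {
    const char *syntax_oid;       /* OID of the attribute syntax */
    const char *mr_oid;           /* OID of the matching rule */
    const char *prefix;           /* index type prefix */
    int preset_valid;             /* nonzero once preset is cached */
    uint32_t preset;              /* hash state after the constant inputs */
    unsigned preset_computations; /* demo counter, not part of the idea */
} index_info;

/* Return the cached preset, computing it on first use only.
 * Not thread-safe as written. */
static uint32_t index_preset(index_info *ii)
{
    if (!ii->preset_valid) {
        uint32_t h = FNV_OFFSET;
        h = hash_update(h, ii->prefix, strlen(ii->prefix));
        h = hash_update(h, ii->syntax_oid, strlen(ii->syntax_oid));
        h = hash_update(h, ii->mr_oid, strlen(ii->mr_oid));
        ii->preset = h;
        ii->preset_valid = 1;
        ii->preset_computations++;
    }
    return ii->preset;
}

/* Slimmed-down indexer: only the descriptor and the values are passed.
 * Each key continues the hash from the shared preset state. */
static void index_values(index_info *ii, const char *const *values,
                         size_t nvals, uint32_t *keys)
{
    uint32_t preset = index_preset(ii);
    for (size_t i = 0; i < nvals; i++)
        keys[i] = hash_update(preset, values[i], strlen(values[i]));
}
```

Calling index_values() repeatedly on the same descriptor hashes the prefix and OIDs once; later calls pay only for the value bytes. Because the hash is continued from the saved state, each key is the same as it would be if the constant inputs were rehashed every time, which is also why the gain is proportionally larger for short values.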


Please send the patch my way! ;)


Complete patch is on my indx2 branch on ada. (I just copied the back-mdb
changes over to back-bdb/hdb today; the back-mdb code hasn't changed from last
week.)

For reference, the impact has been tiny on the data sets I've tested: around 3% improvement at most. But the code is cleaner anyway.


--
  -- Howard Chu
  CTO, Symas Corp.           http://www.symas.com
  Director, Highland Sun     http://highlandsun.com/hyc/
  Chief Architect, OpenLDAP  http://www.openldap.org/project/
