Searching behaviour with Unicode


I wonder if anyone could advise how OpenLDAP handles search
matching against entries containing accented characters.  I
know these are stored as UTF-8, which is then Base64 encoded
for output.  But how can we match an accented character
against its non-accented equivalent?

For example, the directory contains the name:

Suárez Quintáns

but searching for Suarez Quintans does not find it.  Is there
a configuration option that will allow this to be found?

I also tried using the accented form in the search (using
command-line ldapsearch), but that failed to find it too.  I
assume this is because the character needs to be UTF-8 and
Base64 encoded.
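
To make the encoding question concrete, here is a minimal Python sketch
of the two steps mentioned above: encoding the name as UTF-8, then
Base64 encoding those bytes.  The variable names are illustrative only,
and the sketch says nothing about what ldapsearch itself expects on the
command line.

```python
import base64

# The name from the directory, written with escapes so the source
# file's own encoding cannot alter it (U+00E1 is "a" with acute).
name = "Su\u00e1rez Quint\u00e1ns"

# Step 1: encode the text as UTF-8 bytes; each accented "a" takes two bytes.
utf8_bytes = name.encode("utf-8")

# Step 2: Base64 encode the UTF-8 bytes into plain ASCII text.
encoded = base64.b64encode(utf8_bytes).decode("ascii")

# Decoding reverses both steps and recovers the original name.
decoded = base64.b64decode(encoded).decode("utf-8")

print(len(name), len(utf8_bytes))
print(encoded)
print(decoded == name)
```

The name has 15 characters but 17 UTF-8 bytes, which is why a search
value typed in a non-UTF-8 terminal encoding may not match the stored
bytes.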

I assume that OpenLDAP 2.0 uses the UCData API 
(http://crl.nmsu.edu/~mleisher/ucdata-doc.html) with canonical
decomposition for comparison?
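
As a rough illustration of what canonical decomposition does and does
not give you, here is a sketch using Python's standard unicodedata
module.  It is not OpenLDAP's or UCData's implementation, and the
function names are hypothetical.  Canonical decomposition alone only
makes precomposed and decomposed spellings compare equal; matching
"Suarez" against "Suárez" additionally requires dropping the combining
marks, which is an assumption about the desired behaviour.

```python
import unicodedata


def canonical_key(text):
    """Canonical decomposition (NFD) only: unifies equivalent spellings."""
    return unicodedata.normalize("NFD", text)


def accent_insensitive_key(text):
    """Hypothetical looser key: decompose, drop combining marks, fold case."""
    decomposed = unicodedata.normalize("NFD", text)
    stripped = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return stripped.casefold()


stored = "Su\u00e1rez Quint\u00e1ns"          # precomposed U+00E1
decomposed_form = "Sua\u0301rez Quinta\u0301ns"  # "a" + combining acute U+0301
query = "Suarez Quintans"                     # no accents at all

# Equivalent spellings match under canonical decomposition...
print(canonical_key(stored) == canonical_key(decomposed_form))
# ...but the unaccented query still differs.
print(canonical_key(stored) == canonical_key(query))
# Only the looser key treats the accented and unaccented forms as equal.
print(accent_insensitive_key(stored) == accent_insensitive_key(query))
```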

        Paul Gillingwater
        Managing Director
 CSO Lanifex Unternehmensberatung 
 & Softwareentwicklung G.m.b.H.

E-mail:  paul@lanifex.com
Teleph:  +43(1)2198222-20
Fax:     +43(1)2198222-11
Mobile:  +43(699)1922 3085
Webhome: http://www.lanifex.com/
Address: Praterstrasse 60/1/2 
         A-1020 Vienna, Austria