
Re: LDAPprep: mapping of " " values

To: Steven Legg <steven.legg@eb2bcom.com>
Subject: Re: LDAPprep: mapping of " " values
From: Rici Lake <rici@ricilake.net>
Date: Tue, 16 Nov 2004 02:01:53 -0500
Cc: ietf-ldapbis@OpenLDAP.org
In-reply-to: <4199A06D.3030302@eb2bcom.com>
References: <6.1.2.0.0.20041114173514.02e0cd80@127.0.0.1> <4199A06D.3030302@eb2bcom.com>


On 16-Nov-04, at 1:38 AM, Steven Legg wrote:

Alternatively, LDAPprep can just reduce consecutive whitespace to a single space in every case and leave the syntaxes draft to nominate the circumstances under which a leading or trailing space is to be removed.


This seems very sensible to me.

A value can only match (l= *) or (l=* ) if it is all whitespace.

Not quite true. According to the insignificant space deletion rule, a space is a candidate for deletion only if it is not followed by a combining mark. Consequently, the sequence U+0041 U+0020 U+0020 U+0301 will not be altered by LDAPprep. (The string is LATIN CAPITAL LETTER A, SPACE, SPACE, COMBINING ACUTE ACCENT.) One doesn't have to go out of one's way to produce that sequence; the sequence U+0041 U+0020 U+00B4 (LATIN CAPITAL LETTER A, SPACE, ACUTE ACCENT) will be LDAPprep'd into the first sequence, as a result of the compatibility decomposition of U+00B4 into U+0020 U+0301. (That is, both strings render as "A ´" for anyone with a Unicode-capable mail reader.)
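As a rough illustration of the point above, here is a small Python sketch. It uses the standard unicodedata module for the NFKC step; the helper names is_combining_mark and collapse_insignificant_spaces are hypothetical, and the collapsing logic is only a simplified reading of the rule as described in this message (treating any character of general category M as a combining mark is an assumption), not the text of the draft.

```python
import unicodedata


def is_combining_mark(ch: str) -> bool:
    # Assumption: approximate "combining mark" by Unicode general category M*.
    return unicodedata.category(ch).startswith("M")


def collapse_insignificant_spaces(s: str) -> str:
    # Hypothetical helper: reduce runs of deletable spaces to a single space.
    # A space directly followed by a combining mark is never deletable.
    out = []  # list of (character, is_protected_space) pairs
    for i, ch in enumerate(s):
        nxt = s[i + 1] if i + 1 < len(s) else ""
        protected = ch == " " and nxt != "" and is_combining_mark(nxt)
        # Skip a deletable space that follows another deletable space.
        if ch == " " and not protected and out and out[-1] == (" ", False):
            continue
        out.append((ch, protected))
    return "".join(ch for ch, _ in out)


# U+00B4 decomposes (compatibility) into U+0020 U+0301 under NFKC.
prepared = unicodedata.normalize("NFKC", "A \u00b4")
collapsed = collapse_insignificant_spaces(prepared)
print([f"U+{ord(c):04X}" for c in collapsed])
```

Under these assumptions both spaces survive: the first is the only deletable space in its run, and the second is protected by the combining mark that follows it.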

Consequently, even (l= * *) could match something, but I believe that (l= * * ) is truly impossible.

Follow-Ups:
- Re: LDAPprep: mapping of " " values
  - From: Rici Lake <rici@ricilake.net>

References:
- LDAPprep: mapping of " " values
  - From: "Kurt D. Zeilenga" <Kurt@OpenLDAP.org>
- Re: LDAPprep: mapping of " " values
  - From: Steven Legg <steven.legg@eb2bcom.com>
