[Date Prev][Date Next] [Chronological] [Thread] [Top]

normalised UTF-8, should it be "decomposed", or "composed"?

To: "'OpenLDAP DEVEL'" <openldap-devel@OpenLDAP.org>
Subject: normalised UTF-8, should it be "decomposed", or "composed"?
From: "John Hughes" <john@Calva.COM>
Date: Tue, 19 Feb 2002 11:43:25 +0100
Importance: Normal

In ldap/libraries/liblunicode/ucstr.c we have around 203:

                /* normalize ucs of length p - ucs */
                uccanondecomp( ucs, p - ucs, &ucsout, &ucsoutlen );
                ucsoutlen = uccanoncomp( ucsout, ucsoutlen );

Why convert to decomposed form then back to composed?  Wouldn't
it be better to us decomposed form as the "normalised" form?

Follow-Ups:
- Re: normalised UTF-8, should it be "decomposed", or "composed"?
  - From: Stig Venaas <Stig@OpenLDAP.org>

Prev by Date: Re: 2.1 loose ends
Next by Date: Re: TODO : Implement localization
Index(es):
- Chronological
- Thread