[Date Prev][Date Next] [Chronological] [Thread] [Top]

Re: [ldapext] UTF-8 full support in LDIF / LDIF v2

To: ldapext@ietf.org
Subject: Re: [ldapext] UTF-8 full support in LDIF / LDIF v2
From: Yves Dorfsman <yves@zioup.com>
Date: Thu, 11 Jun 2009 21:58:51 -0600
Delivered-to: ldapext@core3.amsl.com
In-reply-to: <4A311ED1.1030202@stroeder.com>
References: <49C497F9.7010200@zioup.com> <CD3905D4-2A25-4C56-8187-3CE10D46C929@isode.com> <49C870C6.4010803@zioup.com> <E94B7389-9A6D-4CB6-BB2C-649CCD3FD15B@Isode.com> <49CB192E.5050105@zioup.com> <49CB211C.6070108@eb2bcom.com> <49CB87FE.1050809@zioup.com> <49CC01DE.6040506@eb2bcom.com> <4A24557D.7030006@zioup.com> <4A26A05D.8040105@zioup.com> <245BF18B-2066-4E36-9502-16F4A3140D9E@Isode.com> <4A309775.3080406@zioup.com> <4A311ED1.1030202@stroeder.com>
User-agent: Thunderbird 2.0.0.21 (X11/20090409)


Ok, let's split the two issue:

First, is it worth amending the standard to allow non-base64 UTF-8 ?
(forget about multi-line attribute value for a moment).

Looking back (way back) at this thread, I am not the only one seeing value here:


Michael Ströder wrote:

I'm not convinced that removing the ASCII restrictions will be a good
thing. Not only do I doubt it will have a net positive ondisplayability of LDIF for those who have a displayability goal (Idon't this goal), I think it will have a net negative impact oninteroperability and user confusion, such as when the user creates afile using one Unicode normalization algorithm, but is trying to setvalues which require a different Unicode normalization value.
How so ? In the current version, you have to encode your Unicode to
UTF-8, and then encode it again to base64. With my proposal, you would
get the exact same UTF-8 strings as you do today, but they would not be
(or would not have to be) encoded in base64.
I agree with Yves here.



Steven Legg wrote:

LDIF is first and foremost an interchange format.  Conversion from LDAP
PDU->LDIF Record->LDAP PDU MUST produce as output the input, octet for
octet for every "data" component (the DN, every attribute description
and associated values, etc.).
That's highly desirable for directory to directory interchange, but LDIFis also used for composing data from various data sources to put in adirectory and to extract data from a directory to put in other datasources. The octet-for-octet preservation usually doesn't apply in theseother cases and the need to turn line-based data such as XML documentsinto base64 encodings is a serious impediment, hence the reason Andrewand I wrote the Internet-draft.



Ludovic Poitou wrote:

I went back through the mailing list archive, "charset" came up, but I

>> can't make sense of who started with it.

> I probably did.
> In europe there are lots of directory users and administrators that have
>  non ascii data they need to transform to LDIF.
> With simple scripts, turning the data to UTF-8 and then to base64 encoded
> is a pain. Allowing to specify a charset and then letting the tools doing
> the conversion to UTF-8 automatically could simplify their life.




Can we try to think of what sort of problem non-encoded UTF-8 would create ?
If there is none, than, could we implement a version 2 that does this ?
The people that don't care for it, can carry on using version 1 ?

--
Yves.
http://www.sollers.ca/

_______________________________________________
Ldapext mailing list
Ldapext@ietf.org
https://www.ietf.org/mailman/listinfo/ldapext

Follow-Ups:
- Re: [ldapext] UTF-8 full support in LDIF / LDIF v2
  - From: Kurt Zeilenga <Kurt.Zeilenga@Isode.com>
- Re: [ldapext] UTF-8 full support in LDIF / LDIF v2
  - From: Michael Ströder <michael@stroeder.com>

References:
- Re: [ldapext] UTF-8 full support in LDIF / LDIF v2
  - From: Yves Dorfsman <yves@zioup.com>
- Re: [ldapext] UTF-8 full support in LDIF / LDIF v2
  - From: Yves Dorfsman <yves@zioup.com>
- Re: [ldapext] UTF-8 full support in LDIF / LDIF v2
  - From: Kurt Zeilenga <Kurt.Zeilenga@Isode.com>
- Re: [ldapext] UTF-8 full support in LDIF / LDIF v2
  - From: Yves Dorfsman <yves@zioup.com>
- Re: [ldapext] UTF-8 full support in LDIF / LDIF v2
  - From: Michael Ströder <michael@stroeder.com>

Prev by Date: Re: [ldapext] UTF-8 full support in LDIF / LDIF v2
Next by Date: Re: [ldapext] UTF-8 full support in LDIF / LDIF v2
Index(es):
- Chronological
- Thread