[Date Prev][Date Next] [Chronological] [Thread] [Top]

Re: [ldapext] UTF-8 full support in LDIF / LDIF v2

To: ldapext@ietf.org
Subject: Re: [ldapext] UTF-8 full support in LDIF / LDIF v2
From: Yves Dorfsman <yves@zioup.com>
Date: Sun, 14 Jun 2009 22:46:53 -0600
Delivered-to: ldapext@core3.amsl.com
In-reply-to: <35B2A165-CE5D-4650-AADE-CC233F71470E@Isode.com>
References: <49C497F9.7010200@zioup.com> <CD3905D4-2A25-4C56-8187-3CE10D46C929@isode.com> <49C870C6.4010803@zioup.com> <E94B7389-9A6D-4CB6-BB2C-649CCD3FD15B@Isode.com> <49CB192E.5050105@zioup.com> <49CB211C.6070108@eb2bcom.com> <49CB87FE.1050809@zioup.com> <49CC01DE.6040506@eb2bcom.com> <4A24557D.7030006@zioup.com> <4A26A05D.8040105@zioup.com> <245BF18B-2066-4E36-9502-16F4A3140D9E@Isode.com> <4A309775.3080406@zioup.com> <4A311ED1.1030202@stroeder.com> <4A31D27B.3090208@zioup.com> <35B2A165-CE5D-4650-AADE-CC233F71470E@Isode.com>
User-agent: Thunderbird 2.0.0.21 (X11/20090409)


Kurt Zeilenga wrote:

I think we need clear problem statements, a proposal for addressing eachproblem, a summary of why the proposal does address the problem and astatement of what (known) problems the proposal might introduce.
You have noted that the "diffing problem". But here it's not clearwhether a) you are wish to determine how two LDIF files differ, b) youwish to determine if two LDIF files represent the same LDAP requests, orc) you wish to determine how directory information represented in theLDIF represented LDAP requests differ and, if so, how.
For a), one can use file comparison tools to determine how two LDIFfiles differ.

Yes, but because it displays unreadable characters, it makes it slightlymore complicated. The better case (than simply diffing) I have given in thepast is:


-the directory is broken
-you export to LDIF
-compare this LDIF with a previous one from when the directory was working.

I personally find that in such a case, being able to read the values makesit simpler and faster.

Other case: People have mentioned scripts that build LDIF file from othersource, and have mentioned that encoding the values in base64 is an overheadthey could do without.

Simply having a UTF-8 value encoding option doesn't generally solve anydiffing problem. In cases b and c, the LDIF encoding of the valuedoesn't matter. In case a), you'd be introducing another way for LDIFfiles which represented the same LDAP requests to differ.

If you always do your export with the same tool, with the same options, thenthis shouldn't be an issue. A narrow case I admit, but this is one specificcase I was thinking about.

It may be you are thinking that a human would be better able to visuallydetect certain kinds of differences. However, this assumes thatremoving the ASCII restriction would produce a readily display Unicodetext.

On a modern OS setup properly, Unicode text is displayed properly (myexperience is with UTF-8 on Linux and solaris here).

That, I believe, is a bad assumption. For instance, say a userdiff(1) to LDIF files and get:
% diff -u ?.ldif
--- 1.ldif    2009-06-11 22:08:21.000000000 -0700
+++ 2.ldif    2009-06-11 22:08:47.000000000 -0700
@@ -1 +1 @@
-a: f??
+a:f??
where ? represents character not displayable on the user's screen. Theuser might assume the values here are same when they aren't.
This, I hope, illustrates why general file diff'ing tools, like diff(1),are suitable only for case a but not b.


Ok, but a) is still a valid case.

--
Yves.
http://www.sollers.ca/

_______________________________________________
Ldapext mailing list
Ldapext@ietf.org
https://www.ietf.org/mailman/listinfo/ldapext

Follow-Ups:
- Re: [ldapext] UTF-8 full support in LDIF / LDIF v2
  - From: Kurt Zeilenga <Kurt.Zeilenga@Isode.com>

References:
- Re: [ldapext] UTF-8 full support in LDIF / LDIF v2
  - From: Yves Dorfsman <yves@zioup.com>
- Re: [ldapext] UTF-8 full support in LDIF / LDIF v2
  - From: Yves Dorfsman <yves@zioup.com>
- Re: [ldapext] UTF-8 full support in LDIF / LDIF v2
  - From: Kurt Zeilenga <Kurt.Zeilenga@Isode.com>
- Re: [ldapext] UTF-8 full support in LDIF / LDIF v2
  - From: Yves Dorfsman <yves@zioup.com>
- Re: [ldapext] UTF-8 full support in LDIF / LDIF v2
  - From: Michael Ströder <michael@stroeder.com>
- Re: [ldapext] UTF-8 full support in LDIF / LDIF v2
  - From: Yves Dorfsman <yves@zioup.com>
- Re: [ldapext] UTF-8 full support in LDIF / LDIF v2
  - From: Kurt Zeilenga <Kurt.Zeilenga@Isode.com>

Prev by Date: Re: [ldapext] UTF-8 full support in LDIF / LDIF v2
Next by Date: [ldapext] LDAPCon 2009 Call for Papers
Index(es):
- Chronological
- Thread