
Re: Problems with case folding of UTF-8

To: Stig Venaas <Stig.Venaas@uninett.no>
Subject: Re: Problems with case folding of UTF-8
From: Michael Ströder <michael@stroeder.com>
Date: Sat, 22 Dec 2001 19:35:05 +0100
Cc: Pierangelo Masarati <masarati@aero.polimi.it>, openldap-devel@OpenLDAP.org
Organization: stroeder.com
References: <20011222185930.A31930@sverresborg.uninett.no> <200112221807.fBMI7h324927@server.aero.polimi.it> <20011222191927.A31943@sverresborg.uninett.no>

Stig Venaas wrote:
> 
> adding new entry "cn=Stig VenÃ¥s, dc=my-domain,dc=com"

Well, you have to tell us that this string is UTF-8 improperly
interpreted as ISO-8859-1 by your xterm. Otherwise it's meaningless. ;-)
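
To illustrate how such a display comes about, here is a minimal sketch in current Python 3 (not the 2.1 interpreter used elsewhere in this message; the variable names are only for illustration). It decodes the UTF-8 bytes of the name as ISO-8859-1, which is effectively what a terminal set to that charset does:

```python
# UTF-8 bytes of the name as they would be stored in the DN.
name = "Ven\u00e5s"                 # U+00E5 is LATIN SMALL LETTER A WITH RING ABOVE
utf8_bytes = name.encode("utf-8")

# A terminal that assumes ISO-8859-1 maps every byte to one character,
# so the two-byte sequence for U+00E5 shows up as two separate characters.
misread = utf8_bytes.decode("iso-8859-1")

print(utf8_bytes)                   # b'Ven\xc3\xa5s'
print(len(name), len(misread))      # 5 6
```

The bytes are the same in both cases; only the charset used to display them differs.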

> The DN in base64 is Y249U3RpZyBWZW7DpXMsIGRjPW15LWRvbWFpbixkYz1jb20

Are you sure that this is properly base64-encoded?

Python 2.1.1 (#5, Nov 18 2001, 17:07:23) 
[GCC 2.95.2 19991024 (release)] on linux2
>>> import base64
>>> base64.decodestring('Y249U3RpZyBWZW7DpXMsIGRjPW15LWRvbWFpbixkYz1jb20')
Traceback (most recent call last):
  File "<stdin>", line 1, in ?
  File "/usr/lib/python2.1/base64.py", line 47, in decodestring
    decode(f, g)
  File "/usr/lib/python2.1/base64.py", line 31, in decode
    s = binascii.a2b_base64(line)
binascii.Error: Incorrect padding
>>>
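
The string is 47 characters long, and padded base64 text must have a length that is a multiple of four, which is what the decoder is complaining about. A minimal sketch in current Python 3 (not the 2.1 session above; `pad_base64` is a hypothetical helper name, not part of the `base64` module) that restores the missing `=` and decodes the result:

```python
import base64

def pad_base64(text):
    """Append the '=' characters needed to make len(text) a multiple of 4."""
    return text + "=" * (-len(text) % 4)

dn_b64 = "Y249U3RpZyBWZW7DpXMsIGRjPW15LWRvbWFpbixkYz1jb20"
padded = pad_base64(dn_b64)          # one '=' is appended here
raw = base64.b64decode(padded)       # bytes of the DN
dn = raw.decode("utf-8")             # text of the DN

print(len(dn_b64), len(padded))      # 47 48
print(raw)
```

With the padding added, the bytes decode as UTF-8 to the DN quoted above, with the non-ASCII letter stored as the two bytes 0xC3 0xA5.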

> Ã¥ is å (a with circle above), and should still be one character
> when normalized (still 2 characters in UTF-8).

For the record, here is Python's UTF-8 encoding of the name:

>>> unicode('Venås','iso-8859-1').encode('utf-8')
'Ven\xc3\xa5s'
>>> 
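
The "one character, two bytes" point in the quoted text can also be checked in current Python 3 with the standard `unicodedata` module. This is only a sketch: which normalization form the server applies is not stated in this message, so both NFC and NFD are shown, and the variable names are only for illustration.

```python
import unicodedata

# NFC keeps the letter as the single code point U+00E5.
composed = unicodedata.normalize("NFC", "Ven\u00e5s")
# NFD splits it into 'a' followed by U+030A COMBINING RING ABOVE.
decomposed = unicodedata.normalize("NFD", composed)

# Code points versus UTF-8 bytes for each form.
print(len(composed), len(composed.encode("utf-8")))      # 5 6
print(len(decomposed), len(decomposed.encode("utf-8")))  # 6 7

# Case folding works per code point and leaves the lengths unchanged.
print(composed.upper().casefold() == composed.casefold())  # True
```

In the composed form the letter stays one character and two UTF-8 bytes, matching the `\xc3\xa5` output above.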

Ciao, Michael.
