[Date Prev][Date Next] [Chronological] [Thread] [Top]

Re: Problems with case folding of UTF-8



Stig Venaas wrote:
> 
> adding new entry "cn=Stig Venås, dc=my-domain,dc=com"

Well, you have to tell us that this string is improperly interpreted
as ISO-8859-1 by your xterm. Otherwise it's meaningless. ;-)

> The DN in base64 is Y249U3RpZyBWZW7DpXMsIGRjPW15LWRvbWFpbixkYz1jb20

Are you sure about that being properly base64-encoded?

Python 2.1.1 (#5, Nov 18 2001, 17:07:23) 
[GCC 2.95.2 19991024 (release)] on linux2
>>> import base64
>>> base64.decodestring('Y249U3RpZyBWZW7DpXMsIGRjPW15LWRvbWFpbixkYz1jb20')
Traceback (most recent call last):
  File "<stdin>", line 1, in ?
  File "/usr/lib/python2.1/base64.py", line 47, in decodestring
    decode(f, g)
  File "/usr/lib/python2.1/base64.py", line 31, in decode
    s = binascii.a2b_base64(line)
binascii.Error: Incorrect padding
>>>

> Ã¥ is å (a with circle above), and should still be one character
> when normalized (still 2 characters in UTF-8).

For your records Python's UTF-8 encoding:

>>> unicode('Venås','iso-8859-1').encode('utf-8')
'Ven\xc3\xa5s'
>>> 

Ciao, Michael.