[Date Prev][Date Next] [Chronological] [Thread] [Top]

Re: Problems with case folding of UTF-8



Pierangelo Masarati wrote:
> 
> Can you, Stig and Michael, provide a set of strings that do not
> work, so that I can try to see what's going on?

Well, Ströder (hopefully properly encoded as ISO-8859-1 in this
e-mail) is one. The hex-escaped string representation produced by
Python's UTF-8 Unicode codec is:

'Str\xc3\xb6der'

Furthermore here are all the german umlauts (each two bytes long):

'äöüÄÖÜß' ->

'\xc3\xa4\xc3\xb6\xc3\xbc\xc3\x84\xc3\x96\xc3\x9c\xc3\x9f'

Strip the single quotes off course.

You're welcome to test with web2ldap's online demo (see
http://sites.inka.de:8002/web2ldap) against your LDAP test server
and use any strange NON-ASCII char you can type on your keyboard.

Ciao, Michael.