[Date Prev][Date Next]
[Chronological]
[Thread]
[Top]
Re: Unicode normalization
At 03:53 PM 11/16/00 +0100, Stig Venås wrote:
>Before I start doing the changes, I would like to let you know what
>I'm thinking, please let me know if this looks wrong.
>
>To lunicode, I'm adding UTF8str2upper and UTF8normalize that does upper-
>casing and unicode normalization on a UTF8 string resp., and return a
>pointer to a new string. They allocate new memory for this, preserving
>the old one. I'm using functions from lldap for this, is it bad that
>lunicode depends on lldap?
This shouldn't be a problem.
>I'm leaving them as two separate functions since we don't always want to
>do uppercasing.
>
>Both should be used by dnNormalize, caseIgnoreIndexer, caseIgnoreFilter,
>caseIgnoreSubstringsIndexer, caseIgnoreSubstringsFilter. They should also
>be used by the approx-functions I guess, but I don't want to think of
>them now.
>
>caseExactMatch, caseExactSubstringsMatch, caseExactIndexer,
>caseExactFilter, caseExactSubstringsIndexer and caseExactSubstringsFilter
>should all use UTF8normalize.
>
>Is it okay if I enable this in head (will require reindexing in many
>cases), or should I ifdef it out.
You should enable new code in head as soon as it functional.
As long as the changes don't interfere with other development,
there is no need for #ifdefs.
Kurt