[Date Prev][Date Next] [Chronological] [Thread] [Top]

Re: Fwd: LMDB and text encoding



* Timur Kristóf:

>> > A path is always a Unicode string, while a DB name can be an arbitrary
>> > binary blob.
>>
>> On many POSIX platforms, a path is a blob which does not contain
>> '\000'.  These systems do not enforce Unicode encoding at all.
>
> My mistake. I was unaware.
> On those platforms, how do you type a path name into a terminal?

There are some files which are not directly nameable.  Many programs
support special sequences such as “Ctrl+V 3 7 7” to enter arbitrary
bytes, but that's not universal.  Depending on the actual
implementation of the terminal, cut-and-paste of funny file names can
work, too.

Older programs have trouble accessing such files even if the user
chooses them in a file selection dialog, but current version are
supposed to have been fixed (including OpenJDK, which took a
ridiculously long time).  Beyond that, it's not much different from
dealing with file names in an unfamiliar script.