nanogui@linuxhacker.org

nanogui@linuxhacker.org


Subject: Re: New Microwindows font support
From: Rob Leslie
Date: Tue, 21 Mar 2000 13:48:49 -0500

> : You are right, with the ugly Surrogate hack in UTF-16, up to
> : 0x10FFFF characters can be supported, which should keep the
> : ISO and ANSI guys busy for a while....
> 
> I'm not aware of the this.  Would you explain a bit?

The Unicode standard defines character codes U+D800 through U+DFFF to be
"surrogates," such that a pair made up from one high-half surrogate (U+D800 -
U+DBFF) and one low-half surrogate (U+DC00 - U+DFFF) represents a single
abstract character with a scalar value defined to be:

  0x10000 + (H - 0xD800) * 0x400 + (L - 0xDC00)

Cheers,

-- 
Robert Leslie
rob@mars.org

nanogui@linuxhacker.org