Re: Unicode string literals versus the world

Поиск

Список

Период

Сортировка

От	Tatsuo Ishii
Тема	Re: Unicode string literals versus the world
Дата	16 апреля 2009 г. 01:52:37
Msg-id	20090416.135202.48521717.t-ishii@sraoss.co.jp обсуждение исходный текст
Ответ на	Re: Unicode string literals versus the world (Tatsuo Ishii <ishii@postgresql.org>)
Ответы	Re: Unicode string literals versus the world
Список	pgsql-hackers

Дерево обсуждения

> > >>> I still stand on my proposal, how about extending E'' strings with
> > >>> unicode escapes (eg. \uXXXX)?  The E'' strings are already more
> > >>> clearly defined than '' and they are our "own", we don't need to
> > >>> consider random standards, but can consider our sanity.
> > >>>       
> > >> I suspect there would be lots more support in the user community, where 
> > >> \uXXXX is well understood in a number of contexts (Java and ECMAScript, 
> > >> for example). It's also tolerably sane.
> > >>     
> > >
> > > By the way, that's an example of how to do it wrong, there are more
> > > than 2^16 unicode characters, you want to be able to support the full
> > > 21-bit range if you're going to do it right.
> > >
> > > FWIW, I prefer the perl syntax which simply extends \x: \x{1344}, which
> > > makes it clear it's hex and doesn't make assumptions as to how many
> > > characters are used.
> > >   
> > 
> > I could live with either. Wikipedia says: "The characters outside the 
> > first plane usually have very specialized or rare use." For years we 
> > rejected all characters beyond the first plane, and while that's fixed 
> > now, the volume of complaints wasn't huge.
> 
> I you mean "first plane" as BMP (i.e. 16bit range), above is not true
> for PostgreSQL 7.3 or later at least.

Oops. I meant 8.2 or later.
--
Tatsuo Ishii
SRA OSS, Inc. Japan

В списке pgsql-hackers по дате отправления:

Вход в личный кабинет

Восстановление пароля

Подтверждение аккаунта

Изменение пароля

Re: Unicode string literals versus the world