Re: Invalid EUC_TW character sequence found

From Tatsuo Ishii
Subject Re: Invalid EUC_TW character sequence found
Date
Msg-id 20020626.124206.102120976.t-ishii@sra.co.jp
In reply to Re: Invalid EUC_TW character sequence found  (Tatsuo Ishii <t-ishii@sra.co.jp>)
List pgsql-bugs
> To me, the third insert is a character that displays correctly in my application,
> I do not see any problem.  And I do not know and cannot tell how to check that
> 'xx' is not a correct EUC_TW character.   Please give me some hint for checking,
> thanks!!

Ok, here are some rules to verify EUC_TW characters:

(1) if the first byte is 0x8e, then the 8th bit of each of the
    following three bytes must be set

(2) else if the first byte is 0x8f, then the 8th bit of each of the
    following two bytes must be set

(3) else if the 8th bit of the first byte is set, then the 8th bit of
    the following byte must be set

(4) else (that means the 8th bit of the first byte is not set) then
    that must be an ASCII character.

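Apparently 0xa672 does not satisfy the above: the 8th bit of the first
byte (0xa6) is set, but the 8th bit of the second byte (0x72) is not,
so it fails rule (3).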
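
The rules above can be sketched as a small Python check. This is only an
illustration of the four rules as stated here, not PostgreSQL's actual
verifier; the function name euc_tw_invalid_offset is made up for this
example, and it tests only the 8th bit of each byte, not whether the
bytes map to an assigned character.

```python
def euc_tw_invalid_offset(data: bytes) -> int:
    """Return the offset of the first character that breaks rules (1)-(4),
    or -1 if every character passes.

    Hypothetical helper: it checks only the 8th-bit rules from this
    message, not the full set of valid EUC_TW byte ranges.
    """
    i = 0
    n = len(data)
    while i < n:
        first = data[i]
        if first == 0x8E:        # rule (1): three following bytes
            trailing = 3
        elif first == 0x8F:      # rule (2): two following bytes
            trailing = 2
        elif first & 0x80:       # rule (3): one following byte
            trailing = 1
        else:                    # rule (4): plain ASCII, no following bytes
            trailing = 0

        tail = data[i + 1 : i + 1 + trailing]
        # A truncated character, or a following byte whose 8th bit is
        # clear, violates the rule selected above.
        if len(tail) < trailing or any(not (b & 0x80) for b in tail):
            return i
        i += 1 + trailing
    return -1


if __name__ == "__main__":
    # 0xa672 from this thread: 0xa6 has the 8th bit set, 0x72 does not.
    print(euc_tw_invalid_offset(bytes([0xA6, 0x72])))
```

A return value other than -1 is the byte offset where the offending
character starts, which may help locate the bad data before inserting it.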
--
Tatsuo Ishii
