Re: [HACKERS] Unicode combining characters
От | Patrice Hédé |
---|---|
Тема | Re: [HACKERS] Unicode combining characters |
Дата | |
Msg-id | 20011010192819.J14587@idf.net обсуждение исходный текст |
Ответ на | Re: [HACKERS] Unicode combining characters (Tatsuo Ishii <t-ishii@sra.co.jp>) |
Список | pgsql-patches |
> > 1) we support these supplementary characters, knowing that they won't > > work with regexes, > > > > 2) I back out the change, but then anyone using these characters will > > get something weird, since the decoding would be faulty (they would > > be handled as 3 bytes UTF-8 chars, and then the fourth byte would > > become a "faulty char"... not very good, as the 3-byte version is > > still not a valid UTF-8 code !), > > > > 3) we fix the regex engine within the next 24 hours, before the beta > > deadline is activated :/ > > > > What do you think ? > > I think 2) is not very good, and we should reject these 4-bytes UTF-8 > strings. After all, we are not ready for them. If we still recognise them as 4-byte UTF-8 chars (in order to parse the next char correctly) and reject them as invalid chars, that should be OK :) > BTW, other part of your patches looks good. Peter, what do you think? Nice to hear :) Patrice -- Patrice Hédé email: patrice hede à islande org www : http://www.islande.org/
В списке pgsql-patches по дате отправления: