Re: Help with High value unicode characters
From | Michael Fuhr |
---|---|
Subject | Re: Help with High value unicode characters |
Date | |
Msg-id | 20070808075737.GA46023@winnie.fuhr.org |
In response to | Help with High value unicode characters ("Chris Hoover" <revoohc@gmail.com>) |
List | pgsql-admin |
On Tue, Aug 07, 2007 at 05:09:35PM -0400, Chris Hoover wrote:
> We need some help, we have some what we believe are high value unicode
> characters (Unicode 0x2).

What do you mean by "high value unicode characters (Unicode 0x2)"?
Characters with code points in a plane other than Plane 0 (BMP,
Basic Multilingual Plane), i.e., with a code point greater than
U+FFFF?

> How can you search and replace for these? We are storing this data
> in a text field, and having the data contain this unicode value is
> violating our xml rules the application uses and causing abends in
> our application.

If I understand what you're asking then you should be able to use
regexp_replace (8.1 and later) to fix the data. Example:

    UPDATE tablename
       SET columnname = regexp_replace(columnname, E'[\\U00010000-\\U0010FFFF]+', '', 'g')
     WHERE columnname ~ E'[\\U00010000-\\U0010FFFF]';

If that doesn't help then please clarify the problem.

-- 
Michael Fuhr
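Before running the UPDATE from the message above, it may be worth previewing which rows it would touch. The query below is only a sketch: it reuses the placeholder names tablename and columnname and the same pattern from the example, assumes a UTF8-encoded database, and adds a LIMIT that is not part of the original suggestion.

```sql
-- Preview only: nothing is modified.
-- tablename and columnname are the placeholders from the example above.
-- Assumption: the database encoding is UTF8, so code points above U+FFFF
-- can be stored and matched at all.
SELECT columnname AS original,
       regexp_replace(columnname, E'[\\U00010000-\\U0010FFFF]+', '', 'g') AS cleaned
  FROM tablename
 WHERE columnname ~ E'[\\U00010000-\\U0010FFFF]'
 LIMIT 20;  -- arbitrary cap to keep the preview short
```

If the preview returns no rows, the offending characters are probably not outside the BMP, and the pattern would need to be adjusted once the actual code points are known.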