finding rows with invalid characters
От | Sim Zacks |
---|---|
Тема | finding rows with invalid characters |
Дата | |
Msg-id | 4CE8D02F.8060200@compulab.co.il обсуждение исходный текст |
Ответы |
Re: finding rows with invalid characters
|
Список | pgsql-general |
I am using PG 8.2.17 with UTF8 encoding. "PostgreSQL 8.2.17 on i686-pc-linux-gnu, compiled by GCC gcc (GCC) 4.1.1 (Gentoo 4.1.1)" One of my tables somehow has invalid characters in it: > ERROR: invalid byte sequence for encoding "UTF8": 0xa9 > HINT: This error can also happen if the byte sequence does not match > the encoding expected by the server, which is controlled by > "client_encoding". I have already manually found a number of the bad rows by running queries with text functions (upper) between groups of IDs until I found the specific bad row. 1) Is there a quicker way to get a list of all rows with invalid characters 2) Shouldn't the database prevent these rows from being entered in the first place? 3) I have backups of this database (using -Fc) and I noticed that on restore, this table is not restored because of this error. Is there a way to fix the existing backups, or tell the restore to ignore bad rows instead of erroring out the whole table? Thanks Sim
В списке pgsql-general по дате отправления: