Multi-byte character case-folding
От | Thom Brown |
---|---|
Тема | Multi-byte character case-folding |
Дата | |
Msg-id | CAA-aLv5nFfHd72H97u=OnGEsXVn3s-JV-jzMr-HeUePQgX4cEA@mail.gmail.com обсуждение исходный текст |
Ответы |
Re: Multi-byte character case-folding
|
Список | pgsql-hackers |
Hi, At the moment, only single-byte characters in identifiers are case-folded, and multi-byte characters are not. For example, abĉDĚF is case-folded to "abĉdĚf". This can be referred to as "abĉdĚf" or "ABĉDĚF", but not "abĉděf" or "ABĈDĚF". downcase_identifier() has the following comment: /* * SQL99 specifies Unicode-aware case normalization, which we don't yet * have the infrastructure for. Instead we use tolower() to provide a * locale-aware translation. However, there are some locales where this * is not right either (eg, Turkish may do strange things with 'i' and * 'I'). Our current compromise is to use tolower() for characters with * the high bit set, as long as they aren't part of a multi-byte * character, and use an ASCII-only downcasing for 7-bit characters. */ So my question is, do we yet have the infrastructure to make case-folding consistent across all character widths? Thanks Thom
В списке pgsql-hackers по дате отправления: