Обсуждение: Questionabl description in datatype.sgml

Поиск
Список
Период
Сортировка

Questionabl description in datatype.sgml

От
Tatsuo Ishii
Дата:
In "8.13.2. Encoding Handling"
  <para>   When using binary mode to pass query parameters to the server   and query results back to the client, no
characterset conversion   is performed, so the situation is different.  In this case, an   encoding declaration in the
XMLdata will be observed, and if it   is absent, the data will be assumed to be in UTF-8 (as required by   the XML
standard;note that PostgreSQL does not support UTF-16).   On output, data will have an encoding declaration
specifyingthe client encoding, unless the client encoding is   UTF-8, in which case it will be omitted.  </para>
 

In the first sentence shouldn't "no character set conversion" be "no
encoding conversion"? PostgreSQL is doing client/server encoding
conversion, rather than character set conversion.

Best regards,
--
Tatsuo Ishii
SRA OSS, Inc. Japan
English: http://www.sraoss.co.jp/index_en.php
Japanese:http://www.sraoss.co.jp



Re: Questionabl description in datatype.sgml

От
Tom Lane
Дата:
Tatsuo Ishii <ishii@postgresql.org> writes:
> In "8.13.2. Encoding Handling"
>    <para>
>     When using binary mode to pass query parameters to the server
>     and query results back to the client, no character set conversion
>     is performed, so the situation is different.  In this case, an
>     encoding declaration in the XML data will be observed, and if it
>     is absent, the data will be assumed to be in UTF-8 (as required by
>     the XML standard; note that PostgreSQL does not support UTF-16).
>     On output, data will have an encoding declaration
>     specifying the client encoding, unless the client encoding is
>     UTF-8, in which case it will be omitted.
>    </para>

> In the first sentence shouldn't "no character set conversion" be "no
> encoding conversion"? PostgreSQL is doing client/server encoding
> conversion, rather than character set conversion.

I think the text is treating "character set conversion" as meaning
the same thing as "encoding conversion"; certainly I've never seen
any place in our docs that draws a distinction between those terms.
If you think there is a difference, maybe we need to define those
terms somewhere.
        regards, tom lane



Re: Questionabl description in datatype.sgml

От
Bruce Momjian
Дата:
On Sat, Jun 18, 2016 at 11:58:58AM -0400, Tom Lane wrote:
> Tatsuo Ishii <ishii@postgresql.org> writes:
> > In "8.13.2. Encoding Handling"
> >    <para>
> >     When using binary mode to pass query parameters to the server
> >     and query results back to the client, no character set conversion
> >     is performed, so the situation is different.  In this case, an
> >     encoding declaration in the XML data will be observed, and if it
> >     is absent, the data will be assumed to be in UTF-8 (as required by
> >     the XML standard; note that PostgreSQL does not support UTF-16).
> >     On output, data will have an encoding declaration
> >     specifying the client encoding, unless the client encoding is
> >     UTF-8, in which case it will be omitted.
> >    </para>
> 
> > In the first sentence shouldn't "no character set conversion" be "no
> > encoding conversion"? PostgreSQL is doing client/server encoding
> > conversion, rather than character set conversion.
> 
> I think the text is treating "character set conversion" as meaning
> the same thing as "encoding conversion"; certainly I've never seen
> any place in our docs that draws a distinction between those terms.
> If you think there is a difference, maybe we need to define those
> terms somewhere.

Uh, I think Unicode is a character set, and UTF8 is an encoding.  I
think Tatsuo is right here.

--  Bruce Momjian  <bruce@momjian.us>        http://momjian.us EnterpriseDB
http://enterprisedb.com

+ As you are, so once was I. As I am, so you will be. +
+                     Ancient Roman grave inscription +



Re: Questionabl description in datatype.sgml

От
Tatsuo Ishii
Дата:
> On Sat, Jun 18, 2016 at 11:58:58AM -0400, Tom Lane wrote:
>> Tatsuo Ishii <ishii@postgresql.org> writes:
>> > In "8.13.2. Encoding Handling"
>> >    <para>
>> >     When using binary mode to pass query parameters to the server
>> >     and query results back to the client, no character set conversion
>> >     is performed, so the situation is different.  In this case, an
>> >     encoding declaration in the XML data will be observed, and if it
>> >     is absent, the data will be assumed to be in UTF-8 (as required by
>> >     the XML standard; note that PostgreSQL does not support UTF-16).
>> >     On output, data will have an encoding declaration
>> >     specifying the client encoding, unless the client encoding is
>> >     UTF-8, in which case it will be omitted.
>> >    </para>
>> 
>> > In the first sentence shouldn't "no character set conversion" be "no
>> > encoding conversion"? PostgreSQL is doing client/server encoding
>> > conversion, rather than character set conversion.
>> 
>> I think the text is treating "character set conversion" as meaning
>> the same thing as "encoding conversion"; certainly I've never seen
>> any place in our docs that draws a distinction between those terms.
>> If you think there is a difference, maybe we need to define those
>> terms somewhere.
> 
> Uh, I think Unicode is a character set, and UTF8 is an encoding.  I
> think Tatsuo is right here.

Yes, a character set is different from an encoding. I though it's a
common understanding among people.

Best regards,
--
Tatsuo Ishii
SRA OSS, Inc. Japan
English: http://www.sraoss.co.jp/index_en.php
Japanese:http://www.sraoss.co.jp



Re: Questionabl description in datatype.sgml

От
Bruce Momjian
Дата:
On Fri, Jun 24, 2016 at 07:27:24AM +0900, Tatsuo Ishii wrote:
> > On Sat, Jun 18, 2016 at 11:58:58AM -0400, Tom Lane wrote:
> >> Tatsuo Ishii <ishii@postgresql.org> writes:
> >> > In "8.13.2. Encoding Handling"
> >> >    <para>
> >> >     When using binary mode to pass query parameters to the server
> >> >     and query results back to the client, no character set conversion
> >> >     is performed, so the situation is different.  In this case, an
> >> >     encoding declaration in the XML data will be observed, and if it
> >> >     is absent, the data will be assumed to be in UTF-8 (as required by
> >> >     the XML standard; note that PostgreSQL does not support UTF-16).
> >> >     On output, data will have an encoding declaration
> >> >     specifying the client encoding, unless the client encoding is
> >> >     UTF-8, in which case it will be omitted.
> >> >    </para>
> >>
> >> > In the first sentence shouldn't "no character set conversion" be "no
> >> > encoding conversion"? PostgreSQL is doing client/server encoding
> >> > conversion, rather than character set conversion.
> >>
> >> I think the text is treating "character set conversion" as meaning
> >> the same thing as "encoding conversion"; certainly I've never seen
> >> any place in our docs that draws a distinction between those terms.
> >> If you think there is a difference, maybe we need to define those
> >> terms somewhere.
> >
> > Uh, I think Unicode is a character set, and UTF8 is an encoding.  I
> > think Tatsuo is right here.
>
> Yes, a character set is different from an encoding. I though it's a
> common understanding among people.

Fixed with the attached applied patch.

--
  Bruce Momjian  <bruce@momjian.us>        http://momjian.us
  EnterpriseDB                             http://enterprisedb.com

+ As you are, so once was I. As I am, so you will be. +
+                     Ancient Roman grave inscription +

Вложения