Re: Abbreviated keys for Numeric

Поиск

Список

Период

Сортировка

От	Peter Geoghegan
Тема	Re: Abbreviated keys for Numeric
Дата	21 февраля 2015 г. 19:33:34
Msg-id	CAM3SWZQgEAG0ij2JzAJtf-GjAkXa6Q5OWdEijJiNC6rxq3YkpQ@mail.gmail.com обсуждение исходный текст
Ответ на	Re: Abbreviated keys for Numeric (Peter Geoghegan <pg@heroku.com>)
Ответы	Re: Abbreviated keys for Numeric
Список	pgsql-hackers

Дерево обсуждения

On Sat, Feb 21, 2015 at 10:57 AM, Peter Geoghegan <pg@heroku.com> wrote:
> On Fri, Feb 20, 2015 at 9:18 PM, Tomas Vondra
> <tomas.vondra@2ndquadrant.com> wrote:
>> The gains for text are also very nice, although in this case that only
>> happens for the smallest scale (1M rows), and for larger scales it's
>> actually slower than current master :-(
>
> That's odd. I have a hard time thinking of why the datum sort patch
> could be at fault, though.

Oh, wait. For queries like this, which I now see in your spreadsheet:

select * from (select * from stuff_text order by randtxt offset
100000000000) foo

There is no reason to think that either patch will improve things over
master branch tip's performance. This doesn't use a datum tuplesort.
So that explains it, I think. Although I cannot easily explain the
disparity in performance between 1M and 5M sized sets for this query:

select count(distinct randtxt) from stuff_text

You did make sure that the queries didn't spill to disk, right? Or
that they did so consistently, at least.

There is also no reason to think that integer or float datum sorts
will be accelerated, because they could always use sortsupport - the
datum sort patch is only about making it also possible to also use
abbreviation for opclasses that support it, like text (and,
eventually, numeric).
-- 
Peter Geoghegan

В списке pgsql-hackers по дате отправления:

Вход в личный кабинет

Восстановление пароля

Подтверждение аккаунта

Изменение пароля

Re: Abbreviated keys for Numeric