AllocSetEstimateChunkSpace()

Поиск

Список

Период

Сортировка

От	Jeff Davis
Тема	AllocSetEstimateChunkSpace()
Дата	25 марта 2020 г. 01:12:03
Msg-id	e74ab197868a0095edafe3bdcacf632ddb130a52.camel@j-davis.com обсуждение исходный текст
Ответы	Re: AllocSetEstimateChunkSpace()
Список	pgsql-hackers

Дерево обсуждения

Attached is a small patch that introduces a simple function,
AllocSetEstimateChunkSpace(), and uses it to improve the estimate for
the size of a hash entry for hash aggregation.

Getting reasonable estimates for the hash entry size (including the
TupleHashEntryData, the group key tuple, the per-group state, and by-
ref transition values) is important for multiple reasons:

* It helps the planner know when hash aggregation is likely to spill,
and how to cost it.

* It helps hash aggregation regulate itself by setting a group limit
(separate from the memory limit) to account for growing by-ref
transition values.

* It helps choose a good initial size for the hash table. Too large of
a hash table will crowd out space that could be used for the group keys
or the per-group state.

Each group requires up to three palloc chunks: one for the group key
tuple, one for the per-group states, and one for a by-ref transition
value. Each chunk can have a lot of overhead: in addition to the chunk
header (16 bytes overhead), it also rounds the value up to a power of
two (~25% overhead). So, it's important to account for this chunk
overhead.

I considered making it a method of a memory context, but the planner
will call this function before the hash agg memory context is created.
It seems to make more sense for this to just be an AllocSet-specific
function for now.

Regards,
    Jeff Davis

Вложения

estimatechunkspace.patch

В списке pgsql-hackers по дате отправления:

Вход в личный кабинет

Восстановление пароля

Подтверждение аккаунта

Изменение пароля

AllocSetEstimateChunkSpace()

Вложения