Re: Parallel Scaling of a pgplsql problem

From	Yeb Havinga
Subject	Re: Parallel Scaling of a pgplsql problem
Date	26 April 2012, 03:49:39
Msg-id	4F98EFE8.5090609@gmail.com
In reply to	Re: Parallel Scaling of a pgplsql problem (Venki Ramachandran <venki_ramachandran@yahoo.com>)
List	pgsql-performance

On 2012-04-26 04:40, Venki Ramachandran wrote:

Thanks Tom, clock_timestamp() worked. Appreciate it! Sorry, I was hurrying to get this done at work and hence did not read through.

Can you comment on how you would solve the original problem? Even if I can get the 11 seconds down to 500 ms for one pair, running it for 300k pairs will take multiple hours. How can one write a combination of a bash script and PL/pgSQL code so as to use all 8 cores of a server? I am seeing that this just executes in one session/process.

You want to run a calculation on the cross product 'employee x employee'. If employee is partitioned into emp1, emp2, ..., emp8, the cross product is equal to the union of emp1 x employee, emp2 x employee, ..., emp8 x employee. Each of these 8 cross products on partitions can be executed in parallel. I'd look into dblink to execute each of the 8 cross products in parallel, and then union all of those results.

http://www.postgresql.org/docs/9.1/static/contrib-dblink-connect.html
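
To make the decomposition concrete, here is a minimal sketch in Python rather than SQL. It only illustrates that the union of the per-partition cross products equals the full cross product. The names `partition`, `score`, `cross_chunk` and `parallel_cross`, and the integer "employee ids", are made up for illustration. The threads merely stand in for the 8 separate database sessions you would open with dblink, so the sketch shows the splitting and merging, not an actual speedup.

```python
from concurrent.futures import ThreadPoolExecutor
from itertools import product


def partition(rows, n_parts):
    """Split rows into n_parts disjoint slices (round-robin)."""
    return [rows[i::n_parts] for i in range(n_parts)]


def score(a, b):
    """Hypothetical stand-in for the real per-pair calculation."""
    return a * b


def cross_chunk(chunk, rows):
    """Run the calculation for every pair in chunk x rows."""
    return [(a, b, score(a, b)) for a, b in product(chunk, rows)]


def parallel_cross(rows, n_parts=8):
    """Compute rows x rows as the union of n_parts partial cross products."""
    chunks = partition(rows, n_parts)
    # One worker per partition; in the database setting each worker
    # would be its own session (assumption: one dblink connection each).
    with ThreadPoolExecutor(max_workers=n_parts) as pool:
        parts = pool.map(cross_chunk, chunks, [rows] * n_parts)
        # Equivalent of UNION ALL over the partial results.
        return [row for part in parts for row in part]


employees = list(range(1, 21))  # made-up employee ids
result = parallel_cross(employees)
```

Because the partitions are disjoint and together cover the whole table, every pair is computed exactly once, which is why a plain UNION ALL (no duplicate elimination) is enough when merging.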

regards,
Yeb
