parallelizing subplan execution (was: explain and PARAM_EXEC)

Поиск
Список
Период
Сортировка
От Dimitri Fontaine
Тема parallelizing subplan execution (was: explain and PARAM_EXEC)
Дата
Msg-id m2tytc3uga.fsf_-_@hi-media.com
обсуждение исходный текст
Ответ на Re: explain and PARAM_EXEC  (Robert Haas <robertmhaas@gmail.com>)
Ответы Re: parallelizing subplan execution (was: explain and PARAM_EXEC)  (Robert Haas <robertmhaas@gmail.com>)
Список pgsql-hackers
Robert Haas <robertmhaas@gmail.com> writes:
> On Sat, Feb 20, 2010 at 6:57 AM, Dimitri Fontaine
> <dfontaine@hi-media.com> wrote:
>> How much does this stuff is dependent on the current state of the
>> backend?
>
> A whole lot.

Bad news.

>> Ok that's a far stretch from the question at hand, but would that be a
>> plausible approach to have parallel queries in PostgreSQL ?
>
> This is really a topic for another thread, but at 100,000 feet it
> seems to me that the hardest question is - how will you decide which
> operations to parallelize in the first place?  Actually making it
> happen is really hard, too, of course, but even to get that that point
> you have to have some model for what types of operations it makes
> sense to parallelize and how you're going to decide when it's a win.

My naive thoughts would be to add some cost parameters. The fact to
fork() another backend first, then model for each supported subplan (we
will want to add more, or maybe have a special rendez-vous-materialise
node) some idea of the data exchange cost.

Now the planner would as usual try to find the less costly plan, and
will be able to compare plans with and without distributing the work.

Overly naive ?

Regards,
--
dim


В списке pgsql-hackers по дате отправления:

Предыдущее
От: Robert Haas
Дата:
Сообщение: Re: explain and PARAM_EXEC
Следующее
От: Tom Lane
Дата:
Сообщение: Re: explain and PARAM_EXEC