Performance tradeoff

From: Shawn Chisholm
Subject: Performance tradeoff
Date:
Msg-id: 71837C040963F748B9B94E123A28967805710B@mailserver.sandvine.com
Replies: Re: Performance tradeoff  (Josh Berkus <josh@agliodbs.com>)
List: pgsql-performance
Hi All,

I am wondering about the relative performance of "insert into table1 select distinct a,b from ..." and "insert into
table1 select a,b from ... group by a,b" when querying tables of different sizes (10K, 100K, 1s, 10s, 100s of millions
of rows).

The distinct form tends to be planned as a sort followed by unique, while the group by form tends to use a hash aggregate. Any opinions on which is better?
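
One way to compare the two forms on real data is to look at the plans and timings directly. The following is only a minimal sketch: the FROM clause in the queries above is elided, so the table name source_table is a hypothetical stand-in, and the plans actually chosen will depend on the PostgreSQL version, the statistics, and the memory settings.

```sql
-- source_table is a hypothetical name; substitute the real FROM clause.
-- EXPLAIN ANALYZE really executes the INSERT, so each run is wrapped
-- in a transaction that is rolled back afterwards.

BEGIN;
EXPLAIN ANALYZE
INSERT INTO table1
SELECT DISTINCT a, b FROM source_table;
ROLLBACK;

BEGIN;
EXPLAIN ANALYZE
INSERT INTO table1
SELECT a, b FROM source_table GROUP BY a, b;
ROLLBACK;
```

Running both at each of the table sizes of interest shows which plan node (sort plus unique, or hash aggregate) the planner picks and how long each takes.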

I can also change the schema to a certain extent, so would it be worthwhile to put indices on the queried tables (or
refactor them) in the hope that the distinct does an index scan instead of a sort? Would the query planner take
advantage of that?
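
The index idea can be checked the same way. This is a sketch under assumptions, not a recommendation: source_table and the index name are hypothetical, and whether the planner uses the index at all depends on the version, the table size, and the statistics.

```sql
-- Hypothetical table and index names.
CREATE INDEX source_table_a_b_idx ON source_table (a, b);
ANALYZE source_table;  -- refresh planner statistics after the change

-- Plain EXPLAIN shows the chosen plan without running the query.
EXPLAIN SELECT DISTINCT a, b FROM source_table;

-- For comparison, disable hash aggregation in this session only and
-- see which plan the planner falls back to for the GROUP BY form.
SET enable_hashagg = off;
EXPLAIN SELECT a, b FROM source_table GROUP BY a, b;
RESET enable_hashagg;
```

If the EXPLAIN output still shows a sort over a sequential scan, the index is not helping this query, and its maintenance cost on writes is not being repaid here.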

Thanks,

Shawn

