Re: select distinct uses index scan vs full table scan
От | Tom Lane |
---|---|
Тема | Re: select distinct uses index scan vs full table scan |
Дата | |
Msg-id | 9562.1323806277@sss.pgh.pa.us обсуждение исходный текст |
Ответ на | select distinct uses index scan vs full table scan (Jon Nelson <jnelson+pgsql@jamponi.net>) |
Ответы |
Re: select distinct uses index scan vs full table scan
|
Список | pgsql-performance |
Jon Nelson <jnelson+pgsql@jamponi.net> writes: > I've got a 5GB table with about 12 million rows. > Recently, I had to select the distinct values from just one column. > The planner chose an index scan. The query took almost an hour. > When I forced index scan off, the query took 90 seconds (full table scan). Usually, we hear complaints about the opposite. Are you using nondefault cost settings? > The planner estimated 70,000 unique values when, in fact, there are 12 > million (the value for this row is *almost* but not quite unique). > What's more, despite bumping the statistics on that column up to 1000 > and re-analyzing, the planner now thinks that there are 300,000 unique > values. Accurate ndistinct estimates are hard, but that wouldn't have much of anything to do with this particular choice, AFAICS. > How can I tell the planner that a given column is much more unique > than, apparently, it thinks it is? 9.0 and up have ALTER TABLE ... ALTER COLUMN ... SET n_distinct. regards, tom lane
В списке pgsql-performance по дате отправления: