Re: [HACKERS] Much Ado About COUNT(*)

Поиск
Список
Период
Сортировка
От Csaba Nagy
Тема Re: [HACKERS] Much Ado About COUNT(*)
Дата
Msg-id 1105632432.14493.31.camel@localhost.localdomain
обсуждение исходный текст
Ответ на Re: [HACKERS] Much Ado About COUNT(*)  (Greg Stark <gsstark@mit.edu>)
Список pgsql-general
[snip]
> See:
>
>   http://www.jlcomp.demon.co.uk/faq/random.html
>
> I think the Oracle syntax looks like
>
>   SELECT * FROM foo SAMPLE (0.1)
>
> I don't think I would have picked this syntax but it seems like a better idea
> to copy the existing practice rather than invent a new one.
>
> There are some details, like what to do when there's a WHERE clause or joins.
> Oracle disallows joins entirely and I'm unclear what the best thing to do
> about where clauses would be.

The where clauses could be applied exactly as for a normal select, with
the sampling being just a pre-filtering condition for what rows to
consider.

If there would be a way to specify the table on which to apply the
sampling, then the whole construct could be replaced automatically by
the inline view the oracle link recommends.
I doubt there would be any benefit in sampling more than one table in a
query, it should just work to sample the biggest table, and join the
result with the others. Sampling is only useful for really big tables
anyway.

So the syntax above could be extended to:

SELECT * FROM foo SAMPLE (foo, 0.1)

and:

SELECT foo.*, bar.*
  FROM foo, bar
  WHERE foo.key = bar.key
  SAMPLE (foo, 0.1)

which means sample foo and join the result with bar.

All this makes sense from a user point of view, I wonder how big a PITA
is to implement it...

Cheers,
Csaba.




В списке pgsql-general по дате отправления:

Предыдущее
От: Tom Lane
Дата:
Сообщение: Re: Trace triggers
Следующее
От: Alvaro Herrera
Дата:
Сообщение: Re: Réf. : Re: Réf