Re: [HACKERS] Bad n_distinct estimation; hacks suggested?

Поиск
Список
Период
Сортировка
От Josh Berkus
Тема Re: [HACKERS] Bad n_distinct estimation; hacks suggested?
Дата
Msg-id 200505031443.44859.josh@agliodbs.com
обсуждение исходный текст
Ответ на Re: [HACKERS] Bad n_distinct estimation; hacks suggested?  (Mischa Sandberg <mischa.sandberg@telus.net>)
Ответы Re: [HACKERS] Bad n_distinct estimation; hacks suggested?  (John A Meinel <john@arbash-meinel.com>)
Re: [HACKERS] Bad n_distinct estimation; hacks suggested?  (Mischa Sandberg <mischa.sandberg@telus.net>)
Список pgsql-performance
Mischa,

> Okay, although given the track record of page-based sampling for
> n-distinct, it's a bit like looking for your keys under the streetlight,
> rather than in the alley where you dropped them :-)

Bad analogy, but funny.

The issue with page-based vs. pure random sampling is that to do, for example,
10% of rows purely randomly would actually mean loading 50% of pages.  With
20% of rows, you might as well scan the whole table.

Unless, of course, we use indexes for sampling, which seems like a *really
good* idea to me ....

--
--Josh

Josh Berkus
Aglio Database Solutions
San Francisco

В списке pgsql-performance по дате отправления:

Предыдущее
От: Mischa Sandberg
Дата:
Сообщение: Re: [HACKERS] Bad n_distinct estimation; hacks suggested?
Следующее
От: John A Meinel
Дата:
Сообщение: Re: [HACKERS] Bad n_distinct estimation; hacks suggested?