Re: proposal : cross-column stats

Поиск

Список

Период

Сортировка

От	Tomas Vondra
Тема	Re: proposal : cross-column stats
Дата	13 декабря 2010 г. 01:16:27
Msg-id	4D0581FF.3080303@fuzzy.cz обсуждение исходный текст
Ответ на	Re: proposal : cross-column stats (Robert Haas <robertmhaas@gmail.com>)
Ответы	Re: proposal : cross-column stats (Robert Haas <robertmhaas@gmail.com>)
Список	pgsql-hackers

Дерево обсуждения

Dne 13.12.2010 03:00, Robert Haas napsal(a):
> Well, the question is what data you are actually storing.  It's
> appealing to store a measure of the extent to which a constraint on
> column X constrains column Y, because you'd only need to store
> O(ncolumns^2) values, which would be reasonably compact and would
> potentially handle the zip code problem - a classic "hard case" rather
> neatly.  But that wouldn't be sufficient to use the above equation,
> because there A and B need to be things like "column X has value x",
> and it's not going to be practical to store a complete set of MCVs for
> column X for each possible value that could appear in column Y.

O(ncolumns^2) values? You mean collecting such stats for each possible
pair of columns? Well, I meant something different.

The proposed solution is based on contingency tables, built for selected
groups of columns (not for each possible group). And the contingency
table gives you the ability to estimate the probabilities needed to
compute the selectivity. Or am I missing something?

regards
Tomas

В списке pgsql-hackers по дате отправления:

Предыдущее

От: Tomas Vondra
Дата: 13 декабря 2010 г., 01:08:52
Сообщение: Re: proposal : cross-column stats

Следующее

От: Andrew Dunstan
Дата: 13 декабря 2010 г., 01:24:38
Сообщение: Re: ALTER TABLE ... ADD FOREIGN KEY ... NOT ENFORCED

Вход в личный кабинет

Восстановление пароля

Подтверждение аккаунта

Изменение пароля

Re: proposal : cross-column stats

Предыдущее

Следующее