Re: Optimize join selectivity estimation by not reading MCV stats for unique join attributes

Поиск

Список

Период

Сортировка

От	David Geier
Тема	Re: Optimize join selectivity estimation by not reading MCV stats for unique join attributes
Дата	14 ноября 2022 г. 12:19:18
Msg-id	16acb727-24e8-6453-1711-aae339802966@gmail.com обсуждение исходный текст
Ответ на	Re: Optimize join selectivity estimation by not reading MCV stats for unique join attributes (Tom Lane <tgl@sss.pgh.pa.us>)
Ответы	Re: Optimize join selectivity estimation by not reading MCV stats for unique join attributes
Список	pgsql-hackers

Дерево обсуждения

Hi Tom,
> There won't *be* any MCV stats for a column that ANALYZE perceives to
> be unique, so I'm not quite sure where the claimed savings comes from.
We save if one join attribute is unique while the other isn't. In that 
case stored MCV stats are read for the non-unique attribute but then 
never used. This is because MCV stats in join selectivity estimation are 
only used if they're present on both columns
> Please provide a concrete example.

A super simple case already showing a significant speedup is the 
following. The more ways to join two tables and the more joins overall, 
the higher the expected gain.

CREATE TABLE bar(col INT UNIQUE);
CREATE TABLE foo (col INT);
INSERT INTO foo SELECT generate_series(1, 1000000, 0.5);
SET default_statistics_target = 10000;
ANALYZE foo, bar;
\timing on
EXPLAIN SELECT * FROM foo, bar WHERE foo.col = bar.col;

Running the above query five times gave me average runtimes of:

- 0.62 ms without the patch and
- 0.48 ms with the patch.

--
David Geier
(ServiceNow)

В списке pgsql-hackers по дате отправления:

Предыдущее

От: "Hayato Kuroda (Fujitsu)"
Дата: 14 ноября 2022 г., 11:58:10
Сообщение: RE: Time delayed LR (WAS Re: logical replication restrictions)

Следующее

От: Richard Guo
Дата: 14 ноября 2022 г., 13:10:31
Сообщение: Re: A problem about join ordering

Вход в личный кабинет

Восстановление пароля

Подтверждение аккаунта

Изменение пароля

Re: Optimize join selectivity estimation by not reading MCV stats for unique join attributes

Предыдущее

Следующее