Re: Extremely inefficient merge-join

Поиск

Список

Период

Сортировка

От	Marcin Gozdalik
Тема	Re: Extremely inefficient merge-join
Дата	18 марта 2021 г. 00:27:18
Msg-id	CADu1mROz6ZTspcHWgTZKi8zmS64wVy+QV-Wx_38NJFSX9QrA0A@mail.gmail.com обсуждение исходный текст
Ответ на	Re: Extremely inefficient merge-join (Tom Lane <tgl@sss.pgh.pa.us>)
Список	pgsql-performance

Дерево обсуждения

dir_current changes often, but is analyzed after significant changes, so effectively it's analyzed probably once an hour.

The approximate ratio of rows with volume_id=5 to the whole number of rows doesn't change (i.e. volume_id=5 will appear roughly in 1.5M-2M rows, total is around 750-800M rows).

dir_process is created once, analyzed and doesn't change later.

Assuming dir_process is the outer side in plans shown here has only duplicates - i.e. all rows have volume_id=5 in this example.

Do you think there is anything that could be changed with the query itself? Any hints would be appreciated.

śr., 17 mar 2021 o 20:47 Tom Lane <tgl@sss.pgh.pa.us> napisał(a):

Marcin Gozdalik <gozdal@gmail.com> writes:
> Sometimes Postgres will choose very inefficient plan, which involves
> looping many times over same rows, producing hundreds of millions or
> billions of rows:

Yeah, this can happen if the outer side of the join has a lot of
duplicate rows. The query planner is aware of that effect and will
charge an increased cost when it applies, so I wonder if your
statistics for the tables being joined are up-to-date.

regards, tom lane

Marcin Gozdalik

В списке pgsql-performance по дате отправления:

Предыдущее

От: Tom Lane
Дата: 17 марта 2021 г., 23:47:35
Сообщение: Re: Extremely inefficient merge-join

Следующее

От: Manish Lad
Дата: 18 марта 2021 г., 12:14:23
Сообщение: How do we hint a query to use index in postgre

Вход в личный кабинет

Восстановление пароля

Подтверждение аккаунта

Изменение пароля

Re: Extremely inefficient merge-join

Предыдущее

Следующее